comment on

(Straying a bit from the OP, but this is fun, no? And anyway, I rather expect this may be interesting/useful for him too).

An alternative (in postgresql) would be to use regex-index, which can be used when the submitted search-string or regex is anchored:

select count(*) from azjunk6; -- 1 million rows random data:
  count  
---------
 1000000
(1 row)

-- without index:
 select * from azjunk6 where txt ~ '^car[sz]';

                                      txt                             
+           
----------------------------------------------------------------------
+------------
 carsxbutsvamedynximrftmimgtzirtuorik lunamb qpjvwmixlxpmcu  mm  rzotj
+jnfxr syfrj
 carzfhndjznvpgcpwqb fp bqpljspqqpzfbbswefzs pjoocqztqkjxyvbr qalcfzme
+bezz ftmyi 
 carziicmi zzzvt beqsupgdwkhdg luvvmhhay bj b r soaiyfftiqgq hs brdzaf
+dztmtvfvrdn
 carziogaizohcqcphs ksucyeod q yvfallob  pctvmwplm igzsqalyy dqsjpiikx
+wyyxesenbeq
 carzw rcfwlqcweao jzeyxkchgc g vyvujtbsbeiewj inuelmldsa mpjevzmo  pc
+pwi  kfajug
 carzxrk qyk palimcwokbw hbdcsmxehcsnrop prrokygyi ssngegzksrzvged cu 
+oxr yozt ca
(6 rows)

Time: 1147.420 ms

-- now make a text_pattern_ops index:

 create index azjunk6_text_pattern_ops_idx on azjunk6 (txt text_patter
+n_ops);
 Time: 7282.579 ms

-- with index:
 select * from azjunk6 where txt ~ '^car[sz]';

                                      txt                             
+           
----------------------------------------------------------------------
+------------
 carsxbutsvamedynximrftmimgtzirtuorik lunamb qpjvwmixlxpmcu  mm  rzotj
+jnfxr syfrj
 carzfhndjznvpgcpwqb fp bqpljspqqpzfbbswefzs pjoocqztqkjxyvbr qalcfzme
+bezz ftmyi 
 carziicmi zzzvt beqsupgdwkhdg luvvmhhay bj b r soaiyfftiqgq hs brdzaf
+dztmtvfvrdn
 carziogaizohcqcphs ksucyeod q yvfallob  pctvmwplm igzsqalyy dqsjpiikx
+wyyxesenbeq
 carzw rcfwlqcweao jzeyxkchgc g vyvujtbsbeiewj inuelmldsa mpjevzmo  pc
+pwi  kfajug
 carzxrk qyk palimcwokbw hbdcsmxehcsnrop prrokygyi ssngegzksrzvged cu 
+oxr yozt ca
(6 rows)

Time: 12.524 ms  -->  100x faster
[download]

(It can be handy to have both a 'normal' btree index *and* such a text_pattern_ops regex index.)

See also: PostgreSQL index opclasses

You can get another interesting indextype from pg_trgm, a postgresql extension. This will give you not indexed regexen but indexed trigrams: PostgreSQL pg_trgm extension. (disadvantage: large index-size)

And FWIW: in 9.2devel, there is work ongoing to make it possible to combine the two: regexed trigram indexes...

In reply to Re^6: same query, different execution, different performance (Pg's text_pattern_ops) by erix
in thread same query, different execution, different performance by punkish

Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!

Titles consisting of a single word are discouraged, and in most cases are disallowed outright.

Read Where should I post X? if you're not absolutely sure you're posting in the right place.

Please read these before you post! —

Posts may use any of the Perl Monks Approved HTML tags:

a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr

You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)

	For:		Use:
	&		`&`
	<		`<`
	>		`>`
	[		`[`
	]		`]`

Link using PerlMonks shortcuts! What shortcuts can I use for linking?

See Writeup Formatting Tips and other pages linked from there for more info.