Re^4: How to match more than 32766 times in regex?

Replies are listed 'Best First'.
Re^5: How to match more than 32766 times in regex? by karlgoethebier (Abbot) on Dec 02, 2015 at 09:55 UTC
"...the algorithm needs to be coded" It seems like someone did it already: Algorithm::NeedlemanWunsch. Regards, Karl «The Crux of the Biscuit is the Apostrophe»	[reply]
Re^6: How to match more than 32766 times in regex? by FreeBeerReekingMonk (Deacon) on Dec 02, 2015 at 23:36 UTC
wow... totally missed that one. Thanks for pointing that one out. $perl_coolness_factor++;	[reply]
Re^5: How to match more than 32766 times in regex? by rsFalse (Chaplain) on Dec 01, 2015 at 22:26 UTC
Ah yes, in this node Re^2: Complex regular subexpression recursion limit I didn't get an answer :/ . Today I was solving another problem (and encountered same limitation). Full problem was: given a string (up to 1e5 length) consisting of '0' and '1', answer what is the length of the longest alternating subsequence if you are able to choose and invert one substring. For example, given a string '100111', I can invert substring from 3rd to 4th character ( `substr $line, 2, 2, (substr $line, 2, 2) =~ y/01/10/r` ), and then string become '101011' and has alternating subsequence (indexes: 0,1,2,3,4 or 0,1,2,3,5). I wanted to solve that problem with regexes (I knew that I can solve it other way), so I tried to count /1+/ and /0+/ (this is the answer of longest alternating subsequence if no inversions are made). I thought that I can do: `$line =~ y/1/,/; $len = split /\b/, $line;` [download] , but I decided to stay with zeroes and ones, and wrote `() = $line =~ /(.)\1/g` (as I shown). Later I add to $len: `/(.)\1\1\|(.)\2.(.)\3/ + /(.)\1/`, because each regex if succedes it gives +1 to the possible length of subsequence after one inversion. I often try to solve problems from competitive programming online sites or sites like projecteuler.net and I practise do it with Perl. After I used to calc all the sum: `$len = + (() = /(.)\1\1\1\1/g) + /(.)\1\1\|(.)\2....(.)\3/ + /(.)\1/` [download] - it consumed too much time when solving input line '01' x 5e4; upd: was bad example with reversion, now fixed to inversion.	[reply] [d/l] [select]
Re^6: How to match more than 32766 times in regex? by Anonymous Monk on Dec 02, 2015 at 02:12 UTC
`$len = + (() = /(.)\1\1\1\1/g) + /(.)\1\1\|(.)\2....(.)\3/ + /(.)\1/` [download] You know, that doesn't make any sense whatsoever.	[reply] [d/l]
Re^7: How to match more than 32766 times in regex? by Anonymous Monk on Dec 02, 2015 at 03:28 UTC
After some pondering... Is this what you tried to do: `use strict; use warnings; my @strs = ( '010111', '0' x 1_000_000, '01' x 1_000_000, '011' x 1_000_000, ( '01' x 1_000_000 ) . '111', ); for my $str (@strs) { my $len = ( () = $str =~ m{ 0+ \| 1+ }xg ) + ( $str =~ m{ 000 \| 111 \| (.)\1 .* (.)\2 }x ? 2 : 0 ); print $len, "\n"; }` [download] (Perls regex optimizer is pretty smart about the second regex, btw!)	[reply] [d/l]
Re^8: How to match more than 32766 times in regex? by Anonymous Monk on Dec 02, 2015 at 04:06 UTC
Re^9: How to match more than 32766 times in regex? by Anonymous Monk on Dec 02, 2015 at 06:21 UTC
Re^8: How to match more than 32766 times in regex? by rsFalse (Chaplain) on Dec 02, 2015 at 16:58 UTC
Re^9: How to match more than 32766 times in regex? by Anonymous Monk on Dec 02, 2015 at 17:39 UTC


Syntactic Confectionery Delight
	PerlMonks