use regex to split sentence

dyno has asked for the wisdom of the Perl Monks concerning the following question:

Replies are listed 'Best First'.
Re: use regex to split sentence by jmcnamara (Monsignor) on Nov 01, 2002 at 09:37 UTC
The following regex should do what you require. It uses a zero-width positive look-ahead, as explained in perlre: `@match = /X\d.+?(?=X\d|$)/g;` Also, following on from jryan's idea, here is a version that uses split: `$_ = "X1aaX2bbX3heeX4...X5loveUX6XXXX7X8" ; my @match = split /(X\d)/; print "String: " . $_; foreach my $word (@match){ print $word; print "\n" unless $word =~ /X\d/; }` -- John.
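A minimal sketch of the look-ahead match, assuming the sample string quoted elsewhere in this thread; the variable name `$text` is illustrative and not from the original post:

```perl
use strict;
use warnings;

# Sample string from the thread; $text is an illustrative name.
my $text = "X1aaX2bbX3heeX4...X5loveU";

# Match "X<digit>", then as few characters as possible, stopping just
# before the next "X<digit>" or the end of the string. The look-ahead
# itself consumes nothing, so the next match starts at that "X<digit>".
my @match = $text =~ /X\d.+?(?=X\d|$)/g;

print "$_\n" for @match;
```

Note that `.+?` needs at least one character after the `X<digit>` marker, so a marker immediately followed by another marker is absorbed into the same match.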
Re: Re: use regex to split sentence by jryan (Vicar) on Nov 01, 2002 at 18:17 UTC
Or, improving on both of ours, just: `@match = split (/(?=X\d)/,$_);` For some silly reason last night, I thought that split "kept" the part that it split on, forgetting that the trick to do that was to use a lookahead. :)
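As a rough illustration of why the look-ahead split keeps the markers, here is a small sketch using the thread's sample string (the names `$text` and `@parts` are made up for the example):

```perl
use strict;
use warnings;

my $text = "X1aaX2bbX3heeX4...X5loveU";

# The pattern matches the empty position just before each "X<digit>",
# so split cuts the string there without discarding any characters.
# A zero-width match at the very start produces no leading empty field.
my @parts = split /(?=X\d)/, $text;

print join("|", @parts), "\n";
```

Because the separator is zero-width, joining the pieces back together with an empty string reproduces the original input.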
Re: Re: use regex to split sentence by kelan (Deacon) on Nov 02, 2002 at 05:08 UTC
In Perl6: `### <Perl6> @match = m:e/ X \d : .+? <before X \d | $ > /; ### </Perl6>` I do think they should give shortened versions of the <before > and <after > assertions. Oh well, maybe they'll throw it in. (Note for onlookers: The colon within the regex is thrown in for a small bit of engine optimization, and actually also helps in detecting malformed strings.) kelan Perl6 Grammar Student
Re: use regex to split sentence by jryan (Vicar) on Nov 01, 2002 at 07:03 UTC
It's much easier to use split: `$_ = "X1aaX2bbX3heeX4...X5loveU"; @match = split (/X\d/,$_);`
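For reference, a short sketch of what this plain split returns on the sample string (`$text` and `@fields` are illustrative names). The separator text is thrown away, and since the string begins with a separator, the first field is empty:

```perl
use strict;
use warnings;

my $text = "X1aaX2bbX3heeX4...X5loveU";

# Each "X<digit>" is treated as a separator and is not returned.
my @fields = split /X\d/, $text;

# Show every field in brackets so the empty leading one is visible.
print "[$_]\n" for @fields;
```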
Re: Re: use regex to split sentence by dyno (Initiate) on Nov 01, 2002 at 07:15 UTC
That's not what I want. It gives: `aa bb hee ... loveU` What I really want to know is what pattern can match the result I expected, shown below: `X1aa X2bb X3hee X4... X5loveU`
Re: Re: Re: use regex to split sentence by djantzen (Priest) on Nov 01, 2002 at 07:56 UTC
Try this: `@match = /X\d[^X]+/g;` Basically it slurps up everything that's not an 'X' after matching 'X\d'.
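One caveat worth sketching: the negated character class stops at any 'X', not only at an 'X' followed by a digit. The input below is a hypothetical string invented for this comparison (it is not from the thread), run against both this pattern and the look-ahead pattern posted elsewhere in the thread:

```perl
use strict;
use warnings;

# Hypothetical input whose first segment contains a bare "X".
my $text = "X1aXbX2cc";

# Negated class: the run after "X1" ends at the first "X" of any kind,
# so the "Xb" part is dropped.
my @by_class = $text =~ /X\d[^X]+/g;

# Look-ahead: the run ends only before "X<digit>" or the end of string.
my @by_lookahead = $text =~ /X\d.+?(?=X\d|$)/g;

print "class:      @by_class\n";
print "look-ahead: @by_lookahead\n";
```

If the data never contains an 'X' outside the markers, both patterns should give the same result and the character class is the simpler choice.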
Re: Re: Re: Re: use regex to split sentence by dyno (Initiate) on Nov 01, 2002 at 08:58 UTC
Re: Re: Re: Re: use regex to split sentence by dyno (Initiate) on Nov 01, 2002 at 09:01 UTC