Newbie RegEX question

jmaya has asked for the wisdom of the Perl Monks concerning the following question:

Replies are listed 'Best First'.
Re: Newbie RegEX question by suaveant (Parson) on May 13, 2003 at 14:25 UTC
`$file =~ /^(\d+)([A-Z][a-z]+)([A-Z][a-z]+)\.(.*)$/; print "Serial : $1\n"; print "Name : $2\n"; print "Category : $3\n"; print "Extension: $4\n"; #though... you may want to change [a-z] to include any other possible +chars` [download] Untested, but should work... - Ant - Some of my best work - (1 2 3)	[reply] [d/l]
Re: Newbie RegEX question by jdporter (Paladin) on May 13, 2003 at 14:26 UTC
Something like this: `my( $serial, $name, $category ) = $filename =~ /^(\d+)([A-Z][a-z])([A-Z][a-z])/;` [download] Or if you want to use the POSIX character classes: `my( $serial, $name, $category ) = $filename =~ /^(\d+)([[:upper:]][[:lower:]])([[:upper:]][[:lower:]])/;` [download] Updated: Normally I don't update my nodes, but Aristotle makes an important point below, and I'd be afraid that someone might cargo-cult my original code, which had `[:lower:]` instead of `[[:lower:]]`, etc. jdporter The 6th Rule of Perl Club is -- There is no Rule #6.	[reply] [d/l] [select]
Re^2: Newbie RegEX question (posix charclass syntax) by Aristotle (Chancellor) on May 13, 2003 at 14:57 UTC
Have you ever used them? The character classes you use will any of the characters in `:epru` and `:elorw` respectively. POSIX classes are used like this: `/^(\d+)([[:upper:]][[:lower:]])([[:upper:]][[:lower:]])/;` [download] Makeshifts last the longest.	[reply] [d/l]
Re: Newbie RegEX question by broquaint (Abbot) on May 13, 2003 at 14:27 UTC
Assuming your data is as simple as that provided `$_ = '001WhitePottery.jpg'; my($serial, $name, $cat) = m< ^ (\d+) ([A-Z][a-z]+) ([A-Z][a-z]+) >x; print "($serial) Serial\n", "($name) Name of Product\n", "($cat) Category\n"; ___output__ (001) Serial (White) Name of Product (Pottery) Category` [download] See. `perlre` and `perlop` for more info. HTH `_________ broquaint`	[reply] [d/l]
Re: Re: Newbie RegEX question by Limbic~Region (Chancellor) on May 13, 2003 at 15:07 UTC
broquaint, I agree - ".. the data is as simple as that provided". I would only say that it might be worthwhile to create a catch bucket for the purpose of refining the RE over time to catch more and more a-typical cases. `my @bitbucket; $_ = '001WhitePottery.jpg'; if (m< ^ (\d+) ([A-Z][a-z]+) ([A-Z][a-z]+) >x) { my($serial, $name, $cat) = ($1, $2, $3); print "($serial) Serial\n", "($name) Name of Product\n", "($cat) Category\n"; } else { push @bitbucket , $_; } print "The following files didn't match rule - please fix"; print "$_\n" foreach (@bitbucket);` [download] I only point this out as the OP claimed newbie status. Cheers - L~R	[reply] [d/l]