Regular Expression Translation Help!

Anonymous Monk has asked for the wisdom of the Perl Monks concerning the following question:

Re: Regular Expression Translation Help! by kennethk (Abbot) on Nov 30, 2010 at 20:04 UTC
Using YAPE::Regex::Explain, the code

`use YAPE::Regex::Explain; print YAPE::Regex::Explain->new(qr/(^USA|BRA|CAN)?(.*?)(XY)?$/)->explain();`

outputs

    The regular expression:

    (?-imsx:(^USA|BRA|CAN)?(.*?)(XY)?$)

    matches as follows:

    NODE                     EXPLANATION
    ----------------------------------------------------------------------
    (?-imsx:                 group, but do not capture (case-sensitive)
                             (with ^ and $ matching normally) (with . not
                             matching \n) (matching whitespace and #
                             normally):
    ----------------------------------------------------------------------
      (                        group and capture to \1 (optional
                               (matching the most amount possible)):
    ----------------------------------------------------------------------
        ^                        the beginning of the string
    ----------------------------------------------------------------------
        USA                      'USA'
    ----------------------------------------------------------------------
       |                        OR
    ----------------------------------------------------------------------
        BRA                      'BRA'
    ----------------------------------------------------------------------
       |                        OR
    ----------------------------------------------------------------------
        CAN                      'CAN'
    ----------------------------------------------------------------------
      )?                       end of \1 (NOTE: because you are using a
                               quantifier on this capture, only the LAST
                               repetition of the captured pattern will be
                               stored in \1)
    ----------------------------------------------------------------------
      (                        group and capture to \2:
    ----------------------------------------------------------------------
        .*?                      any character except \n (0 or more times
                                 (matching the least amount possible))
    ----------------------------------------------------------------------
      )                        end of \2
    ----------------------------------------------------------------------
      (                        group and capture to \3 (optional
                               (matching the most amount possible)):
    ----------------------------------------------------------------------
        XY                       'XY'
    ----------------------------------------------------------------------
      )?                       end of \3 (NOTE: because you are using a
                               quantifier on this capture, only the LAST
                               repetition of the captured pattern will be
                               stored in \3)
    ----------------------------------------------------------------------
      $                        before an optional \n, and the end of the
                               string
    ----------------------------------------------------------------------
    )                        end of grouping
    ----------------------------------------------------------------------

This is then fed into the Conditional Operator (? :), selecting either the second group (`(.*?)`) if it matched or the whole string if it didn't. Finally, this resultant string is used as a symbolic reference. Good luck.

I suspect that the author intended `(^USA|BRA|CAN)` to actually be `^(USA|BRA|CAN)`.
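To make the capture groups and the conditional concrete, here is a minimal sketch. It assumes the pattern is `(^USA|BRA|CAN)?(.*?)(XY)?$`, that is, with plain `|` alternation and the lazy `.*?` that the explanation describes. The helper name `explain_match` and the sample strings are hypothetical, and the sketch returns the selected string rather than dereferencing it as a symbolic reference.

```perl
use strict;
use warnings;

# Hypothetical helper: returns the three captures followed by the value
# that the conditional operator would select.
sub explain_match {
    my ($acc) = @_;

    # In list context a successful match returns ($1, $2, $3);
    # a failed match returns the empty list.
    my @caps = $acc =~ /(^USA|BRA|CAN)?(.*?)(XY)?$/;

    # Same choice as the original ternary: $2 on success, else the input.
    my $selected = @caps ? $caps[1] : $acc;

    return (@caps[0 .. 2], $selected);
}

for my $acc (qw(USA123XY CAN7 456)) {
    my @r = map { defined $_ ? $_ : 'undef' } explain_match($acc);
    print "$acc: \$1=$r[0] \$2=$r[1] \$3=$r[2] selected=$r[3]\n";
}
```

Groups that do not take part in the match come back as `undef`, which is why the loop maps them to the string `undef` before printing.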
Re: Regular Expression Translation Help! by roboticus (Chancellor) on Nov 30, 2010 at 20:13 UTC
You already got a nice answer from kennethk. I just wanted to add that you may want to move the caret left one position, though: currently USA must be at the start of the string, but BRA and CAN can be anywhere.

...roboticus

When your only tool is a hammer, all problems look like your thumb.
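The precedence point can be checked with the alternation on its own. This is a small sketch, not the original statement: the sub names and test strings are made up for illustration, and the alternation is deliberately isolated from the optional group and the `(.*?)` that follow it in the full pattern.

```perl
use strict;
use warnings;

# Alternation has the lowest precedence, so /^USA|BRA|CAN/ parses as
# (^USA) | (BRA) | (CAN): only the first branch is anchored.
sub matches_unanchored { return $_[0] =~ /^USA|BRA|CAN/ ? 1 : 0 }

# With the caret outside the group, every branch is anchored.
sub matches_anchored { return $_[0] =~ /^(?:USA|BRA|CAN)/ ? 1 : 0 }

for my $s ("USA-1", "x-USA", "x-BRA", "x-CAN") {
    printf "%-6s unanchored=%d anchored=%d\n",
        $s, matches_unanchored($s), matches_anchored($s);
}
```

Only `x-BRA` and `x-CAN` should differ between the two columns, since `USA` is anchored in both forms.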
Re^2: Regular Expression Translation Help! by Anonymous Monk on Nov 30, 2010 at 20:43 UTC
Would this be OK, or is the right thing to move the caret left, outside the group? Could the first way work?

`${ ($acc =~ /(^USA|^BRA|^CAN)?(.*?)(IN)?$/i) ? $2 : $acc }`

or

`${ ($acc =~ /^(USA|BRA|CAN)?(.*?)(IN)?$/i) ? $2 : $acc }`
Re^3: Regular Expression Translation Help! by kennethk (Abbot) on Nov 30, 2010 at 20:47 UTC
The results in the two cases are identical since `^` is a zero-width match. I personally think the second option is better because it is clearer and has fewer characters, hence is less sensitive to typos, but that is wholly subjective.
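A quick way to compare the two variants is to run both over a few sample strings. The sketch below assumes the patterns use plain `|` alternation and a lazy `(.*?)`. The names `pick_a` and `pick_b` and the inputs are hypothetical, the selected string is returned instead of being dereferenced, and only simple single-line inputs are tried, so this is a spot check rather than a proof.

```perl
use strict;
use warnings;

# Variant with a caret on each branch inside the group.
sub pick_a {
    my ($acc) = @_;
    my @caps = $acc =~ /(^USA|^BRA|^CAN)?(.*?)(IN)?$/i;
    return @caps ? $caps[1] : $acc;   # $2 on success, else the input
}

# Variant with a single caret outside the group.
sub pick_b {
    my ($acc) = @_;
    my @caps = $acc =~ /^(USA|BRA|CAN)?(.*?)(IN)?$/i;
    return @caps ? $caps[1] : $acc;   # $2 on success, else the input
}

for my $acc (qw(USA42IN bra7 xBRA9 plain)) {
    printf "%-8s a=%s b=%s\n", $acc, pick_a($acc), pick_b($acc);
}
```

Note that `plain` loses its trailing `in` under both variants, because the `/i` flag lets `(IN)?` match it case-insensitively.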
Re^4: Regular Expression Translation Help! by Anonymous Monk on Nov 30, 2010 at 21:02 UTC
Re^5: Regular Expression Translation Help! by kennethk (Abbot) on Nov 30, 2010 at 21:23 UTC
Some notes below your chosen depth have not been shown here
Re^5: Regular Expression Translation Help! by AnomalousMonk (Archbishop) on Dec 01, 2010 at 00:58 UTC
Re: Regular Expression Translation Help! by samarzone (Pilgrim) on Dec 01, 2010 at 09:32 UTC
This statement simply removes one of "USA", "BRA" or "CAN" from the start of the string and "XY" from the end of the string, if found.