I thought I had a reasonable handle on regexes but occasionally problems shatter my confidence. I am trying to match Roman Numerals in the range i to xxxix. The numerals maybe preceded by a left bracket and maybe followed by i. a period (ii) a right bracket or (iii). a right bracket followed by a period. The entire pattern is always terminated with a space.
It all appears to work ok except when the string being searched comprises just a single space, whence it is (incorrectly IMHO) matched.
The part of the pattern to the left of the cluster that contains the space that is matched (ie the roman numeral cluster) has a quantifier of {1,1}, so I really don't undertand, if there are no valid characters in the string before the space, why the space should be matched.
Any help would be greatly appreciated.
This code outputs Matched ' ' in ' '
use strict; #use re 'debug'; { my ($rv, $linestr, $pattern); $linestr = ' '; $pattern = '^\({0,1}(((ix)|(iv))|(x{0,3}((ix)|(iv)))|(x{0,3}(v{0,1}i +{0,3}))){1,1}((\)\. )|(\) )|(\. )|( )){1,1}'; if( $linestr =~ m/$pattern/i) { print ("Matched '$&' in '$linestr'\n"); } else { print ("Not matched\n"); } }
In reply to Pattern Matching by davidas
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |