in reply to Re: Break a long regex across multiple lines of code, with comments
in thread Break a long regex across multiple lines of code, with comments
[^e] # ensure there isn't an 'e' here
stevieb: A small point, but occasionally a very important one (measured in terms of how much of your hair you may pull out): your comment isn't quite right. For instance, in the string 'asd', 'sd' is not followed by an 'e', but it will not match:
That's because [^e] asserts that a character must be present, and that character must not be an 'e'. (The string 'asdx' will match.)c:\@Work\Perl>perl -wMstrict -le "$_ = 'asd'; ;; print 'match' if m/ (?<=a) sd [^e] /x; print 'qed'; " qed
To assert simply "an 'e' must not be present" and have a match with 'asd', use a negative look-ahead:
(This will also match 'asdx', but will not match 'asde'.)c:\@Work\Perl>perl -wMstrict -le "$_ = 'asd'; ;; print 'match' if m/ (?<=a) sd (?! e) /x; print 'qed'; " match qed
See the "Look-Around Assertions" sub-section of the "Extended Patterns" section of perlre. See also perlretut.
Update: Just making the character class optional with [^e]? won't work because 'asde' will then match. You could exclude 'asde' while matching 'asd' and 'asdx' by adding the \z "absolute end of string" assertion
[^e]? \z
but then 'asdxy' won't match! (Would maybe [^e]* \z work? Have to know the precise data.)
Update 2: A more accurate comment for the original regex element would be
[^e] # insure a character is present that is not an 'e'
Give a man a fish: <%-{-{-{-<
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^3: [OT: Pedantic] Break a long regex across multiple lines of code, with comments
by stevieb (Canon) on Sep 23, 2015 at 23:43 UTC |