in reply to Re^2: Non-greedy substitution
in thread Non-greedy substitution
maybe this helps, you can reproduce it in the debugger started with perl -de0
DB<22> p $_ = join "," , A..E A,B,C,D,E DB<23> p m/(,.*,)/ # longest possibility from first comma to last c +omma ,B,C,D, DB<24> p m/(,.*?,)/ # shortest possibility from first comma to next +comma ,B, DB<25> p m/(,.*$)/ # longest possibility from first comma to end of + line ,B,C,D,E DB<26> p m/(,.*?$)/ # shortest possibility from first comma to end o +f line ,B,C,D,E DB<27>
the regex-engine tries to find a solution step by step:
But all solutions provided by other monks made sure that only the rightmost comma allowed to be a match.
For instance
DB<27> p m/(,[^,]*)$/ # comma followed by non-commas till EOL ,E
The engine will actually try to first match all other commas to the left but always fail because it encounters other commas before reaching the EOL.
we can actually make the regex display it's intermediate attempts to match while "backtracking"
DB<32> ; m/(,[^,]*) (?{say $1}) $/x #show all intermediate attempts +to match $1 until it doesn't fail ,B , ,C , ,D , ,E DB<33>
The difference with non-greedy quantifier *? matching is that the engine goes from shortest to longest attempts while backtracking
DB<34> ; m/(,[^,]*?) (?{say $1}) $/x #show all intermediate attempts + to match $1 , ,B , ,C , ,D , ,E DB<35>
Is this clearer now? :)
HTH!
Cheers Rolf
(addicted to the Perl Programming Language :)
see Wikisyntax for the Monastery
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^4: Non-greedy substitution
by Bod (Parson) on Nov 21, 2024 at 23:14 UTC | |
by LanX (Saint) on Nov 22, 2024 at 15:31 UTC | |
by Bod (Parson) on Dec 04, 2024 at 19:47 UTC |