I have what seems like a simple parsing problem, but am stumped. I have single lines in input with repeated patterns of arbitrary length that match this general format (whown with 3 patterns):
I want to build a parser that loops through the input line and repeatedly extracts either the token "AND" or "OR", and then eats up everything in the line until the next "AND" or "OR", or EOL whichever comes first. I've tried this:$str = q!AND (random text) AND (more random text) AND (yet more)!; $str = q!OR (random text) OR (more random text) OR (yet more)!;
but that doesn't work -- the 2nd pattern eats everything to the EOL. Since the 2nd pattern is bounded by parens, I've tried this:while ($str =~ /\G(AND|OR)\s+(.+?)/g) { printf("%s %s\n", $1, $2); }
This does work, however, if the "random text" includes any parens itself, then the pattern match fails, by matching the first end-parenthesis in the text. E.g. this string breaks the pattern match:while ($str =~ /\G(AND|OR)\s+(\(.+?\))/g) { printf("%s %s\n", $1, $2); }
$str = q!AND (random text) AND (yet (more))!;
What's a good way to eat up the line, capturing the "AND" or "OR" token in $1, and the "random text" in $2, on and on until we hit the EOL. Your regex expertise is appreciated.
In reply to simple perl regex question (or is it?) by cadphile
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |