
This is not just a "lex"-like scanner as noted in perlre, but a fully recursive parser (note the calls to expr() inside some of the actions). It will properly handle multiple levels of parentheses.

My advice is to instrument it with debug prints to see what's happening, and to add a recursion level counter so you can track the recursive calls.

It is loosely based on this expression parser I wrote a while ago:

#!/usr/bin/perl

use strict;   # mini.pl - modified Pratt parser by tybalt89
use warnings; # https://en.wikipedia.org/wiki/Pratt_parser
sub error { die "ERROR  ", s/\G/ <@_> /r, "\n" }

sub expr # two statement parser - precedences: (3 **) (2 * /) (1 + -)
  {
  my $answer =
    /\G\s* ((?:\d+(?:\.\d*)?|\.\d+)(e[+-]?\d+)?) /gcxi ? $1 :
    /\G\s*\(/gc ? (expr(0), /\G\s*\)/gc || error 'missing )')[0] :
    /\G\s* - /gcx ? -expr(3) :  # unary minus
    /\G\s* \+ /gcx ? +expr(3) :  # unary plus
    error 'bad operand';
  $answer =
    $_[0] <= 3 && /\G\s* \*\* /gcx ? $answer ** expr(3) :
    $_[0] <= 2 && /\G\s* \* /gcx   ? $answer * expr(3)  :
    $_[0] <= 2 && /\G\s* \/ /gcx   ? $answer / expr(3)  :
    $_[0] <= 1 && /\G\s* \+ /gcx   ? $answer + expr(2)  :
    $_[0] <= 1 && /\G\s* \- /gcx   ? $answer - expr(2)  :
    return $answer while 1;
  }

for ( @ARGV ? @ARGV : scalar <> ) # source as commandline args or stdin
  {
  my $answer = expr(0);
  /\G\s*\z/gc ? print s/\s*\z/ = $answer\n/r : error 'incomplete parse';
  }

which was also the basis for this Re: Parsing Boolean expressions

but it doesn't need the precedence stuff.
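
For what it's worth, here is a rough sketch of the instrumentation suggested above (debug prints plus a recursion level counter), applied to mini.pl. The parsing logic is unchanged and only renamed to expr_body(); the names $depth, @trace, note(), expr_body() and evaluate() are my own inventions for illustration and are not part of the original script. It assumes Perl 5.14 or later because of s///r.

```perl
#!/usr/bin/perl
use strict;
use warnings;

our $depth = 0;  # recursion level counter (hypothetical name)
my @trace;       # collected debug lines (hypothetical name)

sub error { die "ERROR  ", s/\G/ <@_> /r, "\n" }

sub note # hypothetical helper: record one debug line, indented by level
  {
  push @trace,
    sprintf '%s%s at pos %d', '  ' x $depth, "@_", pos($_) // 0;
  }

sub expr # wrapper: log entry and exit around the real parser
  {
  my ($min) = @_;
  note("enter expr($min)");
  # local restores $depth automatically, even if error() dies
  my $answer = do { local $depth = $depth + 1; expr_body($min) };
  note("leave expr($min) = $answer");
  return $answer;
  }

sub expr_body # same logic as expr() in mini.pl, only renamed
  {
  my $answer =
    /\G\s* ((?:\d+(?:\.\d*)?|\.\d+)(e[+-]?\d+)?) /gcxi ? $1 :
    /\G\s*\(/gc ? (expr(0), /\G\s*\)/gc || error 'missing )')[0] :
    /\G\s* - /gcx ? -expr(3) :  # unary minus
    /\G\s* \+ /gcx ? +expr(3) :  # unary plus
    error 'bad operand';
  $answer =
    $_[0] <= 3 && /\G\s* \*\* /gcx ? $answer ** expr(3) :
    $_[0] <= 2 && /\G\s* \* /gcx   ? $answer * expr(3)  :
    $_[0] <= 2 && /\G\s* \/ /gcx   ? $answer / expr(3)  :
    $_[0] <= 1 && /\G\s* \+ /gcx   ? $answer + expr(2)  :
    $_[0] <= 1 && /\G\s* \- /gcx   ? $answer - expr(2)  :
    return $answer while 1;
  }

sub evaluate # hypothetical driver: parse one string, return its value
  {
  local $_ = shift;
  @trace = ();
  my $answer = expr(0);
  /\G\s*\z/gc or error 'incomplete parse';
  return $answer;
  }

for my $source ( @ARGV ) # expressions as commandline args
  {
  print "$source = ", evaluate($source), "\n";
  print STDERR "$_\n" for @trace; # indented call trace
  }
```

Run it with something like '2*(3+4)' as an argument and the trace on STDERR shows each expr() call indented by its recursion level, with the precedence it was called with and the match position on entry and exit. Wrapping the parser instead of editing it keeps the debug code in one place, so it is easy to rip out again afterwards.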

In reply to Re^3: Regular Expression Test by tybalt89
in thread Regular Expression Test by leoberbert
