This is probably a klunky solution, but it reminded me of a parser I had written.. (
Evaluate Expressions.)
Quickly hacking at it and only testing a few simple cases (you've been warned)
You could use the returned structure to search for whatever criteria you desire. I'm sure someone here has a much more elegant solution, but my regex abilities are rather limited...
#!/usr/bin/perl
use warnings;
use strict;
use Data::Dumper;
my $input = '(a,b,(c,d,(e,f,g)))';
print Dumper parse_expression($input);
sub parse_expression {
my $exp = shift;
my @tokens = ();
$exp=~s/\s*([()])\s*/ $1 /go;
# Get tokens
push @tokens, $1 while $exp=~/\G\s*(".*?")/gc or $exp=~/\G\s*('.*?
+')/gc or $exp=~/\G\s*(\S+)/gc;
# Find any parens.
my (@lp,@rp) = ();
for (my $p =0; $p < @tokens; $p++){
if ($tokens[$p] eq '('){
push @lp,$p;
}elsif($tokens[$p] eq ')'){
push @rp,$p;
}
}
if ( @lp != @rp){
warn "Mismatched parens in expression.\n";
return;
}
my @temp = @tokens;
for (my $i=0; $i < @rp; $i++){
my @wanted;
for (my $j = $#lp; $j >= 0 ; $j--){
if ( defined $lp[$j] && $lp[$j] < $rp[$i] ){
(undef,@wanted) = @tokens[ $lp[$j] .. ($rp[$i] - 1 )
+] ;
@tokens[ $lp[$j] .. ($rp[$i]) ] = [ grep {defined $_
+} @wanted];
push @temp, map {split /\s*,\s*/} @wanted;
$lp[$j] = $rp[$i] = undef;
last;
}
}
}
return $tokens[0];
}
__DATA__
# OUTPUT
$VAR1 =
[
'a,b,',
[
'c,d,',
[
'e,f,g'
]
]
];
-Lee
"To be civilized is to deny one's nature."
Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
Read Where should I post X? if you're not absolutely sure you're posting in the right place.
Please read these before you post! —
Posts may use any of the Perl Monks Approved HTML tags:
- a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
| |
For: |
|
Use: |
| & | | & |
| < | | < |
| > | | > |
| [ | | [ |
| ] | | ] |
Link using PerlMonks shortcuts! What shortcuts can I use for linking?
See Writeup Formatting Tips and other pages linked from there for more info.