Hey all, I am currently writing a Postfix log parser; a reasonable example of a log line is in $string.
It is a whole heap of key=value pairs, easy enough,
but I want to see if I can optimise what I have already done:
my $string = 'to=<adam.clark@ngv.vic.gov.au>, relay=monet1.ngv.vic.gov.au[10.10.10.20]:25, delay=0.54, delays=0.06/0.02/0/0.46, dsn=2.0.0, status=sent ';

# Capture one key=value pair from the front of the line, plus whatever is left over.
my $LogLineHash = qr { ^ ([^=]*)=<?(.*?)>?,?\s+ (.*?) $ }xi;

my ( $junk, $key, $value, %Hash );

# Peel off one pair per pass until nothing is left.
while ( $string ){
    print "String: $string\n";
    ( $junk , $key , $value, $string ) = split( /$LogLineHash/ , $string );
    print "Key: $key\nValue: $value\nLeft Over: $string\n\n";
    $Hash{$key}=$value;
}

while ( my ($key, $value) = each(%Hash) ) {
    print "$key => $value\n";
}
Which gets me:
String: to=<adam.clark@ngv.vic.gov.au>, relay=monet1.ngv.vic.gov.au[10.10.10.20]:25, delay=0.54, delays=0.06/0.02/0/0.46, dsn=2.0.0, status=sent
Key: to
Value: adam.clark@ngv.vic.gov.au
Left Over: relay=monet1.ngv.vic.gov.au[10.10.10.20]:25, delay=0.54, delays=0.06/0.02/0/0.46, dsn=2.0.0, status=sent

String: relay=monet1.ngv.vic.gov.au[10.10.10.20]:25, delay=0.54, delays=0.06/0.02/0/0.46, dsn=2.0.0, status=sent
Key: relay
Value: monet1.ngv.vic.gov.au[10.10.10.20]:25
Left Over: delay=0.54, delays=0.06/0.02/0/0.46, dsn=2.0.0, status=sent

String: delay=0.54, delays=0.06/0.02/0/0.46, dsn=2.0.0, status=sent
Key: delay
Value: 0.54
Left Over: delays=0.06/0.02/0/0.46, dsn=2.0.0, status=sent

String: delays=0.06/0.02/0/0.46, dsn=2.0.0, status=sent
Key: delays
Value: 0.06/0.02/0/0.46
Left Over: dsn=2.0.0, status=sent

String: dsn=2.0.0, status=sent
Key: dsn
Value: 2.0.0
Left Over: status=sent

String: status=sent
Key: status
Value: sent
Left Over:

relay => monet1.ngv.vic.gov.au[10.10.10.20]:25
to => adam.clark@ngv.vic.gov.au
dsn => 2.0.0
status => sent
delay => 0.54
delays => 0.06/0.02/0/0.46
I was hoping I could get my regex to grab all the key=value pairs at once with a (?: )+ style grouping, so that my regex would be:
^(?:([^=]*)=<?(.*?)>?,?\s+)+$
essentially grabbing an arbitrary number of key=value pairs, but it's not to be:

Sample code:
my @bits = split( /^(?:([^=]*)=<?(.*?)>?,?\s+)+$/ , $string );
foreach ( @bits ){
    print "$_\n";
}
gives me:
status
sent
which is the last key value pair.
Is what I want to do possible?
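
For comparison, here is a minimal sketch of one way to collect every pair in a single statement. It does not use the (?: )+ grouping: a capture group inside a repeated group keeps only the text from its last repetition, which is why only status and sent come back above. The sketch instead runs a global match in list context. The pattern and the %pairs name are my own assumptions, not part of the code above, and the pattern assumes values never contain commas, whitespace or '>'.

```perl
use strict;
use warnings;

# Same sample line as above.
my $string = 'to=<adam.clark@ngv.vic.gov.au>, relay=monet1.ngv.vic.gov.au[10.10.10.20]:25, '
           . 'delay=0.54, delays=0.06/0.02/0/0.46, dsn=2.0.0, status=sent ';

# In list context, m//g returns the captures of every successive match,
# so the flat list (key, value, key, value, ...) can be assigned to a hash.
# Assumption: values contain no commas, whitespace or '>'.
my %pairs = $string =~ /([^=\s,]+)=<?([^,>\s]*)>?/g;

print "$_ => $pairs{$_}\n" for sort keys %pairs;
```

If real log lines carry values with embedded spaces or commas, the value part of the pattern would need to be adjusted.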

Also, can someone fill me in on why, when using split(), the first array element is always empty? Hence my $junk variable.
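
A small sketch of the behaviour in question, using a made-up one-pair string rather than the real log line. As documented for split, a leading empty field is produced when the pattern makes a positive-width match at the very start of the string, which is the case with any pattern anchored by ^. A plain match in list context returns only the captures, so no placeholder variable is needed there.

```perl
use strict;
use warnings;

# The pattern matches at offset 0, so the "field" before the separator is ''.
my @fields = split /^(\w+)=/, 'status=sent';
printf "%d fields: [%s]\n", scalar @fields, join( '|', @fields );

# A plain match in list context returns just the captured groups.
my ( $key, $value ) = 'status=sent' =~ /^(\w+)=(.*)$/;
print "$key => $value\n";
```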

In reply to Adapting parenthesis in regexps by cyzza
