
I am trying to parse a log file that comes from a price checker scanner, looking specifically for scanning transactions. A typical transaction is below:

Oct 31, 2013 10:40:05 AM PCKLog log
INFO: 10.20.33.251: recv - <2>747599306525<3><5><4> 
Oct 31, 2013 10:40:05 AM PCKLog log
INFO: connect PLU server = 128.20.20.211, PLU port = 31415
Oct 31, 2013 10:40:05 AM PCKLog log
INFO: connected.....
Oct 31, 2013 10:40:05 AM PCKLog log
INFO: PLU send - 12 bytes
Oct 31, 2013 10:40:05 AM PCKLog log
INFO: PLU send - 747599306525
Oct 31, 2013 10:40:06 AM PCKLog log
INFO: PLU recv - 124 bytes
Oct 31, 2013 10:40:06 AM PCKLog log
INFO: PLU recv - <?xml version="1.0" encoding="utf-8"?><PLU><desc>GHIRARDELLI MINT</desc><dept>110</dept><prc1>600</prc1><deal>2</deal></PLU>
Oct 31, 2013 10:40:06 AM PCKLog log
INFO: disconnected.....
Oct 31, 2013 10:40:06 AM PCKLog log
INFO: 10.20.33.251: send - <2>\x0B\x1B[1F\x1B[08;08TMBheader\x1B[2002F\x1B[000;24CGHIRARDELLI MINT\x1B[6F\x1B[000;36C2/$6.00<3>8<4>
Oct 31, 2013 10:40:11 AM PCKLog log
INFO: 10.20.33.251: send - <2>\x0B\x1B[1F\x1B[008;08T\x1B[1002J<3>8<4>

Using Parse::RecDescent, I have developed the following code:

#!C:\Perl\bin\perl
use strict;
use warnings;
use Parse::RecDescent;
use Data::Dumper;

BEGIN{ $::RD_AUTOACTION=q{ [@item[1..$#item]] };  }

my $grammar = q{
    transaction:   date|scan|date|connectPLU|date|connected|date|sending|date|sent|date|receiving|date|received|date|disconnected|date|display1|date|display2
    date:          /([Jan|Feb|Mar|Apr|May|Jun|Jul|Aug|Sep|Oct|Nov|Dec]\\s+\\d{1,2}, \\d{4}) (\\d{1,2}:\\d{1,2}):\\d{1,2} ([AM|PM]) PCKLog log/ { print"$item[0]: $item[1] - $item[2]$item[3]\\n"; } |
    scan:          /INFO: (\\d{1,3}\.\d{1,3}\.\d{1,3}\.\d{1,3}): recv - \<\d\>(\d+)\<\d\>\<\d\>\<\d\>/ |
    connectPLU:    /INFO: connect PLU server = \d{1,3}\.\d{1,3}\.\d{1,3}\.\d{1,3}, PLU port = \d+/ |
    connected:     /INFO: connected\.\.\.\.\./ |
    sending:       /INFO: PLU send - \d{1,2} bytes/ |
    sent:          /INFO: PLU send - (\d+)/ |
    receiving:     /INFO: PLU recv - \d{1,3} bytes/ |
    received:      /INFO: PLU recv - \<\?xml .*\?\>\<PLU\>\<desc\>(.*)\<\/desc\>\<dept\>\d{1,4}\<\/dept\>\<prc1\>(\d{1,4})\<\/prc1\>\<deal\>\d{1,3}\<\/deal\>\<\/PLU\>/ |
    disconnected:  /INFO: disconnected\.\.\.\.\./ |
    display1:      /INFO: (\d{1,3}\.\d{1,3}\.\d{1,3}\.\d{1,3}): send - .*/ |
    display2:      /INFO: (\d{1,3}\.\d{1,3}\.\d{1,3}\.\d{1,3}): send - .*/
};

my $parser = new Parse::RecDescent($grammar) or die "Bad grammar: $!\n";

my($INFILE,$storelog);
my @log;

$storelog = "SingleTrans.txt";

open $INFILE,"<",$storelog or die "Can't open $storelog: $!\n";
@log=<$INFILE>;
close $INFILE;

my $tree=$parser->transaction(@log);

print Dumper($tree);

What I am getting with the above code is just the word 'date' printed out when I dump the $tree variable. I went through most of the resources here on PM, as well as some other stuff I found on the web.

What I need to pull from the scan log is the date and time the transaction occurred, the IP address of the scanner making the request, the PLU that was sent, and the response, which is the product description. Based on the transaction listed above, the information would be:

Date:        Oct 31, 2013 - 10:40
Scanner:     10.20.33.251
PLU:         747599306525
Description: GHIRARDELLI MINT
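
For comparison, here is a minimal sketch of the same extraction done with plain regular expressions instead of Parse::RecDescent. It is not a fix for the grammar above, only an illustration of the four fields I am after. It assumes that each INFO entry sits on a single line in the real log file, that the scanner's recv line starts a transaction, and that the <desc> element in the PLU reply ends it. The name extract_transactions is a hypothetical helper, not part of any module.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Sample data copied from the transaction above (assumption: one entry per line).
my $log = <<'LOG';
Oct 31, 2013 10:40:05 AM PCKLog log
INFO: 10.20.33.251: recv - <2>747599306525<3><5><4>
Oct 31, 2013 10:40:05 AM PCKLog log
INFO: connect PLU server = 128.20.20.211, PLU port = 31415
Oct 31, 2013 10:40:05 AM PCKLog log
INFO: connected.....
Oct 31, 2013 10:40:05 AM PCKLog log
INFO: PLU send - 12 bytes
Oct 31, 2013 10:40:05 AM PCKLog log
INFO: PLU send - 747599306525
Oct 31, 2013 10:40:06 AM PCKLog log
INFO: PLU recv - 124 bytes
Oct 31, 2013 10:40:06 AM PCKLog log
INFO: PLU recv - <?xml version="1.0" encoding="utf-8"?><PLU><desc>GHIRARDELLI MINT</desc><dept>110</dept><prc1>600</prc1><deal>2</deal></PLU>
Oct 31, 2013 10:40:06 AM PCKLog log
INFO: disconnected.....
Oct 31, 2013 10:40:06 AM PCKLog log
INFO: 10.20.33.251: send - <2>\x0B\x1B[1F\x1B[08;08TMBheader\x1B[2002F\x1B[000;24CGHIRARDELLI MINT\x1B[6F\x1B[000;36C2/$6.00<3>8<4>
Oct 31, 2013 10:40:11 AM PCKLog log
INFO: 10.20.33.251: send - <2>\x0B\x1B[1F\x1B[008;08T\x1B[1002J<3>8<4>
LOG

# Month names as a non-capturing group; a bracketed [Jan|Feb|...] would be
# a character class matching one character, not a whole month name.
my $month = qr/(?:Jan|Feb|Mar|Apr|May|Jun|Jul|Aug|Sep|Oct|Nov|Dec)/;

# Hypothetical helper: returns one hash reference per completed transaction.
sub extract_transactions {
    my ($text) = @_;
    my (@transactions, $stamp, $current);

    for my $line (split /\n/, $text) {
        # Timestamp line: remember date and hh:mm for the INFO line that follows.
        if ($line =~ /^($month\s+\d{1,2}, \d{4}) (\d{1,2}:\d{2}):\d{2} (?:AM|PM) PCKLog log/) {
            $stamp = "$1 - $2";
        }
        # Scanner request: starts a new transaction.
        elsif ($line =~ /^INFO: (\d{1,3}(?:\.\d{1,3}){3}): recv - <\d>(\d+)<\d>/) {
            $current = { date => $stamp, scanner => $1, plu => $2 };
        }
        # PLU server reply carrying the description: closes the transaction.
        elsif ($current && $line =~ m{^INFO: PLU recv - .*<desc>(.*?)</desc>}) {
            $current->{description} = $1;
            push @transactions, $current;
            undef $current;
        }
    }
    return @transactions;
}

for my $t (extract_transactions($log)) {
    printf "%-13s%s\n", 'Date:',        $t->{date};
    printf "%-13s%s\n", 'Scanner:',     $t->{scanner};
    printf "%-13s%s\n", 'PLU:',         $t->{plu};
    printf "%-13s%s\n", 'Description:', $t->{description};
}
```

Run against the sample transaction, this prints the four fields listed above. Reading line by line keeps each timestamp paired with the INFO line after it, which is the same pairing the grammar is trying to express.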

Any help would be greatly appreciated as always.

TStanley
--------
People sleep peaceably in their beds at night only because rough men stand ready to do violence on their behalf. -- George Orwell

In reply to P::RD and grammar by TStanley
