OK, I agree it's going too far to ask us to write the code. However, hints seem more than reasonable - here is a starting point:

First define the grammar that your input obeys in a format that can be read into a parser. For example, if you choose yapp, Parse::Yapp will also tell you how to define the grammar file it needs to parse your input and how to run the relevant lexer and parser. And when you know that, the code generation should be a relatively trivial matter.

Update: The lexer, albeit simple enough for your example input, also has to be written yourself, but the linked documentation explains that thoroughly. 'Tokens' refer to the named categories of substrings you will return as grammatical elements for the parser - these might be 'TK_DIGITS' or 'TK_ID' etc. and 'values' refer to the actual value found under that category for each execution of your lexer - each such execution therefore returning a single substring and the category under which it falls.

The documentation doesn't mention that one also normally writes a throw routine to consume comments and whitespace.

One world, one people


In reply to Re: How can I parse grouped data? by anonymized user 468275
in thread How can I parse grouped data? by Anonymous Monk

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.