I honestly don't see much gain in it. Html that is regular enough to be worth doing this to would be better served in XML. Html that isn't so regular still wouldn't match well even with your theoretical grammar.

Html is too irregular to treat in a regular manner and the benefit is too low. Thus the people that most wanted this sort of thing, (like /.'s slashboxes and such) tended to encourage more regular standards in order to get what they wanted without ad-hoc parsing that needs to be tweaked every week or month.

Still, if you take a stab at it, I'll tinker with it. It's an interesting example of what Larry Wall has been calling "little languages". Yah never know, you might open up some flood gates and set a buncha people off in a direction you never expected.

--
$you = new YOU;
honk() if $you->love(perl)


In reply to RE: A grammar for HTML matching by extremely
in thread A grammar for HTML matching by mcelrath

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.