HTML is not XML, and you cannot parse HTML with an XML parser.

Hi

Hmm, lets see, html libxml, [google://site:perlmonks.org html libxml]

..scanning... 2003 HTML tidy, using XML::LibXML

second check, html twig ... 2004 XML::Twig and HTML Entities

I'm sure a check of the previously linked docs would have revealed the same , xml parsers can read html

Even when I'm confident in my memory, I always check to make sure

You might also want to make an account, so you can edit your posts and fix your typos, like "steaming mode" in the post above.

Thanks , I already have account

xmltwig.org is all about "streaming mode" as a concept (dont load whole document into memory)

XML::LibXML also supports it -- I checked before I posted

Also both documentations mention "stream"

A person can't know/remember everything, thats why we have perlmonks and search

When Duty Calls, real pedants check the fact not just their memories


In reply to Re^6: Match text from txt to html by Anonymous Monk
in thread Match text from txt to html by corfuitl

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.