in reply to Re: Parsing semi-complex HTML
in thread Parsing semi-complex HTML

Actually, I've never had problems using XML::LibXML on broken HTML:

use XML::LibXML; my $parser = XML::LibXML->new(); $parser->recover(1); $parser->recover_silently(1); my $doc = $parser->parse_html_string($stuff);

Good Day,
    Dean

Replies are listed 'Best First'.
Re^3: Parsing semi-complex HTML
by ikegami (Patriarch) on Jul 07, 2010 at 16:11 UTC
    Thanks, good to know! I never tried.

      If we were in the same office, I'd throw something at you right now. I've been using the recover/recover_silently in my XML::LibXML HTML parsing examples--I think in some threads you were in on--for a couple of years. :(

        I knew ->recover existed, but I had no idea how well it worked. I haven't personally used XML::LibXML to parse HTML.