in reply to perl and DOM

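XML::LibXML includes an HTML parser, so an HTML page can be loaded into the same DOM tree (and queried with the same DOM and XPath methods) as an XML document.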
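
A minimal sketch of how that might look, assuming a reasonably recent XML::LibXML (one that provides load_html) and an inline HTML string; the sample markup, the variable names and the XPath expressions are made up for illustration and are not from the original question:

```perl
use strict;
use warnings;
use XML::LibXML;

# Sample input (hypothetical): note the unclosed <p> tags, which a strict
# XML parser would reject but the HTML parser is expected to cope with.
my $html = <<'HTML';
<html><head><title>Example</title></head>
<body>
<p>First paragraph
<p>Second <a href="http://example.com/">link</a>
</body></html>
HTML

# recover => 1 asks libxml2 to keep going on malformed markup;
# suppress_errors keeps its complaints off STDERR.
my $doc = XML::LibXML->load_html(
    string          => $html,
    recover         => 1,
    suppress_errors => 1,
);

# From here on it is the ordinary DOM/XPath interface.
my $title = $doc->findvalue('//title');
my @hrefs = map { $_->getAttribute('href') } $doc->findnodes('//a[@href]');

print "title: $title\n";
print "link: $_\n" for @hrefs;
```

To read from a file or URL instead of a string, load_html also accepts location => ... in place of string => ...; whether recover is needed depends on how clean the input markup is.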