in reply to convert whole html-files to xml

Well, first thing you do is forget about doing that, and you're done -- html isn't xml and vice versa

Oh look, HTML::Tidy::libXML

Replies are listed 'Best First'.
Re^2: convert whole html-files to xml
by CountZero (Bishop) on Sep 15, 2013 at 07:10 UTC
    Not at all! HTML is far less strict than XML. Perhaps you were thinking of XHTML?

    CountZero

    A program should be light and agile, its subroutines connected like a string of pearls. The spirit and intent of the program should be retained throughout. There should be neither too little or too much, neither needless loops nor useless variables, neither lack of structure nor overwhelming rigidity." - The Tao of Programming, 4.1 - Geoffrey James

    My blog: Imperial Deltronics

      Not at all! HTML is far less strict than XML. Perhaps you were thinking of XHTML?

      I was thinking <small. na-nana-boo-boo Oh look, HTML::Tidy::libXML $xml/$xhtml