in reply to Fixing ill-formed XML

Isn't there a module called HTML-Tidy (or something similar) which can clean up ill-formed HTML?

If I remember well, it could unmix mixed-up HTML-tags. Perhaps it can give you some pointers how to do it.

CountZero

"If you have four groups working on a compiler, you'll get a 4-pass compiler." - Conway's Law

Replies are listed 'Best First'.
Re^2: Fixing ill-formed XML
by Ionizor (Pilgrim) on Dec 20, 2002 at 02:46 UTC
    CountZero++! However, HTML Tidy is not a Perl module, it's a stand alone application that corrects common HTML errors. You can get a copy here. The technical term for such a piece of software is a "Lint". Don't ask me, I don't know where they got the name.

      Thank you Ionizor. I see that there is a PERL-wrapper for HTML-Tidy, so that must have been mixed up in my memory.

      CountZero

      "If you have four groups working on a compiler, you'll get a 4-pass compiler." - Conway's Law