marvell has asked for the wisdom of the Perl Monks concerning the following question:
I promise, I've already been through the Super Search route, but I have to wonder about the validity and up-to-dateness of nodes in the present wave of XML fever.
In a nutshell, all I want to do is check if some XML is well formed. I don't want to know anything about it, just see if it's well formed.
The background is that I have 20,000 hand-written HTML files from which I have stripped the "useful" data. This comes to me in the form of a snippet of HTML. Now the client has come back to say that they want it in XML, or at least as a snippet of well-formed XML.
XML::Parser with no handlers seemed a good plan, but then, it can only be used once per instance and croaks if the XML is not well formed.
OK, so I can wrap it up in an eval, but then it all seems a bit bloated. Then again, I'm not really in a position to comment on the performance overhead.
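For reference, the eval approach might look something like the sketch below. The sub name `is_well_formed` and the sample snippets are made up for illustration. As far as I understand the XML::Parser docs, each call to `parse` builds a fresh XML::Parser::Expat object internally, so a single XML::Parser object should be reusable across files, but treat that as an assumption to verify.

```perl
use strict;
use warnings;
use XML::Parser;

# One parser object with no handlers: all we care about is whether
# parse() dies. ErrorContext adds a couple of lines of context to $@.
my $parser = XML::Parser->new(ErrorContext => 2);

# Hypothetical helper: returns 1 if $xml is well formed, 0 otherwise.
# On failure the expat error message is left in $@ for the caller.
sub is_well_formed {
    my ($xml) = @_;
    return eval { $parser->parse($xml); 1 } ? 1 : 0;
}

# Made-up sample snippets: the second has an unclosed <b> element.
for my $snippet ('<p>fine <b>here</b></p>', '<p>unclosed <b>tag</p>') {
    printf "%-28s %s\n", $snippet,
        is_well_formed($snippet) ? 'ok' : 'NOT well formed';
}
```

This uses the block form of eval, which is compiled once along with the rest of the program, so the cost per snippet should be dominated by the parse itself rather than by the eval. Note that a snippet with more than one top-level element will fail the check, since well-formed XML needs a single root element.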
Another plan was to preconvert the HTML to XHTML, but that looks to take ages.
Your wisdom would be appreciated.
--
Brother Marvell
Replies are listed 'Best First'.

Re: well formed xml
by davorg (Chancellor) on Feb 28, 2001 at 20:34 UTC
by marvell (Pilgrim) on Feb 28, 2001 at 20:44 UTC

Re: well formed xml
by mirod (Canon) on Feb 28, 2001 at 20:41 UTC

Re: well formed xml
by ZZamboni (Curate) on Feb 28, 2001 at 20:28 UTC

Re: well formed xml
by sierrathedog04 (Hermit) on Feb 28, 2001 at 21:47 UTC
by mirod (Canon) on Feb 28, 2001 at 22:14 UTC