in reply to Re: Broken headlines
in thread Broken headlines

An RDF feed for the Monastery. It's a little broken, but it should be easy enough for you to parse with perl.

I am using Perl. Specifically with XML::RSS. Besides, broken XML is not XML. This site doesn't use XML, it uses something that happens to look like it.

"Don't parse XML with an XML parser, use regexes!". I guess it must be very hard to generate correct XML. After all -- and XML barbie concurs -- XML is *hard*!

I will use an extra Perl script. Not to parse the XML, because that would be extremely silly. But to try to make valid XML from the string I get.

Juerd # { site => 'juerd.nl', plp_site => 'plp.juerd.nl', do_not_use => 'spamtrap' }

Replies are listed 'Best First'.
Re^3: Broken headlines
by jpfarmer (Pilgrim) on Oct 20, 2003 at 17:03 UTC
    Would you consider sharing the script you're using to re-format the XML?