in reply to Re^2: Parsing ITEM Tag from RSS feed using XML::RSS::LibXML
in thread Parsing ITEM Tag from RSS feed using XML::RSS::LibXML
So, you're parsing an RSS file and then fetching the link for each item. What you get back is HTML, not RSS, so no, I don't think you'll get far trying to process the fetched pages with XML::RSS::LibXML.
I'm not sure what you plan to do with the HTML documents but you already have XML::LibXML loaded into RAM so you could use it to parse the HTML:
    use XML::LibXML;

    my $dom = XML::LibXML->load_html(
        location => $fileName,
        recover  => 1,    # handle marginal HTML
    );
    print $dom->toString;
The parser options for load_html are documented in XML::LibXML::Parser.
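In case it helps, here is a rough sketch of what you might do once the HTML is parsed: query the DOM with XPath. The HTML string and the things I pull out of it (the title and the link hrefs) are made up for illustration, since I don't know what your pages look like. I'm also using recover => 2 here, which as far as I know asks the parser to recover silently rather than warn.

```perl
use strict;
use warnings;
use XML::LibXML;

# Hypothetical HTML standing in for a page fetched from an item's link.
# Note the unclosed <p> tags: the HTML parser copes with those.
my $html = <<'HTML';
<html><head><title>Example page</title></head>
<body><p>First paragraph<p>Second paragraph
<a href="http://example.com/a">A</a>
<a href="http://example.com/b">B</a>
</body></html>
HTML

my $dom = XML::LibXML->load_html(
    string  => $html,
    recover => 2,    # recover from sloppy markup without printing warnings
);

# XPath queries work on the parsed HTML just as they do on XML.
my $title = $dom->findvalue('/html/head/title');
my @links = map { $_->getAttribute('href') } $dom->findnodes('//a[@href]');

print "Title: $title\n";
print "Link: $_\n" for @links;
```

For a real page you would pass location => $fileName (or a string you fetched yourself) instead of the inline HTML, and adjust the XPath expressions to whatever you are after.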