mr_p has asked for the wisdom of the Perl Monks concerning the following question:
I was trying to parse Item tag with regEx but I am having problems because $capture->{_content} as a string is being translated to some other characterset. So, I am trying to pull out <item> tag using below method and I keep getting this error. Can someone please let me know why?
Error: XPathContext: lost current node at link_ext2.pl line 30
#!/usr/bin/perl -w #use strict; use warnings; use XML::RSS::LibXML; use XML::LibXML; use LWP::UserAgent; use Data::Dumper; #my ( $htmlInfile, $htmlOutfile, $cssOutfile ) = @ARGV; my $html_link = "http://rss.news.yahoo.com/rss/topstories"; my $parser = XML::LibXML->new; my $client = LWP::UserAgent->new(); my $capture = $client->get("$html_link") || die"$!\n"; useLibXmlParseXmlItems($capture->{_content}); sub useLibXmlParseXmlItems { my $rss = XML::RSS::LibXML->new; $rss->parse($_[0]) || die "Could not parse. <$!>"; my $xp = XML::LibXML::XPathContext->new($rss); my @nodes = $xp->findnodes("/rss/channel/item"); #print @nodes; }
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re: Parsing Item Tag from RSS feed
by Corion (Patriarch) on Jul 12, 2010 at 15:14 UTC | |
by mr_p (Scribe) on Jul 12, 2010 at 15:37 UTC | |
by Corion (Patriarch) on Jul 12, 2010 at 16:25 UTC | |
by mr_p (Scribe) on Jul 12, 2010 at 19:47 UTC | |
by Corion (Patriarch) on Jul 12, 2010 at 20:13 UTC | |
by mr_p (Scribe) on Jul 12, 2010 at 21:07 UTC |