marcoss has asked for the wisdom of the Perl Monks concerning the following question:
Hi, I'm extracting info from a website using HTML::TreeBuilder::XPath. With the help of the monks I've been able to do this on other websites with almost no complications... until now. Basically, the foreach loop is not iterating over the whole table to extract the info from each node; it only retrieves the first results. I have tried many approaches, but the result is always the same no matter which XPath I use for the node (even when I copy the full XPath from the browser by right-clicking the element). This is the code; if you run it and look at the page's source code, you'll see what I mean.
#!/usr/bin/perl -w
use LWP::Simple;
use HTML::TreeBuilder::XPath;
use Data::Dumper;
use strict;

my $debug=1;
my $base='http://www.msccrociere.it/it_it';
my $url='/Partenza-Crociere/Trova-La-Tua-Crociera.aspx?Reg=CAR&DateF=201211&ddl=n&p=1&';
my $page = get($base.$url) or die $!;
my $p = HTML::TreeBuilder::XPath->new_from_content( $page );
#binmode( STDOUT, ':utf8');

my @trips= $p->findnodes( '//table[@id="tblFYCXML_Itin"]');
foreach my $trip (@trips){
    my $destination = $trip->findvalue('.//h2[@class="FYCmaneDestXML"]');
    my $shipname = $trip->findvalue('.//div[@class="cContentLeft"]/a/h3');
    print "$destination\n";
    print "$shipname\n";
}
I know I'm making a newbie mistake somewhere. Like I said, I tried many different things before asking here. I hope you can give me a hand. Thanks a lot!
Re: foreach my $question (@perlmonks){}
by tobyink (Canon) on Jun 19, 2012 at 09:32 UTC
by marcoss (Novice) on Jun 19, 2012 at 10:10 UTC
by muba (Priest) on Jun 19, 2012 at 10:27 UTC
by marcoss (Novice) on Jun 19, 2012 at 10:54 UTC
by Anonymous Monk on Jun 19, 2012 at 10:32 UTC
by marcoss (Novice) on Jun 19, 2012 at 11:00 UTC
by Anonymous Monk on Jun 19, 2012 at 11:09 UTC