in reply to Re^2: quite SOLVED Re^4: parsing html
in thread parsing html

...nothing...
Same here. :-(

I had more luck with LWP::UserAgent though

#!/usr/bin/perl use warnings; use strict; use LWP::UserAgent; use HTML::TreeBuilder; my $url = q{http://microrna.sanger.ac.uk/cgi-bin/targets/v5/detail_vie +w.pl?transcript_id=ENST00000226253}; my $ua = LWP::UserAgent->new; $ua->timeout(10); my $response = $ua->get($url); my $content; if ($response->is_success) { $content = $response->content; } else { die $response->status_line; } my $p = HTML::TreeBuilder->new; $p->parse_content($content); my @tds = $p->look_down(_tag => q{td}); for my $td (@tds){ my $bold = $td->look_down(_tag => q{b}); next unless $bold; my $txt = $bold->as_text; if ($txt =~ /miR|let/){ print $txt, qq{\n}; } } $p->delete;
mmu-miR-705 mmu-miR-705 hsa-let-7d hsa-let-7e hsa-miR-483-5p mmu-miR-683 hsa-miR-650 hsa-miR-920 mmu-miR-709 hsa-miR-26b* hsa-miR-185 hsa-let-7a hsa-miR-765 hsa-miR-629* hsa-miR-19b-2* hsa-miR-31 mmu-miR-707 hsa-miR-665 hsa-miR-339-5p hsa-let-7c hsa-let-7b hsa-miR-7 hsa-miR-26b* hsa-let-7g hsa-miR-382 hsa-miR-454* hsa-miR-501-5p mmu-miR-666-5p hsa-miR-486-3p hsa-let-7f mmu-miR-680 hsa-miR-219-2-3p hsa-miR-153 hsa-miR-26a-2* hsa-miR-328 hsa-miR-220c hsa-miR-19a* hsa-miR-433 hsa-miR-769-5p hsa-miR-26b* hsa-miR-19a* hsa-miR-19b-1* hsa-miR-25* hsa-miR-483-5p mmu-miR-685 hsa-miR-938 mmu-miR-465a-3p hsa-miR-139-3p hsa-miR-187* mmu-miR-687
I'm no expert on LWP, perhaps the timeout? It took a while to download.

Replies are listed 'Best First'.
Re^4: quite SOLVED Re^4: parsing html
by paola82 (Sexton) on May 15, 2009 at 11:26 UTC
    thanks a lot you completely solved....but I don't know why sometimes in perl something run and sometime the same doesn't run....that the mysterious power of perl....it can be great and at the same time very heavy...:-)