Beefy Boxes and Bandwidth Generously Provided by pair Networks
We don't bite newbies here... much
 
PerlMonks  

Re: parsing html link

by wfsp (Abbot)
on May 25, 2009 at 10:21 UTC ( [id://766001]=note: print w/replies, xml ) Need Help??


in reply to parsing html link

You were nearly there (if I've understood your question correctly)
#!/usr/bin/perl use warnings; use strict; use LWP::Simple; use HTML::TreeBuilder; print "ELENCO LIGANDI\n"; my $url3 = "http://www.ebi.ac.uk/thornton-srv/databases/cgi-bin/pdbsum +/GetPage.pl?pdbcode=2j6p&template=ligands.html&l=1.1"; my $content =get ($url3); my $p = HTML::TreeBuilder->new; $p->parse_content($content); my @anchors = $p->look_down(_tag => q{a}); for my $anchor (@anchors){ my $txt = $anchor->as_text; if ($txt=~ /EPE\s/){ print $txt, qq{\n}; my $href = $anchor->attr(q{href}); print $href, qq{\n}; } } $p->delete; __DATA__ output: ELENCO LIGANDI EPE 1148(C) /thornton-srv/databases/cgi-bin/pdbsum/GetPage.pl?pdbcode=2j6p&templat +e=ligands.html&l=4.1 EPE 1148(D) /thornton-srv/databases/cgi-bin/pdbsum/GetPage.pl?pdbcode=2j6p&templat +e=ligands.html&l=4.2
update: changed var names to anchor/s

Replies are listed 'Best First'.
SOLVED: parsing html link
by paola82 (Sexton) on May 25, 2009 at 10:34 UTC

    thanks you solved my question, is it enough if I put there solved and paste your code below???For beginner like me, I suggest to read the previews posts

    #!/usr/bin/perl use warnings; use strict; use LWP::Simple; use HTML::TreeBuilder; print "ELENCO LIGANDI\n"; my $url3 = "http://www.ebi.ac.uk/thornton-srv/databases/cgi-bin/pdbsum +/GetPage.pl?pdbcode=2j6p&template=ligands.html&l=1.1"; my $content =get ($url3); my $p = HTML::TreeBuilder->new; $p->parse_content($content); my @anchors = $p->look_down(_tag => q{a}); for my $anchor (@anchors){ my $txt = $anchor->as_text; if ($txt=~ /EPE\s/){ print $txt, qq{\n}; my $href = $anchor->attr(q{href}); print $href, qq{\n}; } }

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://766001]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others rifling through the Monastery: (3)
As of 2024-04-18 22:38 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found