in reply to How to extract text between two tags?
use htmltreexpather.pl / xpather.pl / examples(for tree-xpath and others)/walkthroughs/tutorials ...
They'll give you paths you can use to reach [ Paper ID - Title (# Reviewers) ]
Paths like these
/html/body/div[4]/form/dl/dd/p /html[1]/body[1]/div[4]/form[1]/dl[1]/dd[1]/p[1] //*[ name() = "form" and position() = 1 and @action = "/openconf/chair +/assign_reviews.php" and @method = "post" ] /dl[1] /dd[1] /p[1]
These paths are easy to use with HTML::TreeBuilder::XPath or XML::LibXML
They can help you visualize the html even if you choose to stick with TreeBuilder's look_down
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^2: How to extract text between two tags?
by Cursed Chico (Initiate) on May 30, 2015 at 16:23 UTC | |
by Anonymous Monk on May 30, 2015 at 17:10 UTC |