in reply to Re^3: can't extract node with HTML::TreeBuilder::XPath
in thread can't extract node with HTML::TreeBuilder::XPath
I second that. A spec-compliant HTML::TreeBuilder::XPath that works with the XPaths extracted with a common browser would definitely be a simplification....
I was being sarcastic :) HTML::HTML5::Parser isn't documented much better than HTML::TreeBuilder -- you have to read the source just the same
FYI, HTML::TreeBuilder::XPath just tacks an XPath 1.0 engine onto a TreeBuilder tree. Browser add-ons commonly modify the DOM, and it's usually only the @class and @id attributes you're interested in, not absolute paths.
htmltreexpather.pl works with the actual tree that HTML::TreeBuilder builds, no browser required :)
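To illustrate the point about @class and @id, here is a minimal sketch, not taken from htmltreexpather.pl. The markup, the `content` id and the `price` class are all made up for the example. It anchors the query on attributes, then tries a browser-style absolute path for comparison (browsers typically add a tbody to tables in their DOM, so copied paths often contain one).

```perl
use strict;
use warnings;
use HTML::TreeBuilder::XPath;

# Hypothetical sample markup, invented for this illustration.
my $html = <<'HTML';
<html><body>
<div id="content">
<table><tr><td class="price">42</td></tr></table>
</div>
</body></html>
HTML

my $tree = HTML::TreeBuilder::XPath->new;
$tree->parse_content($html);

# Anchor on @id and @class instead of an absolute /html/body/... path.
my $price = $tree->findvalue('//div[@id="content"]//td[@class="price"]');
print "price: $price\n";

# A browser-style absolute path, for comparison. Whether it matches
# depends on the tree the parser actually built, so just count the hits.
my @abs = $tree->findnodes('/html/body/div/table/tbody/tr/td');
print "absolute path matched ", scalar(@abs), " node(s)\n";

# HTML::TreeBuilder trees should be deleted explicitly when done.
$tree->delete;
```

The attribute-based query keeps working if wrapper elements are added or removed around the table, which is why it tends to survive differences between the browser's DOM and the tree TreeBuilder builds.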
Re^5: can't extract node with HTML::TreeBuilder::XPath
by tobyink (Canon) on Aug 01, 2012 at 06:35 UTC
by Anonymous Monk on Aug 01, 2012 at 07:15 UTC
by tobyink (Canon) on Aug 01, 2012 at 10:04 UTC
by Anonymous Monk on Aug 01, 2012 at 11:00 UTC