Anonymous Monk has asked for the wisdom of the Perl Monks concerning the following question:
It looks like the two main module distros that people use for parsing HTML are HTML-Tree (HTML::TreeBuilder) and HTML-Parser.
The HTML-Tree distribution was last updated in 2006. Is it still a good choice?
The HTML-Tree tutorial was written in 2003 and is quite short (and doesn't directly use HTML::TreeBuilder).
The HTML::TokeParser tutorial was written in 2001. Aside from its age, it also has no comments. Is it still accurate?
Are there any current and complete tutorials for either of these modules? If not, could the Monastery use a refreshed tutorial or two?
Re: Request: Current and more complete HTML parse parsing tutorial
by desemondo (Hermit) on Apr 20, 2010 at 22:33 UTC
by Anonymous Monk on Apr 21, 2010 at 13:13 UTC

Re: Request: Current and more complete HTML parse parsing tutorial
by blakew (Monk) on Apr 21, 2010 at 05:04 UTC
by Anonymous Monk on Apr 21, 2010 at 13:09 UTC

Re: Request: Current and more complete HTML parse parsing tutorial
by Anonymous Monk on Apr 23, 2010 at 20:24 UTC