It looks like the 2 main module distros that people use for parsing html are HTML-Tree (HTML::TreeBuilder) and HTML-Parser.
The HTML-Tree distribution was last updated in 2006. Is it still a good choice?
The HTML-Tree tutorial was written in 2003 and is quite short (and doesn't directly use HTML::TreeBuilder).
The HTML::TokeParser tutorial was written in 2001. Aside from its age, it also has no comments. Is it still accurate?
Are there any current and complete tutorials about for either of these modules? If not, could maybe the Monastery use a refreshed tutorial or two?
Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
Read Where should I post X? if you're not absolutely sure you're posting in the right place.
Please read these before you post! —
Posts may use any of the Perl Monks Approved HTML tags:
- a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
| |
For: |
|
Use: |
| & | | & |
| < | | < |
| > | | > |
| [ | | [ |
| ] | | ] |
Link using PerlMonks shortcuts! What shortcuts can I use for linking?
See Writeup Formatting Tips and other pages linked from there for more info.