in reply to Re^3: Question on extracting HTML tables with HTML::TableExtract
in thread Question on extracting HTML tables with HTML::TableExtract

Unclear module docs have always been what bugged me most about learning Perl, so I end up doing quite a bit of experimental programming. That said, I use Perl mostly because of CPAN: whatever I want to do, someone else has mostly solved it already. I liked this problem because I did it myself a long time ago for a scraper (which has been running a few times a week for a few years), and I did it the brute-force way: a regex that identifies the text around the table to tell me it's the one I want, and then the table goes into HTML::TreeBuilder to get the data out. I was going to suggest using TreeBuilder until I read the docs, which had the example almost written already.
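
For comparison with the regex-plus-TreeBuilder route, here is a rough sketch of picking a table by its column headers with HTML::TableExtract. The sample HTML, the "Item" and "Price" header names, and the @rows variable are all made up for illustration; this is not the code from my scraper.

```perl
use strict;
use warnings;
use HTML::TableExtract;

# Hypothetical sample page: two tables, only the second is wanted.
my $html = <<'HTML';
<html><body>
<p>Unrelated table</p>
<table><tr><th>Foo</th><th>Bar</th></tr><tr><td>x</td><td>y</td></tr></table>
<p>Daily prices</p>
<table>
<tr><th>Item</th><th>Price</th></tr>
<tr><td>Widget</td><td>4.50</td></tr>
<tr><td>Gadget</td><td>12.00</td></tr>
</table>
</body></html>
HTML

# Select the table by its header cells rather than by the text around it.
my $te = HTML::TableExtract->new( headers => [ 'Item', 'Price' ] );
$te->parse($html);

# Collect the data rows of every matching table (here, just one table).
my @rows;
for my $table ( $te->tables ) {
    for my $row ( $table->rows ) {
        # Empty cells can come back undefined, so normalise them.
        push @rows, [ map { defined $_ ? $_ : '' } @$row ];
    }
}

print join( "\t", @$_ ), "\n" for @rows;
```

Matching on headers means the surrounding page text can change without breaking the extraction, which is the part my regex approach was always fragile about.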
