The "table states" are there to deal with situations where you have tables within cells of other tables within cells of other tables (...etc.). The primary aim of the module is to "extract" data from heavily formatted web pages, not from tables used to store plain data. Yeah, it's overkill if you just want to suck data out of a data table with no embedded subtables, but that's why there are shortcut methods.
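To make the nesting idea concrete, here is a minimal sketch, assuming a reasonably recent HTML::TableExtract where the table objects are fetched with tables() (older releases called them table states, and I believe the older method names differed accordingly). The HTML snippet is made up for illustration; the point is only that depth and count let you address the inner table directly and skip the layout table wrapped around it.

```perl
use strict;
use warnings;
use HTML::TableExtract;

# Hypothetical page: a layout table whose only cell holds the real data table.
my $html = <<'HTML';
<table>
  <tr><td>
    <table>
      <tr><td>Widget</td><td>9.99</td></tr>
      <tr><td>Gadget</td><td>19.99</td></tr>
    </table>
  </td></tr>
</table>
HTML

# depth => 1 asks for tables nested one level down; count => 0 asks for
# the first table seen at that depth.
my $te = HTML::TableExtract->new(depth => 1, count => 0);
$te->parse($html);

# Collect the rows of every matching table (here, just the inner one).
my @rows;
foreach my $ts ($te->tables) {
    push @rows, $ts->rows;
}

foreach my $row (@rows) {
    # Empty cells can come back undefined, so substitute an empty string.
    print join(' | ', map { defined $_ ? $_ : '' } @$row), "\n";
}
```

If the page really is one flat data table, you can drop the depth and count constraints entirely and just loop over whatever tables the module finds.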

It's actually a pretty useful module for anyone who has tried to zero in on some piece of data on a web page and pulled their hair out trying to get a home-rolled regex-based or HTML::Parser solution to work.

It's quite a spiffy module. It lets you get on with worrying about things other than deciphering a page full of td tags. :-)

Gary Blackburn
Trained Killer