in reply to Trying to step through a web page with WWW::Mechanize one line at a time

I recommend using a parser that returns the page as a tree, like HTML::TreeBuilder.

For data extraction, I like using CSS selectors or XPath queries to specify which elements to extract. See HTML::TreeBuilder::XPath for running XPath queries against the tree and HTML::Selector::XPath for translating CSS selectors into XPath. App::scrape provides a blueprint of how to combine the two to extract data from a website.
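As a rough sketch of that approach (not taken from App::scrape itself), the following parses a page into a tree and pulls out links matching a CSS selector. The sample HTML, the selector 'div.results a', and the helper name extract_links are made up for illustration; in your case the content would presumably come from $mech->content.

```perl
use strict;
use warnings;
use HTML::TreeBuilder::XPath;
use HTML::Selector::XPath qw(selector_to_xpath);

# Hypothetical sample page; with WWW::Mechanize this would be $mech->content.
my $html = <<'HTML';
<html><body>
  <div class="results">
    <a href="/first">First</a>
    <a href="/second">Second</a>
  </div>
  <a href="/ignored">Ignored</a>
</body></html>
HTML

# Hypothetical helper: returns [ text, href ] pairs for every element
# matched by the given CSS selector.
sub extract_links {
    my ($content, $css) = @_;

    # Parse the page into a tree that understands XPath queries.
    my $tree = HTML::TreeBuilder::XPath->new_from_content($content);

    # Translate the CSS selector into an equivalent XPath expression.
    my $xpath = selector_to_xpath($css);

    my @nodes = $tree->findnodes($xpath);
    my @links = map { [ $_->as_text, $_->attr('href') ] } @nodes;

    # HTML::Element trees should be deleted explicitly to free memory.
    $tree->delete;

    return @links;
}

my @links = extract_links($html, 'div.results a');
print "$_->[0] => $_->[1]\n" for @links;
```

Working on the tree this way means you address elements by their position and attributes in the document rather than by line, so the extraction keeps working when the page's whitespace or line breaks change.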
