in reply to LWP::Simple... Enough for Site Query & Data Download?
Corion was right earlier. WWW::Mechanize is the easiest interface to the things you want to do: submit (no-JavaScript) forms programmatically. Make sure the site you are querying allows it. Many (most?) sites do not permit any data scraping in their terms of service. Some will make allowances if you ask formally. Some have APIs to get the data in a robust/correct way.
update: CountZero's also right. You'll want HTML parsing after the form results return. HTML::TokeParser::Simple, HTML::TreeBuilder, or XML::LibXML for example. If you get stuck on one come back here.
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^2: LWP::Simple... Enough for Site Query & Data Download?
by cheech (Beadle) on Jun 15, 2009 at 00:20 UTC | |
by Your Mother (Archbishop) on Jun 15, 2009 at 00:52 UTC | |
by cheech (Beadle) on Jun 16, 2009 at 21:26 UTC | |
by Anonymous Monk on Jun 16, 2009 at 21:52 UTC | |
by cheech (Beadle) on Jun 16, 2009 at 22:12 UTC | |
| |
by Your Mother (Archbishop) on Jun 16, 2009 at 22:42 UTC |