in reply to Re: Importing into Database
in thread Importing into Database
As an anecdotal usage example, I used this approach at one point to write a "screen scraper" program to pull tens of thousands of books' amazon sales ranks to stick them into a database for analysis. Their html code was fairly grotty, probably to try to prevent this sort of automated digging, but it had to look simple to a human being. In the lynx-parsed output it boiled down to one line that looked like "rank: foo" which was trivial to find/extract information from.
HTH. :-)
* ... and the information that you're interested in is rendered as opposed to being in the tag structure somehow. if you care about what's in the tags, it's time to fire up the Beast that is HTML::Parser...
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re: HTML::Parser Alternative
by davorg (Chancellor) on Nov 24, 2001 at 14:12 UTC |