in reply to HTML parsing OR capturing text from a string within tags
Might I suggest a different tack than the one you're taking now?
Long ago, I wrote a newspaper headline grabber for a Perl class, using LWP::Simple's get function to grab web pages. I found that easier to use since it returns the whole page in a single scalar. Then I used HTML::TokeParser to split the page into tokens and collected only the tokens I actually wanted to save.
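To give a rough idea of that approach, here's a minimal sketch (not my original class code). It assumes the headlines live in `<h2>` tags, and `extract_headlines` and the sample markup are made up for illustration. The page is an inline string so it runs without a network connection.

```perl
use strict;
use warnings;
use HTML::TokeParser;

# Hypothetical helper: return the text of every <h2>...</h2> in the page.
# The choice of <h2> is an assumption; use whatever tag your page uses.
sub extract_headlines {
    my ($html) = @_;

    # Passing a scalar reference makes the parser read from the string.
    my $parser = HTML::TokeParser->new(\$html)
        or die "Could not create parser";

    my @headlines;
    while ($parser->get_tag('h2')) {
        # Read text up to the closing tag, with whitespace collapsed.
        my $text = $parser->get_trimmed_text('/h2');
        push @headlines, $text if length $text;
    }
    return @headlines;
}

# Stand-in for a fetched page. In real use it could come from
# LWP::Simple instead:  my $html = get($url);  (undef on failure)
my $sample = <<'HTML';
<html><body>
<h2>First <b>headline</b></h2>
<p>Some body text.</p>
<h2>  Second headline </h2>
</body></html>
HTML

print "$_\n" for extract_headlines($sample);
```

The nice part is that you never have to write a regex against the markup; you just ask for the tags you care about and ignore the rest.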
If you look at "Re: HTML::TokeParser help - parsing headlines", there's a quick and dirty token parser that I wrote so that you can see how it splits up an HTML file.
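If you can't get to that node, something along these lines shows the same idea. This is a fresh sketch rather than the code from that post, and `describe_tokens` is a hypothetical name. It walks the token stream with `get_token` and labels each piece.

```perl
use strict;
use warnings;
use HTML::TokeParser;

# Hypothetical helper: describe each token the parser hands back.
sub describe_tokens {
    my ($html) = @_;
    my $parser = HTML::TokeParser->new(\$html)
        or die "Could not create parser";

    my @lines;
    while (my $token = $parser->get_token) {
        my $type = $token->[0];
        if ($type eq 'S') {            # start tag: element 1 is the tag name
            push @lines, sprintf('start <%s>', $token->[1]);
        }
        elsif ($type eq 'E') {         # end tag
            push @lines, sprintf('end </%s>', $token->[1]);
        }
        elsif ($type eq 'T') {         # text: element 1 is the raw text
            my $text = $token->[1];
            next unless $text =~ /\S/; # skip whitespace-only runs
            push @lines, sprintf("text '%s'", $text);
        }
        else {                         # comments, declarations, PIs
            push @lines, "other ($type)";
        }
    }
    return @lines;
}

print "$_\n" for describe_tokens('<p>Hi <b>there</b></p>');
```

Run it against a saved copy of your page and you'll see quickly which start tags bracket the text you're after.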
Hope that helps!
If quizzes are quizzical, what are tests?