Re^2: Working with source of returned web page

wow that's really handy, but when it comes to more specific strings, which aren't defined by any html tags, then it's necessary to use regex anyway...
But still as I were saying very useful! Thanks again

Comment on Re^2: Working with source of returned web page

Replies are listed 'Best First'.
Re^3: Working with source of returned web page by Popcorn Dave (Abbot) on Jun 10, 2008 at 22:48 UTC
It's been a while since I used that module but if I recall correctly, it parses everything in to a token and the tokens not defined as an HTML tag should be defined as a text token. Take a look at HTML::TokeParser help - parsing headlines and you'll see a quick program I wrote to dump an HTML page to tokenized output. Run that on your page and I think you'll see you don't need to do the regex per se, but rather need to check text tokens to find what you're after. Good luck! Update: Changed link from scratchpad to node as per suggestion by ww Revolution. Today, 3 O'Clock. Meet behind the monkey bars. I would love to change the world, but they won't give me the source code	[reply]

Replies are listed 'Best First'.

Re^3: Working with source of returned web page
by Popcorn Dave (Abbot) on Jun 10, 2008 at 22:48 UTC

Take a look at HTML::TokeParser help - parsing headlines and you'll see a quick program I wrote to dump an HTML page to tokenized output. Run that on your page and I think you'll see you don't need to do the regex per se, but rather need to check text tokens to find what you're after.

Good luck!

Update: Changed link from scratchpad to node as per suggestion by ww

Revolution. Today, 3 O'Clock. Meet behind the monkey bars.

I would love to change the world, but they won't give me the source code

[reply]