Beefy Boxes and Bandwidth Generously Provided by pair Networks
The stupid question is the question not asked
 
PerlMonks  

Re: lwp diary: day 1

by ichimunki (Priest)
on Jan 14, 2001 at 18:31 UTC ( [id://51742]=note: print w/replies, xml ) Need Help??


in reply to lwp diary: day 1

Table parsing is a pain in the 4ss. I love that particular message-- especially since it does nothing to even extract the text from the target. For ultra-simple parsing I replace all td end tags with a spacer and all tr end tags with a newline.

You can use HTML::Parser, or you can use HTML::TokeParser (which is a little easier, imho, to get started with-- and is basically a wrapper on the HTML::Parser module). With it you can simply $page->get_token() until you get to a text token which matches your "User Since:" test. Then you can pull all the text tokens until you get to either a tag or a text signal that you are done with the user info (and/or custom) portion of the node. And then exit the parse routine.

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://51742]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others learning in the Monastery: (6)
As of 2024-04-23 22:45 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found