Re^2: How would you extract *content* from websites?



by BUU (Prior)

on Jun 18, 2005 at 03:10 UTC


in reply to Re: How would you extract *content* from websites?
in thread How would you extract *content* from websites?

That sounds reasonable, but how do you programmatically determine the starting and ending comments?


Re^3: How would you extract *content* from websites?
by Popcorn Dave (Abbot) on Jun 18, 2005 at 18:27 UTC

Along the lines of what you're after, I suppose you could just parse the pages for comments and build a list of comment tags to look for. You mentioned doing a diff on the files you wanted to look at, so that may be the way to start.
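To make that a bit more concrete, here is a rough sketch of the idea, written in Python only for brevity. It assumes you have already fetched two or more pages from the same site as strings. The function names (`comment_tags`, `shared_comments`, `varying_regions`) are made up for illustration, and the regex-based comment matching is a simplification that will miss odd cases such as comments inside scripts. Comments that show up on every page are treated as template markers, and a pair of adjacent markers whose enclosed text differs between pages is a candidate for the start and end of the content.

```python
import re
from collections import Counter

# Simplified matcher for HTML comments; an assumption, not a full HTML parser.
COMMENT_RE = re.compile(r"<!--(.*?)-->", re.DOTALL)


def comment_tags(html):
    """Return the stripped text of every HTML comment, in page order."""
    return [text.strip() for text in COMMENT_RE.findall(html)]


def shared_comments(pages):
    """Return comments that occur on every page (likely template markers)."""
    counts = Counter()
    for html in pages:
        counts.update(set(comment_tags(html)))
    return sorted(c for c, n in counts.items() if n == len(pages))


def varying_regions(pages):
    """Return (start, end) marker pairs whose enclosed text differs across pages."""
    shared = set(shared_comments(pages))
    # Order the shared markers as they appear on the first page.
    ordered = [c for c in comment_tags(pages[0]) if c in shared]
    regions = []
    for start, end in zip(ordered, ordered[1:]):
        pattern = re.compile(
            r"<!--\s*" + re.escape(start) + r"\s*-->(.*?)"
            r"<!--\s*" + re.escape(end) + r"\s*-->",
            re.DOTALL,
        )
        bodies = []
        for html in pages:
            match = pattern.search(html)
            bodies.append(match.group(1).strip() if match else None)
        # Keep the pair only if every page has it and the bodies are not all equal.
        if None not in bodies and len(set(bodies)) > 1:
            regions.append((start, end))
    return regions
```

Feeding it a handful of article pages from one site and looking at what `varying_regions` returns should give you a short list of comment pairs to check by hand. A real diff of the files would be the next refinement if the markers alone turn out to be too coarse.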

Useless trivia: In the 2004 Las Vegas phone book there are approximately 28 pages of ads for massage, but almost 200 for lawyers.
