in reply to date of a web page

In any case, you can't count on the data you get from the site.

Some static sites are careless about updating the metas (or the "latest update" info in the body of the page). Some dynamic sites will give you "today's date" every time you visit, despite the lack of any changes in the content.

So, as an exercise, yours is a valid exercise. But if the date info has some critical meaning, be suspicious of what you get.

Replies are listed 'Best First'.
Re^2: date of a web page
by Anonymous Monk on Jun 13, 2008 at 05:08 UTC
    Yes, I never trust anything on the net :-) This is mainly an exercise to see how well I can track news stories as they evolve over time. Mix in a bit a semantic analysis and see if the 'tone' of the stories evolve over time. Hey, It keeps me off the streets. Thanks for your suggestions monks.