Re: Machine learning pattern matching...

in reply to Machine learning pattern matching...

> What I'm thinking of is along the lines of an algorithm that looks for repetition in the HTML structure of the page, and then examines them for the relevant data - could be table rows, divs, paragraphs, lists - trying to be as generic as possible...

Sounds for me like a combination of web mining and cluster analysis! (?)

I doubt that you can find any ready to use modules combining bothš, cause this is a core technology for some big players in web business.

Cheers Rolf

š) Especially as generic as you asked

In Section Seekers of Perl Wisdom