in reply to Re^3: Module hierarchy help
in thread Module hierarchy help

Thanks for long reply,

yes, ideal situation would be have those informations in one config. But I really can't imagine such a config, I will give you explanation: we have 50 different websites - I will compare 2 of them.

On 1st website is something like catalog, I have to click on category links, later click to sub-category, there should be paging (or not), clicking to detail, there should be more pages with details, parse all info I need, and save it to DB.

On 2nd website I have to simulate queries, so it is totally different as first, I get some list, clicking on that list, get other list, clicking and so on...

So creating general config like this and making all those routines in Baseclass seems nearly same to me. Problem is, there is really no same navigation nowhere and data is always stored other as on any page. I'd like to see how such a config should look like, never seen that. I know this would be ideal, but...Could you show me such a config, please ?

Replies are listed 'Best First'.
Re^5: Module hierarchy help
by Asim (Hermit) on Jul 27, 2006 at 15:44 UTC

    Have ye considered the wonders that WW::Mechanize would do for your coding? It's a module to simulate a browser for purposes of interacting on a website. With the help of HTTP::Recorder, you can even do it once manually, and have the module build the WWW:Mechanize script you need for your apps.

    Based upon what you've written, I'd strongly consider those options, and doing some research upon them. The docs I've linked to above, plus Googling for the module names, will lead to a wealth of resources.

    Think that might help remove a layer or two of work and code for you?

    ----Asim, known to some as Woodrow.

      thanks, I am already using WWW::Mechanize(::Cached), doing by hand, no need for HTTP::Recorder. This is not about downloading itself (BASE class for sure), but which links to visit, every web has another hierarchy and thats the problem. It is something like you want to parse news from webs: yahoo news, bbc news, google news and so on.