Hi,
currently I have a multi process web crawler based on LWP::UserAgent and Parallel::ForkManager. Spawning off more parallel processes only seems to get speedup up to about 10 processors.
I wanted to work on getting this working faster in serial before worrying about scaling to larger numbers of processors/threads.
Asynchronous DNS and HTTP certainly seems the obvious way to go to me (unless there is anything else you can suggest).
I'm not sure I follow how you can get Mechanize to use AnyEvent::HTTP results. Is there any chance of a quick little code snippet that illustrates what you mean?
thanks.