in reply to Testing a web crawler
It boils down to it being really, really easy to make Mojo respond any way you want to a URL, so you can have your spider "visit" the Mojo URL, get a page with a bunch of links in it, and then test all the different kinds of things that could happen (timeout, 404, 500, you name it) by sending appropriately-crafted URLs to the Mojo server - which all happen to be on the first page you crawl. You need one "that's all folks" URL to make the Mojolicious server go away, but that's easy enough to do.
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^2: Testing a web crawler
by dlarochelle (Sexton) on Mar 25, 2010 at 22:32 UTC |