in reply to Re^3: crawling one website
in thread crawling one website
Viewing the source of http://www.senopt.com shows "links" like:
<a href="senopt/VPS/vps.html" target="_blank">vps</a> <br> <a href="senopt/DogTraining/dogtraining.html" target="_blank">dog trai +ning</a>
You're going to have to do a bit more work, I'm afraid. You may want to start by reading More robust link finding than HTML::LinkExtor/HTML::Parser?.
HTH,
planetscape
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^5: crawling one website
by vit (Friar) on May 29, 2011 at 00:56 UTC | |
by planetscape (Chancellor) on May 29, 2011 at 02:07 UTC | |
by vit (Friar) on May 29, 2011 at 02:36 UTC | |
by planetscape (Chancellor) on May 29, 2011 at 03:10 UTC |