Yes, I want to follow the links in PDF as well, But the spider does this already. It is simply the grabbing the PDF page bit so I also have a hard copy that is the problem.
With html it works fine,it scours through links given a start link and then nabs all the pages it gets to. All links already spidered get put into a hash, so it doesn't go back there twice. I will giev you guys the code, It is long, so I can probably show u the bit doing the job in html. Its 2:33am right now, and I still havn't got further, so bed beckons!!!
Thans guys! | [reply] |