Re: Scraping a website


by markong (Pilgrim)

on Jul 31, 2018 at 11:44 UTC ([id://1219547])

in reply to Scraping a website

# check each hypertext link within page
        my @html = split(/a href=/, $html);

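A recommendation: you are doing a lot of extra work to collect the URLs and save the related content. The code is a bit verbose and you could still miss something: splitting on "a href=" will not match links written in upper case or with another attribute before href. Use the "standard" tools to help yourself: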

HTML::LinkExtor - Extract links from an HTML document
LWP::UserAgent - Web user agent class. Look at its get(...) method, and in particular at its :content_file => $filename option, which writes the response body straight to a file.

This should simplify things and help a lot.
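
To make the suggestion concrete, here is a minimal sketch of how the two modules could be combined. It is not a drop-in replacement for your script: the helper names extract_links and save_url, the page_NNN.html file naming, and the decision to follow only http(s) links are my own assumptions for illustration.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use HTML::LinkExtor;
use LWP::UserAgent;
use URI;

# Hypothetical helper: return the absolute URLs of all <a href="..."> links
# in $html. $base is the URL the page came from, used to resolve relative links.
sub extract_links {
    my ($html, $base) = @_;
    my @links;
    my $parser = HTML::LinkExtor->new(
        sub {
            # HTML::LinkExtor passes the lowercased tag name,
            # followed by attribute name/value pairs.
            my ($tag, %attr) = @_;
            return unless $tag eq 'a' && defined $attr{href};
            push @links, URI->new_abs($attr{href}, $base)->as_string;
        }
    );
    $parser->parse($html);
    $parser->eof;
    return @links;
}

# Hypothetical helper: fetch $url and let LWP write the body to $file.
sub save_url {
    my ($ua, $url, $file) = @_;
    my $res = $ua->get($url, ':content_file' => $file);
    return $res->is_success;
}

# Runs only when a start URL is given on the command line.
if (@ARGV) {
    my $start = shift @ARGV;
    my $ua    = LWP::UserAgent->new(timeout => 30);

    my $page = $ua->get($start);
    die "Cannot fetch $start: ", $page->status_line, "\n"
        unless $page->is_success;

    my $n = 0;
    for my $url (extract_links($page->decoded_content, $page->base)) {
        next unless $url =~ m{^https?://};    # assumption: skip mailto:, javascript:, etc.
        my $file = sprintf 'page_%03d.html', ++$n;
        save_url($ua, $url, $file) or warn "Failed to save $url\n";
    }
}
```

Run it with the start URL as the only argument. The parser takes care of case, attribute order and quoting, so there is no hand-rolled split to maintain, and :content_file saves you from opening and writing the output files yourself.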