in reply to Re: Scrappy Module
in thread Scrappy Module
As you can see, for the URL 'http://search.cpan.org/recent' we have to pass '/recent' as the tag. This works only for the CPAN site; it does not work for other sites. That is my question: how should we use the tags to scrape a website in general? I hope the problem is clear. Here is the code:

use Scrappy;

my $scraper = Scrappy->new;

$scraper->crawl('http://search.cpan.org/recent',
    '/recent' => {
        '#cpansearch li a' => sub { print $_[1]->{href}, "\n"; }
    }
);
Re^3: Scrappy Module
by marto (Cardinal) on May 12, 2011 at 11:48 UTC