in reply to Re: How to scraper ASP websites
in thread How to scraper ASP websites
Please have a look at this code, check the link, and tell me how I can scrape the details from that page. Regards

    #!/usr/bin/perl
    use strict;
    use warnings;
    use WWW::Mechanize;

    my $mech = WWW::Mechanize->new();
    my $url  = 'http://www.folkeferie.dk/da/ferier/Aktuelle-chartertilbud---afbudsrejser/';
    $mech->get($url);

    my $hsh     = {};
    my $airport = '';    # never populated in this script

    # Collect every "Afbudsrejser" link that points at a textPage.aspx page.
    my $links = $mech->find_all_links(
        url_regex  => qr/templates\/textPage\.aspx\?id/i,
        text_regex => qr/Afbudsrejser/i,
    );

    foreach my $link (@$links) {
        $url = $link->url_abs();
        $mech->get($url);
        my $content = $mech->content();

        # The pattern contains literal spaces that must match, so /x is not used.
        while ( $content =~ /tr class="bgrow1"><td>(.*?)<\/td><td class="countryValue">(.*?)<\/td><td class="destnameValue">(.*?)<\/td><td class="hotelNameValue">(.*?)<\/td><td class="durationValue">(.*?)<\/td><td align="RIGHT" class="priceValue"><a target="_blank" href="(.*?)">(.*?)<\/a><\/td>/gis ) {
            $hsh->{'url'}         = $6;
            $hsh->{'crap_id'}     = '';
            $hsh->{'date'}        = $1;
            $hsh->{'country'}     = $2;
            $hsh->{'destination'} = $3;
            $hsh->{'trip_type'}   = $4;
            $hsh->{'trip_length'} = $5;
            $hsh->{'price'}       = $7;
            print "$hsh->{'date'}, $hsh->{'country'}, $hsh->{'destination'}, $hsh->{'trip_type'}, $hsh->{'trip_length'}, $hsh->{'price'}, $hsh->{'crap_id'}, $hsh->{'url'}, $airport\n\n";
        }
    }
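For what it's worth, here is a minimal sketch of just the row-extraction step, run against an inline sample instead of the live site. It assumes the rows really look like the pattern in the script above; the sample HTML, the example.com URLs and the `extract_rows` helper are all made up for illustration. It uses core Perl only. Because literal spaces in a pattern are ignored under `/x`, they are written as `\s+` here, and each row goes into its own hash.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical sample markup modelled on the row pattern in the script above;
# the real page may well differ.
my $html = <<'HTML';
<table>
<tr class="bgrow1"><td>06-09-2012</td><td class="countryValue">Spanien</td><td class="destnameValue">Mallorca</td><td class="hotelNameValue">Hotel A</td><td class="durationValue">7</td><td align="RIGHT" class="priceValue"><a target="_blank" href="http://example.com/offer?id=1">1995</a></td></tr>
<tr class="bgrow1"><td>07-09-2012</td><td class="countryValue">Tyrkiet</td><td class="destnameValue">Alanya</td><td class="hotelNameValue">Hotel B</td><td class="durationValue">14</td><td align="RIGHT" class="priceValue"><a target="_blank" href="http://example.com/offer?id=2">2495</a></td></tr>
</table>
HTML

# Return one hash reference per matching table row (hypothetical helper).
sub extract_rows {
    my ($content) = @_;
    my @rows;
    while (
        $content =~ m{
            <tr\s+class="bgrow1">                  \s*
            <td>(.*?)</td>                         \s*
            <td\s+class="countryValue">(.*?)</td>  \s*
            <td\s+class="destnameValue">(.*?)</td> \s*
            <td\s+class="hotelNameValue">(.*?)</td>\s*
            <td\s+class="durationValue">(.*?)</td> \s*
            <td\s+align="RIGHT"\s+class="priceValue">
            <a\s+target="_blank"\s+href="(.*?)">(.*?)</a></td>
        }gisx
      )
    {
        # Same key names as the original script.
        push @rows, {
            date        => $1,
            country     => $2,
            destination => $3,
            trip_type   => $4,
            trip_length => $5,
            url         => $6,
            price       => $7,
        };
    }
    return @rows;
}

my @fields = qw(date country destination trip_type trip_length price url);
for my $row ( extract_rows($html) ) {
    print join( ', ', @{$row}{@fields} ), "\n";
}
```

On real pages a regex like this tends to break as soon as the markup changes, so an HTML parser from CPAN (HTML::TreeBuilder or HTML::TableExtract, for example) is probably the sturdier choice.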
Replies are listed 'Best First'.

Re: need help in scrapping asp site
by davido (Cardinal) on Sep 06, 2012 at 06:42 UTC

Re: need help in scrapping asp site
by 2teez (Vicar) on Sep 06, 2012 at 05:22 UTC

Re: need help in scrapping asp site
by Anonymous Monk on Sep 06, 2012 at 06:56 UTC
by Athanasius (Archbishop) on Sep 06, 2012 at 07:31 UTC
by Anonymous Monk on Sep 06, 2012 at 07:44 UTC
by Corion (Patriarch) on Sep 06, 2012 at 07:57 UTC
by Anonymous Monk on Sep 06, 2012 at 08:05 UTC
by marto (Cardinal) on Sep 06, 2012 at 08:42 UTC
by Anonymous Monk on Sep 06, 2012 at 18:53 UTC