was looking for freelance work and came across a request for a script that would scrape yellow pages website. Hopefully the poster will google more before they spend $300 bucks for this...
#!/usr/bin/perl # program: yp.pl # purpose: search for phone number on www.yellowpages.com # date: 2015.01.28 use strict; use warnings; use LWP::Simple; my $usage = "\n\tusage: perl $0 [PHO-NEN-UMBR]\n"; my $baseUri = 'http://www.yellowpages.com/search?search_terms=+'; my $phoneNumber = $ARGV[0]; die $usage unless($phoneNumber =~ m/^\d{3}-?\d{3}-?\d{4}$/); my $content = get("$baseUri$phoneNumber"); $content =~ s/\s+/ /g; $content =~ s/\ / /g; $content =~ s/>\s*</\n/g; my $hits = 0; foreach my $e(split /\n/, $content){ chomp $e; if($e =~ m/^.*itemprop=['|"](\S+)['|"].*>(.*),?<\/\S+.*$/i){ my $itemProp = $1; my $itemValue = $2; $itemValue =~ s/,\s*$//; print "$itemProp: $itemValue\n"; $hits++; } } print "no results found for number '$phoneNumber'\n" if(!$hits); exit;
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re: yellow pages
by Discipulus (Canon) on Jan 29, 2015 at 08:29 UTC | |
by japh2963 (Novice) on Jan 30, 2015 at 00:35 UTC | |
by japh2963 (Novice) on Jan 29, 2015 at 12:46 UTC | |
|
Re: yellow pages
by marto (Cardinal) on Jan 29, 2015 at 09:58 UTC | |
by Anonymous Monk on Jan 29, 2015 at 10:02 UTC | |
by marto (Cardinal) on Jan 29, 2015 at 10:03 UTC | |
by japh2963 (Novice) on Jan 30, 2015 at 00:33 UTC | |
by Discipulus (Canon) on Jan 30, 2015 at 07:41 UTC |