was looking for freelance work and came across a request for a script that would scrape yellow pages website. Hopefully the poster will google more before they spend $300 bucks for this...

 

#!/usr/bin/perl # program: yp.pl # purpose: search for phone number on www.yellowpages.com # date: 2015.01.28 use strict; use warnings; use LWP::Simple; my $usage = "\n\tusage: perl $0 [PHO-NEN-UMBR]\n"; my $baseUri = 'http://www.yellowpages.com/search?search_terms=+'; my $phoneNumber = $ARGV[0]; die $usage unless($phoneNumber =~ m/^\d{3}-?\d{3}-?\d{4}$/); my $content = get("$baseUri$phoneNumber"); $content =~ s/\s+/ /g; $content =~ s/\&nbsp;/ /g; $content =~ s/>\s*</\n/g; my $hits = 0; foreach my $e(split /\n/, $content){ chomp $e; if($e =~ m/^.*itemprop=['|"](\S+)['|"].*>(.*),?<\/\S+.*$/i){ my $itemProp = $1; my $itemValue = $2; $itemValue =~ s/,\s*$//; print "$itemProp: $itemValue\n"; $hits++; } } print "no results found for number '$phoneNumber'\n" if(!$hits); exit;

Replies are listed 'Best First'.
Re: yellow pages
by Discipulus (Canon) on Jan 29, 2015 at 08:29 UTC
    Hello, it seems i'm not able to run your program and getting results (even adding warnings..), but..
    ..i was able to automatically create a chrome extension from your used URL using this quick hack:
    perl chrome-ext-gen.pl yellowp http://www.yellowpages.com/search?searc +h_terms=+

    L*
    There are no rules, there are no thumbs..
    Reinvent the wheel, then learn The Wheel; may be one day you reinvent one of THE WHEELS.
      wasn't able to get your script to work yet, but looks interesting. have spent more time going through all the links you've posted on your profile. lots of great stuff! thanks
      nice
Re: yellow pages
by marto (Cardinal) on Jan 29, 2015 at 09:58 UTC

    "Hopefully the poster will google more before they spend $300 bucks for this..."

    So, this is a service which charges and you encourage others to automate their way around this cost? Will you complain if they go out of business?

    Update: Ignore this.

      um, $300 bucks for the program to scrape yellow pages ... yellow pages doesn't offer some service costing $300

        Ah, thanks for the clarification.