#!/usr/bin/perl -w
#this is fairly silly
use strict;
use LWP::Simple;

my $page = '';
my $url = 'http://www.google.com/search?q=%2B%22all+your%22+%2B%22are+belong+to+us%22+-base+-bases&num=100&hl=en&lr=&safe=off&start=';

for (my $i=0; $i<=900; $i+=100) {
    $page .= get( $url.$i );
}

$page =~ s/<[^>]*?>//gs;
$page =~ s/&[^;]*?;//gs;

my (@nouns) = ($page =~ /all your(.{3,20}?)are belong to us/gsi );

my %unique;
foreach (@nouns) {
    s/^\s*//s;
    s/\s*$//s;
    tr/\n//d;
    $unique{$_}++;
}

print join ', ', sort keys %unique;

Typical output:
BARESIS, BASS, BIOCHEMISTS, BLOCKS, BUMS, Bosch, Buff, CHANNEL, CHAT, DSL, Droga, Elfie, GUITARS, Human Head, Internet, NATION, Notepad, Oses, PEPSI, PHASE PROBLEM, Phone, RAILBAIT, SEXY BARBS, STUPIDNESS, Simpsons, YOLGs, absolutepower, bats, blocks, blue butts, buff, buffs, christmas presents, face, family, fantasy, forums, government, issues, letters, lewtz, link, magic, money, moo, pant, plots, posts, registration data, server, sex toy, sticks, topics, true shot damage

(I was going to sort them by frequency but Google doesn't give a big enough sample)
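
For what it's worth, the frequency sort itself would only be a small change, since the hash already counts how often each noun turns up. A minimal sketch, assuming a made-up @nouns list in place of the scraped one (the sample words and the %count and @by_freq names are illustrative only):

```perl
use strict;
use warnings;

# Hypothetical sample standing in for the @nouns list scraped from Google.
my @nouns = ('money', 'server', 'money', 'blocks', 'money', 'server');

# Count occurrences, the same way the script fills its hash.
my %count;
$count{$_}++ for @nouns;

# Most frequent first; ties are broken alphabetically so the order is stable.
my @by_freq = sort { $count{$b} <=> $count{$a} or $a cmp $b } keys %count;

print join(', ', map { "$_ ($count{$_})" } @by_freq), "\n";
```

With a sample this small the counts mean little, which is the same problem the real data has.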

andy.

Re: All your what, exactly, are belong to us? (apart from 'base')
by tomhukins (Curate) on Mar 02, 2001 at 18:16 UTC

    The problem with your query to Google is that it matches any page containing the phrases "all your" and "are belong to us" anywhere, not necessarily together. A better query would be the single phrase "all your * are belong to us".

    Example:
    http://www.google.com/search?q=%2B%22all+your+*+are+belong+to+us%22+-base+-bases&num=100&hl=en&lr=&safe=off&btnG=Google+Search
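
A rough sketch of how the script could build that query string itself rather than hard-coding the escapes. The hand-rolled encoder and the $query, $escaped and $url names are assumptions for illustration, and nothing here checks how Google actually treats the wildcard:

```perl
use strict;
use warnings;

# One quoted phrase with a wildcard, plus the two exclusions.
my $query = '+"all your * are belong to us" -base -bases';

# Minimal form encoding (an assumption, not a full URI escaper):
# percent-encode everything except letters, digits, space and * _ . -
(my $escaped = $query) =~ s/([^A-Za-z0-9 *_.-])/sprintf('%%%02X', ord $1)/ge;

# Spaces become '+' in a query string.
$escaped =~ tr/ /+/;

# Same parameters as the original script, ready to have the offset appended.
my $url = "http://www.google.com/search?q=$escaped&num=100&hl=en&lr=&safe=off&start=";

print "$url\n";
```

The resulting query part is the same as in the example URL above, so the loop that appends the start offset should work unchanged.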

    I'm unsure why I'm replying to something this pointless, even if it is fun...

Re: All your what, exactly, are belong to us? (apart from 'base')
by EvanK (Chaplain) on Mar 02, 2001 at 23:30 UTC
    Dear god, it's made its way here too. Is no one safe? I pray for humanity.

    ______________________________________________
    When I get a little money, I buy books. If I have any left over, I buy food and clothes.
    -Erasmus