I didn't use TableExtract on this site, but it does come up frequently. Had I gone with the runner-up: http://www.brainyquote.com/quotes/keywords/gratitude.html, then we would be heading down that road. I'm curious how similar a task it is to get the content from this link as opposed to the one in the original post. Now that I see http://www.brainyquote.com/quotes_of_the_day.html, I wonder if it may not have been my best option.
As it stands, I'm using HTML::TreeBuilder, and I'm getting nothing:
C:\cygwin64\home\Fred\pages2\list>perl scraper1.pl nix! C:\cygwin64\home\Fred\pages2\list>type scraper1.pl #! /usr/bin/perl use warnings; use strict; use 5.010; use HTML::TreeBuilder 5 -weak; my $site = 'http://www.fourmilab.ch/yoursky/cities.html'; my $tree = HTML::TreeBuilder->new_from_url($site); foreach my $e ($tree->look_down(_tag => 'div')) { foreach my $f ($e->look_down(_tag => 'p')) { say $f->as_text; } } say "nix!"; C:\cygwin64\home\Fred\pages2\list>
In reply to Re^2: regex to get random quote
by Aldebaran
in thread regex to get random quote
by Aldebaran
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |