I have a simple program to check the validity of links before submitting them to search engines (code below.)
This code declares that "incomplete" URLS, like
http://www.mysite.com/mydir are no good -- when, in fact, they work just fine in a browser (and when submited to search engines). The "problem" with these URLs is that the full path all the way down to file isn't explicitly specified, I think.
My question is how do I get LWP useragent to act like a browser and find the default page in a directory?
thanks
nop
package MyUA;
use base qw(LWP::UserAgent);
use strict;
use CGI qw/:standard/;
sub redirect_ok {1};
sub new {
my $class = shift;
my $self = new LWP::UserAgent;
bless($self, $class);
return $self;
}
sub validURL {
my ($self, $url) = @_;
my $req = new HTTP::Request POST => $url;
my $res = $self->request($req);
my $content = $res->content;
return 0 unless $res->is_success;
return 0 if $content =~ /the page you have requested cannot be fou
+nd/i;
return 1;
}
1;
Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
Read Where should I post X? if you're not absolutely sure you're posting in the right place.
Please read these before you post! —
Posts may use any of the Perl Monks Approved HTML tags:
- a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
| |
For: |
|
Use: |
| & | | & |
| < | | < |
| > | | > |
| [ | | [ |
| ] | | ] |
Link using PerlMonks shortcuts! What shortcuts can I use for linking?
See Writeup Formatting Tips and other pages linked from there for more info.