For the former issue (CAPS in URL), URI's SYNOPSIS has this:
$u5 = URI->new("HTTP://WWW.perl.com:80")->canonical;
...and from the canonical() method's documentation:
" Returns a normalized version of the URI. The rules for normalization are scheme-dependent. They usually involve lowercasing the scheme and Internet host name components, removing the explicit port specification if it matches the default port, uppercasing all escape sequences, and unescaping octets that can be better represented as plain characters. For efficiency reasons, if the $uri is already in normalized form, then a reference to it is returned instead of a copy."
I'm not sure about the http:// issue as I don't web-scrape often at all, but perhaps perusing the documentation in the link above will prove fruitful (scheme() method looks promising). That, or perhaps the docs of URI::Find has something.
In reply to Re: Detecting URLs with URI
by stevieb
in thread Detecting URLs with URI
by htmanning
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |