Keep in mind the point of the article: it's impossible to normalize URLs well. They've even missed two items that need normalizing (though they might be covered in the linked spec):
1) The order of the arguments in a GET:
.../script.cgi?a=b&c=d vs
.../script.cgi?c=d&a=b
2) The domain name:
example.com vs
example.com. vs
EXAMPLE.COM
Oh, and IP addresses too (all three of these are the same address):
10.0.0.1 vs
0x0A000001 vs
167772161
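
For what it's worth, here is a rough sketch of how those three cases could be handled. It's in Python purely for illustration, and the function names (normalize_query, normalize_ipv4, normalize_host) are made up for this example, not taken from the article or the spec. It assumes that parameter order never matters to the server, which isn't true for every script, and it only recognizes the three IPv4 spellings listed above.

```python
import ipaddress
from urllib.parse import parse_qsl, urlencode


def normalize_query(query):
    """Sort the query parameters so that argument order no longer matters.

    Assumption: the server treats a=b&c=d and c=d&a=b identically.
    Repeated keys get reordered by value too, which may not be safe.
    """
    pairs = parse_qsl(query, keep_blank_values=True)
    return urlencode(sorted(pairs))


def normalize_ipv4(host):
    """Return the dotted-quad form of an IPv4 host, or None if it isn't one.

    Only handles the three spellings from the post: dotted quad,
    a single hex integer, and a single decimal integer.
    """
    try:
        if host.lower().startswith("0x"):
            return str(ipaddress.IPv4Address(int(host, 16)))
        if host.isdigit():
            return str(ipaddress.IPv4Address(int(host)))
        return str(ipaddress.IPv4Address(host))
    except ValueError:
        # Not a number, out of range, or not a dotted quad.
        return None


def normalize_host(host):
    """Lowercase the host, drop one trailing root dot, canonicalize IPv4."""
    host = host.lower()
    if host.endswith("."):
        host = host[:-1]
    return normalize_ipv4(host) or host


print(normalize_query("c=d&a=b"))
print(normalize_host("EXAMPLE.COM."))
print(normalize_host("0x0A000001"))
print(normalize_host("167772161"))
```

Even this much leaves plenty out (octal and partial dotted forms, IPv6, internationalized domain names), which rather supports the article's point.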
In reply to Re: Normalizing URLs
by ikegami
in thread Normalizing URLs
by Anonymous Monk