or download this
For original character sequences that contain non-ASCII characters,
however, the situation is more difficult. Internet protocols that
transmit octet sequences intended to represent character sequences
...
It is expected that a systematic treatment of character encoding
within URI will be developed as a future modification of this
specification.