saintmike has asked for the wisdom of the Perl Monks concerning the following question:
Parsing of undecoded UTF-8 will give garbage when decoding entities at .../LWP/Protocol.pm line xx.The cause seems to be HTML::Parser (or better HTML::HeadParser) which assumes non-UTF8, but sees something that looks like UTF8.
It's a warning, so I guess I could just suppress it, but I wanted to know if you guys had seen this before and have figured out an elegant way to deal with it.
As explained here, parse_head() is necessary to deal with some oddball webservers, otherwise I could turn it off in LWP::UserAgent's constructor.
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re: Parsing of undecoded UTF-8 will give garbage
by kettle (Beadle) on Aug 03, 2006 at 02:06 UTC | |
by saintmike (Vicar) on Aug 03, 2006 at 06:23 UTC |