in reply to Explicit charset confuses WWW::Mechanize and/or HTTP::Response
Finally, I determined that the HTML page is being sent by the webserver using iso-8859-1 (latin-1), not utf-8.
The problem is that <meta> tag is lying about the encoding of the page (it sais utf-8), and I think that HTTP::Response is decoding the contents based on it and WWW::Mechanize receives that corrupted data, that saves with a wide character warning.
As I cannot change anything from the remote server, how can I handle this? Is there a way to stop the automagic decoding done by modules and then process that data myself?
Thanks...
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^2: Explicit charset confuses WWW::Mechanize and/or HTTP::Response
by Polyglot (Chaplain) on May 16, 2009 at 14:49 UTC | |
by vitoco (Hermit) on May 18, 2009 at 22:30 UTC |