in reply to Parsing UTF-8 characters (Å is changed to Ã)
It's probably a case of double-encoding.
"Å" encoded using UTF-8 and then encoded using UTF-8 a second time would appear as "Ã" (followed by a control character) on a terminal expecting UTF-8.
For example, the following produces something that looks like "Ã□" on my terminal:
    perl -e'
       use open ":std", ":encoding(UTF-8)";
       use utf8;
       use feature qw( say );
       $_ = "Å";
       utf8::encode($_);  # XXX Bug. Already handled by ":encoding".
       say;
    '
In context, this would indicate that $item->{ows_Justification} contains text encoded using UTF-8 rather than decoded text as one would expect.
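If that's what is happening, decoding the value once before use should fix it. Here is a minimal sketch, assuming the field really does hold UTF-8 bytes; $bytes is a hypothetical stand-in for $item->{ows_Justification}, since I can't see how that value is actually populated:

```perl
use strict;
use warnings;
use Encode qw( decode encode );

# Hypothetical stand-in for $item->{ows_Justification}: the UTF-8 bytes
# of "Å" (U+00C5), i.e. text that was encoded but never decoded.
my $bytes = "\xC3\x85";

# The bug: encoding again treats each byte as a character
# (U+00C3 "Ã" and the control character U+0085).
my $double = encode('UTF-8', $bytes);
printf "double-encoded: %vX\n", $double;   # C3.83.C2.85

# The fix: decode once, so the scalar holds the single character U+00C5.
my $text = decode('UTF-8', $bytes);
printf "decoded: U+%04X (length %d)\n", ord($text), length($text);
```

With the text decoded, an output layer such as ":encoding(UTF-8)" encodes it exactly once on the way out.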