in reply to Re^2: Reading in utf-8 txt file gives garbled data when printed as part of utf-8 html...
in thread Reading in utf-8 txt file gives garbled data when printed as part of utf-8 html...
f\303\266\303\266\n is UTF-8 encoded: \303\266 is the two-octet UTF-8 sequence for U+00F6.
If it's a string of characters (the UTF-8 flag is set), you'll get UTF-8 when you print it to a UTF-8 filehandle.
If it's a string of octets (the UTF-8 flag is clear), you'll get UTF-8 when you print it to a raw filehandle, because the octets are written unchanged.
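
A minimal sketch of those two cases, for anyone who wants to check them. It writes to in-memory filehandles so the octets that come out can be compared directly. The helper octets_written and the variable names are made up for this illustration, and decode() from the core Encode module is just one way to get a character string with the UTF-8 flag set.

```perl
use strict;
use warnings;
use Encode qw(decode);

# Hypothetical helper: print $str to an in-memory filehandle opened with
# the given layer, then return the octets that ended up in the buffer.
sub octets_written {
    my ($layer, $str) = @_;
    my $buf = '';
    open(my $fh, ">$layer", \$buf) or die "open: $!";
    print {$fh} $str;
    close($fh) or die "close: $!";
    return $buf;
}

my $octets = "f\303\266\303\266\n";      # 6 octets, UTF-8 flag clear
my $chars  = decode('UTF-8', $octets);   # 4 characters, UTF-8 flag set

# Characters to a UTF-8 filehandle, octets to a raw filehandle.
my $from_chars  = octets_written(':encoding(UTF-8)', $chars);
my $from_octets = octets_written(':raw', $octets);

print "same octets either way\n" if $from_chars eq $from_octets;
```

Both paths should produce the same six octets. Crossing them (octets to the UTF-8 filehandle) is what gives double-encoded output.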
f\x{f6}\x{f6}\n is iso-latin-1 (ISO-8859-1) encoded.
When you print it to a UTF-8 filehandle, Perl will treat each octet as an iso-latin-1 character and convert it to UTF-8.
When you print to a raw filehandle, you'll get those exact octets.
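
The iso-latin-1 case can be sketched the same way. This is only an illustration under the same assumptions (in-memory filehandles, made-up variable names); it dumps the octets each filehandle receives as hex.

```perl
use strict;
use warnings;

my $latin1 = "f\x{f6}\x{f6}\n";   # 4 octets: 66 F6 F6 0A, UTF-8 flag clear

# UTF-8 filehandle: each octet is taken as an iso-latin-1 character
# and encoded as UTF-8 on output.
my $via_utf8 = '';
open(my $utf8_fh, '>:encoding(UTF-8)', \$via_utf8) or die "open: $!";
print {$utf8_fh} $latin1;
close($utf8_fh) or die "close: $!";

# Raw filehandle: the octets pass through unchanged.
my $via_raw = '';
open(my $raw_fh, '>:raw', \$via_raw) or die "open: $!";
print {$raw_fh} $latin1;
close($raw_fh) or die "close: $!";

# Hex dump of what each filehandle received.
printf "utf8 handle: %s\n",
    join(' ', map { sprintf('%02X', ord($_)) } split(//, $via_utf8));
printf "raw handle:  %s\n",
    join(' ', map { sprintf('%02X', ord($_)) } split(//, $via_raw));
# utf8 handle: 66 C3 B6 C3 B6 0A
# raw handle:  66 F6 F6 0A
```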
Re^4: Reading in utf-8 txt file gives garbled data when printed as part of utf-8 html...
by isync (Hermit) on Aug 28, 2007 at 09:54 UTC
by Anonymous Monk on Apr 21, 2009 at 23:49 UTC
by ikegami (Patriarch) on Apr 22, 2009 at 00:19 UTC