amonroy has asked for the wisdom of the Perl Monks concerning the following question:
Sorry if this is off-topic, but I need some advice from people who have worked with UTF-8.
Do you know why Firefox (so far I have tested under Windows, Linux and MacOS X) does not show an Ö when I send the following data to the browser:
print $cgi->header(-charset => "utf-8"), "\x{4F}\x{CC}\x{88}";
Firefox shows an O followed by the dieresis instead of a "clean" O with dieresis. Other browsers such as IE (under MacOS and WinXP) and Safari handle this correctly.
Update
The sequence \x{4F}\x{CC}\x{88} represents the Unicode LATIN CAPITAL LETTER O followed by a DIAERESIS, which in theory should be displayed just like unicode LATIN CAPITAL LETTER O WITH DIAERESIS (\x{C3}\x{96}).
But I found that Firefox might have a bug displaying Unicode canonical equivalents.
Now, I have a less off-topic question: How can I convert the Unicode LATIN CAPITAL LETTER O followed by a DIAERESIS to LATIN CAPITAL LETTER O WITH DIAERESIS. Or in hexadecimal terms, how can I convert \x{4F}\x{CC}\x{88} to \x{C3}\x{96}?
I tried using all the normalization forms of Unicode::Normalizer, but no luck.
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re: UTF-8 and browsers - Update
by theorbtwo (Prior) on Feb 14, 2005 at 06:35 UTC | |
|
Re: UTF-8 and browsers - Update
by dakkar (Hermit) on Feb 14, 2005 at 12:39 UTC |