Sorry if this is off-topic, but I need some advice from people who have worked with UTF-8.
Do you know why Firefox (so far I have tested under Windows, Linux and MacOS X) does not show an Ö when I send the following data to the browser:
print $cgi->header(-charset => "utf-8"), "\x{4F}\x{CC}\x{88}";
Firefox shows an O followed by the dieresis instead of a "clean" O with dieresis. Other browsers such as IE (under MacOS and WinXP) and Safari handle this correctly.
Update
The sequence \x{4F}\x{CC}\x{88} represents the Unicode LATIN CAPITAL LETTER O followed by a DIAERESIS, which in theory should be displayed just like unicode LATIN CAPITAL LETTER O WITH DIAERESIS (\x{C3}\x{96}).
But I found that Firefox might have a bug displaying Unicode canonical equivalents.
Now, I have a less off-topic question: How can I convert the Unicode LATIN CAPITAL LETTER O followed by a DIAERESIS to LATIN CAPITAL LETTER O WITH DIAERESIS. Or in hexadecimal terms, how can I convert \x{4F}\x{CC}\x{88} to \x{C3}\x{96}?
I tried using all the normalization forms of Unicode::Normalizer, but no luck.
In reply to UTF-8 and browsers - Update by amonroy
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |