in reply to Unicode to HTML code &#....;
I think your method (in your later reply) of replacing each "\x{FFFD}" with a space is as good as any, though an ellipsis character ("\x{2026}", or "\x{22ef}" for the midline form) might be more appropriate. (U+FFFD is the Unicode replacement character, which gets inserted wherever a byte sequence cannot be interpreted.)
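For what it's worth, here is a minimal sketch of that substitution, not a drop-in fix for your script. It assumes the input arrives as UTF-8 bytes and is decoded with the core Encode module, which by default turns each malformed sequence into U+FFFD. The sample string and the variable names are made up for illustration, and the final entity-encoding step is only my guess at what the thread title is asking for.

```perl
use strict;
use warnings;
use Encode qw(decode);

# Hypothetical input: UTF-8 bytes containing one stray 0xFF byte,
# which can never appear in valid UTF-8.
my $bytes = "caf\xC3\xA9 \xFF menu";

# With the default CHECK setting, decode() substitutes U+FFFD
# for each malformed sequence instead of dying.
my $text = decode('UTF-8', $bytes);

# Swap every replacement character for a horizontal ellipsis (U+2026).
# Use ' ' on the right-hand side instead if a plain space is preferred.
(my $clean = $text) =~ s/\x{FFFD}/\x{2026}/g;

# Emit each non-ASCII character as an HTML numeric character reference.
(my $html = $clean) =~ s/([^\x00-\x7F])/sprintf('&#%d;', ord $1)/ge;

print "$html\n";
```

Doing the replacement before the entity conversion means the ellipsis goes out as an ordinary numeric reference like every other non-ASCII character, so nothing downstream needs to know that a bad byte was ever there.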
IMHO, anyone who goes to the trouble of creating a "name" that contains both Latin-based (left-to-right) and Arabic-based (right-to-left) characters in a single word token is most likely trying to make trouble, and should expect (presumably wants) to see things go wrong.
Re^2: Unicode to HTML code &#....;
by Forlix (Novice) on Nov 16, 2008 at 01:49 UTC