in reply to Fixing suspect characters in HTML
But then I tried it out, adding "Ñ" and "Ò" to the test string, and they magically came out right:
which looks like what you would want to get.... my $str = join('', chr(0x93), 'double', chr(0x94), ' Ñ Ò ', chr(0x201C), 'double', chr(0x201D), '‘single’' ); ... output: “double” Ñ Ò “double”‘sing +le’
Update: based on your reply, I figured it might make sense to try numeric character entities above 0xff -- e.g. Ǒ and ǒ (when converted to utf8, these have 0x91 and 0x92 as the second byte). It still works the way you would want, converting them correctly to hex-coded numeric entities (Ǒ and ǒ, upper and lower case letter o with caron, respectively).
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^2: Fixing suspect characters in HTML
by wfsp (Abbot) on Jul 30, 2005 at 16:18 UTC |