in reply to Re^5: Malformed UTF-8
in thread Malformed UTF-8
It appears $term is not actually UTF-8 encoded when this occurs:

TOKEN:
SV = PVMG(0x1c6cca0) at 0x1ab6a28
  REFCNT = 1
  FLAGS = (PADBUSY,PADMY,POK,pPOK,UTF8)
  IV = 0
  NV = 0
  PV = 0x2dd2f80 "ba\303\261o"\0 [UTF8 "ba\x{f1}o"]
  CUR = 5
  LEN = 15
  MAGIC = 0x2dc95f0
    MG_VIRTUAL = &PL_vtbl_utf8
    MG_TYPE = PERL_MAGIC_utf8(w)
    MG_LEN = 4
--------------------
TERM:
SV = PVIV(0x18b8e20) at 0x1ab69c8
  REFCNT = 1
  FLAGS = (PADBUSY,PADMY,POK,pPOK)
  IV = 0
  PV = 0x2dd3110 "ba\303\261o"\0
  CUR = 5
  LEN = 12
--------------------

Additionally, is it me or does [UTF8 "ba\x{f1}o"] look wrong? From what I recall, the UTF8 part of Dump should show the actual word, meaning baño (with the accented n), and not the encoding.
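For reference, here is a minimal sketch of how two scalars like the ones dumped above can come about. It assumes $term holds the raw UTF-8 bytes and $token is the result of decoding them; the variable names are borrowed from the dump labels, and decode() is only a guess at how the flagged string was produced in the real code.

```perl
use strict;
use warnings;
use Encode qw(decode encode);

# Assumption: $term holds the raw UTF-8 bytes (no UTF8 flag), as in the TERM dump.
my $term = "ba\xc3\xb1o";

# Assumption: $token is the decoded character string (UTF8 flag on), as in the TOKEN dump.
my $token = decode('UTF-8', $term);

# Same internal bytes, but different lengths and flags.
printf "term:  length=%d utf8=%d\n", length($term),  utf8::is_utf8($term)  ? 1 : 0;
printf "token: length=%d utf8=%d\n", length($token), utf8::is_utf8($token) ? 1 : 0;

# Compared as strings they differ, because the undecoded bytes are read as
# five separate characters while the decoded string has four.
print "equal as strings:   ", ($term eq $token ? "yes" : "no"), "\n";

# Encoding the decoded string gives back the original bytes.
print "equal after encode: ", (encode('UTF-8', $token) eq $term ? "yes" : "no"), "\n";
```

In this sketch the undecoded string has length 5 and the decoded one has length 4, which lines up with CUR = 5 and MG_LEN = 4 in the dump. Decoding $term on input, or encoding $token before comparing, would put both on the same footing.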