in reply to iso-8859-1 code converter
You can still get at kanjiworld.com via the wayback machine, but alas, only a few pages are archieved. Still a complete EUC-JP table is on the internet.
That said, it is not clear to me what you are trying to do, since there is no equivalent in iso-8859-1 for EUC-JP (in bytes, two bytes are just two bytes). There's only a unicode equivalent, which is straight forward to get from the table you refer to. E.g. の as HTML entity is written as の converting the numeric part to hex you get the unicode point: printf "%x", 12398 gives 306e. The perl unicode representation as string would be "\x{306e}" which is stored internally as a sequence of three char values: 227, 129, 174:
use HTML::Entities; use Devel::Peek; $c = decode_entities("の"); Dump $c __END__ SV = PV(0x91ecb00) at 0x91ebcdc REFCNT = 1 FLAGS = (POK,pPOK,UTF8) PV = 0x91ff608 "\343\201\256"\0 [UTF8 "\x{306e}"] CUR = 3 LEN = 8
The EUC-JP encodng for の is the two byte sequence a4ce (ascii 164,206 in decimal), but how is that iso-8859-1? On my terminal that renders as the Euro-Sign+Capital-I-Circumflex. So what is the sought outcome? what are you actually trying to do?
Thinking about it, your "somewhat strange request" sounds like a XY question...
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^2: iso-8859-1 code converter
by GaijinPunch (Pilgrim) on May 05, 2009 at 10:26 UTC | |
by rovf (Priest) on May 05, 2009 at 13:56 UTC | |
by GaijinPunch (Pilgrim) on May 05, 2009 at 15:01 UTC | |
by rovf (Priest) on May 05, 2009 at 16:01 UTC | |
by graff (Chancellor) on May 05, 2009 at 17:49 UTC | |
by GaijinPunch (Pilgrim) on May 06, 2009 at 01:12 UTC | |
by graff (Chancellor) on May 06, 2009 at 03:22 UTC | |
by GaijinPunch (Pilgrim) on May 06, 2009 at 08:26 UTC | |
by graff (Chancellor) on May 06, 2009 at 15:06 UTC | |
by Anonymous Monk on May 06, 2009 at 09:20 UTC | |
by GaijinPunch (Pilgrim) on May 07, 2009 at 00:40 UTC |