in reply to how to parse english-chinese fixed length data records in perl 5.6
I've had partial success recreating UTF-8 strings from a series of bytes by using pack/unpack with the U template, though if I remember correctly there were still some glitches I encountered with this approach, especially under 5.6.0 (5.6.1 was a bit better).
Perl 5.8 is supposed to have much improved Unicode support, and if that's an option for you it might be worth investigating. (Sorry, I don't have any firsthand experience with it yet.)
|
|---|