Having has such success in the past I am returning to the Monastary for more help from the Perl Monks! Please bear in mind that I am a relative beginner here ... but here is my problem:
We have to do a fair bit of post processing of HTML and javascript files and all has worked fine until we started handling Japanese files - now the text is getting corrupted because perl is handling the text byte by byte. Is there a (relatively) easy way to tell perl that the text is multibyte and therefore look character by character and not byte by byte? So that something like:
?? I need it to know that the \x5C is the second byte in a multibyte character and not an escaped space.
Thank you for any help you can offer.