Re: Character encoding fun...

Replies are listed 'Best First'.
Re^2: Character encoding fun... by joem (Initiate) on Nov 15, 2007 at 20:41 UTC
Hello, Thanks for the quick response. I though that's what I wanted though when I do that I get: `Cannot decode string with wide characters at C:/Perl588/lib/Encode.pm +line 166.` [download] which is why it's turned around. Joe	[reply] [d/l]
Re^3: Character encoding fun... by pc88mxer (Vicar) on Nov 15, 2007 at 20:48 UTC
Your problem is that `$my_utf_data` contains code points (numbers representing Unicode characters), not octets (i.e. bytes). If `$my_utf_data` really contains bytes, no character in that string should be > 255. The error message you are getting indicates that there are characters > 255 in your string. If `$my_utf_data` is really text (i.e. consists of code points), then all you need is the call to `encode` to get a cp1252 encoded stream of bytes: `encode('cp1252', $my_utf_data)` [download]	[reply] [d/l] [select]
Re^4: Character encoding fun... by joem (Initiate) on Nov 15, 2007 at 21:02 UTC
So the way is starting to seem a little clearer. To take the following path UTF_String -> CP1252 data -> UTF_String Source -> Storage -> Display I will need to `encode('cp1252', $my_utf_data) -> store -> decode('utf8', $retrieved_d +ata)` [download] though that does not provide any results per se. I must admit I am very new in the character encoding scene, It would be nice to be able to just manipulate the strings as byte arrays for storage and retrieval though I'm not sure how to accomplish that in perl :( Thanks, Joe	[reply] [d/l]
Re^5: Character encoding fun... by graff (Chancellor) on Nov 16, 2007 at 02:54 UTC