in reply to Unzip help needed (seems like a code page issue).

You're seeing the following swaps:
"è" is U+00E8 ⇒ "Š" is U+0160
" " is U+00A0* ⇒ "ÿ" is U+00FF

I found that, using encode and decode from the Encode module:
decode('cp1252', encode('cp437', chr(0x00E8))) eq chr(0x0160)
decode('cp1252', encode('cp437', chr(0x00A0))) eq chr(0x00FF)
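
For anyone who wants to run that check, here is a minimal standalone sketch of the two expressions above. The helper name cp437_read_as_cp1252 is made up for illustration; only Encode's encode and decode are real.

```perl
use strict;
use warnings;
use Encode qw( encode decode );

# Encode a character as cp437 bytes, then (mis)read those bytes as cp1252.
# The sub name is hypothetical; it just wraps the expression from the post.
sub cp437_read_as_cp1252 {
    my ($char) = @_;
    return decode('cp1252', encode('cp437', $char));
}

# The two characters from the observed swaps.
for my $cp (0x00E8, 0x00A0) {
    my $got = cp437_read_as_cp1252(chr($cp));
    printf "U+%04X => U+%04X\n", $cp, ord($got);
}
# U+00E8 => U+0160
# U+00A0 => U+00FF
```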

This is the only match I found, but the search was not exhaustive: I looked at UTF-8, UCS-2LE, iso-8859-* and only a few code pages.
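
A search like that can be sketched as a loop over pairs of encodings, keeping the pairs that reproduce both swaps. This is not the script actually used for the search. The candidate list and the name explains_swaps are assumptions for illustration, and a longer list may well report more than one pair, since related code pages share byte positions.

```perl
use strict;
use warnings;
use Encode qw( encode decode );

# Observed swaps: original code point => code point that showed up instead.
my %swaps = ( 0x00E8 => 0x0160, 0x00A0 => 0x00FF );

# Illustrative, non-exhaustive candidate list (an assumption, not the
# list that was actually searched).
my @candidates = qw( cp437 cp850 cp1250 cp1252 iso-8859-1 iso-8859-15 UTF-8 UTF-16LE );

# True if writing in $written_as and reading in $read_as gives every swap.
sub explains_swaps {
    my ($written_as, $read_as) = @_;
    my $check = Encode::FB_CROAK() | Encode::LEAVE_SRC();
    for my $from (keys %swaps) {
        my $char  = chr($from);
        # eval traps characters or bytes the encoding cannot handle.
        my $bytes = eval { encode($written_as, $char, $check) };
        return 0 if !defined $bytes;
        my $seen  = eval { decode($read_as, $bytes, $check) };
        return 0 if !defined $seen || $seen ne chr($swaps{$from});
    }
    return 1;
}

for my $w (@candidates) {
    for my $r (@candidates) {
        next if $w eq $r;
        print "written as $w, read as $r\n" if explains_swaps($w, $r);
    }
}
```

Any pair it prints is only a candidate explanation. It says nothing about which program did the writing or the reading.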

That explains what is happening. It doesn't determine who is doing what and who is to blame, but it's a start.

* — Well, it could be something other than U+00A0, but it would be a mighty big coincidence.
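
If the names really were written as cp437 and read as cp1252, the damage can be undone by applying the two steps in the opposite direction. Below is a minimal sketch under that assumption. The name repair_name is hypothetical, the input must be a decoded character string rather than raw bytes, and it does not cover characters that fail to round-trip between the two code pages.

```perl
use strict;
use warnings;
use Encode qw( encode decode );

# Undo "cp437 bytes read as cp1252": turn the characters back into the
# cp1252 bytes they came from, then decode those bytes as cp437.
# Hypothetical helper; assumes $mangled is a character string.
sub repair_name {
    my ($mangled) = @_;
    return decode('cp437', encode('cp1252', $mangled));
}

# U+0160 was the wrong character seen in place of U+00E8.
printf "U+%04X\n", ord(repair_name(chr(0x0160)));   # U+00E8
```

This only patches names after the fact. Decoding the names with the right code page at the point where they are first read would be the cleaner fix, once it is clear which program is doing the misreading.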

Re^2: Unzip help needed.
by nlevesque (Initiate) on Jul 28, 2009 at 15:32 UTC
    Thanks! It is indeed a start, and it feels like something really simple to change, yet I still haven't figured out how to make the change needed to avoid the issue.
      Does anyone have any idea? I would like to avoid relying on third-party software such as "Command line Winzip" when it can be done in Perl. Thanks!