in reply to Re^2: Windows-1252 characters from \x{0080} thru \x{009f} (source-code encoding)
in thread Windows-1252 characters from \x{0080} thru \x{009f}

Perl 5 should also be defaulting to Windows-1252 instead of to ISO 8859-1 (Latin 1).

I really hope this will NEVER happen, not even on a Windows platform. cp1252 is only "default" on Windows, it is not the default on any other platform and changing perl5's default to cp1252 would break every script that assumes the current default (wise or not).

Most perl scripts are cross-platform portable, at least they can be when the programmer follows the basic porting rules. Most of my scripts and modules are cross platform, and I do test my modules on HP-UX, Linux, AIX and Windows (and sometimes even on OSX when I can access such architecture).

That said, the default IMHO is likely to change for Windows. If not in Windows 8 (or whatever they will call it) then maybe Windows 9 or 10 will have Unicode as default character set. Problem solved. I already use UTF-8 as default encoding on all my browsers (Opera, Firefox, Konqueror, Opera Mobile) and IRC.

My advise to you would be to switch to using utf-8 (and declare 'use utf8;' next to use strict; and use warnings; in the head of your scripts when you do.


Enjoy, Have FUN! H.Merijn

Replies are listed 'Best First'.
Re^4: Windows-1252 characters from \x{0080} thru \x{009f} (source-code encoding)
by Jim (Curate) on Apr 19, 2012 at 17:44 UTC

    Every one of your browsers—Opera, Firefox, Konqueror and Opera Mobile—will default to the Windows-1252 character encoding if wrongly told by the creator of the HTML document that the text of the document is in the Latin‑1 (ISO 8859‑1) character encoding. It was exactly this behavior of these popular web browsers that influenced the W3C to standardize this practice in its specification of HTML5—a willful violation of existing standards.

    How many existing cross-platform Perl scripts treat characters in the range from 80 thru 9F as ISO 8859‑1 control codes? Maybe lots of them do. I don't know.

    Jim

Re^4: Windows-1252 characters from \x{0080} thru \x{009f} (source-code encoding)
by Anonymous Monk on May 22, 2012 at 22:14 UTC

    Most of my scripts and modules are cross platform

    Unicode::Tussle doesn't appear to be runnable on win32 :)

      Well, that is not one of my modules or scripts, nor does it depend on any of my modules or scripts. What point do you want to make here?


      Enjoy, Have FUN! H.Merijn

        Well, that is not one of my modules or scripts, nor does it depend on any of my modules or scripts. What point do you want to make here?

        Well obviously I've confused you with tchrist, so yeah, I have no point