in reply to Re^2: Windows-1252 characters from \x{0080} thru \x{009f} (source-code encoding)
in thread Windows-1252 characters from \x{0080} thru \x{009f}
By now, Perl 5 should also be defaulting to Windows-1252 instead of to ISO 8859-1 (Latin 1)
I don't know of a single place where Perl assumes iso-8859-1.
There are many places where Perl requires strings of Unicode code points. (In the above program, those would be the match operator and the encoder.) Since the strings passed to those were created by assigning each byte to a character, each byte is taken to be a Unicode code point. Not an iso-8859-1 character.
This makes it *look* like Perl defaults to iso-8859-1, but there is no "default" since there is only ever one thing those functions can accept. Because there is no default, it also means the default cannot be changed, to cp1252 or anything else.
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^4: Windows-1252 characters from \x{0080} thru \x{009f} (source-code encoding)
by moritz (Cardinal) on Apr 19, 2012 at 07:19 UTC | |
by ikegami (Patriarch) on Apr 19, 2012 at 16:56 UTC | |
by moritz (Cardinal) on Apr 19, 2012 at 19:35 UTC | |
by BrowserUk (Patriarch) on Apr 19, 2012 at 22:33 UTC | |
|
Re^4: Windows-1252 characters from \x{0080} thru \x{009f} (source-code encoding)
by Jim (Curate) on Apr 19, 2012 at 17:09 UTC | |
by ikegami (Patriarch) on Apr 24, 2012 at 02:16 UTC |