Things get even more interesting if you try to use filenames containing invalid UTF-8 sequences on various filesystems:
"Invalid-UTF8 vs the filesystem" by Kristian Köhntopp. In summary:
- XFS on Linux and ext4 on Linux don't care at all. Filenames are just bytes.
- ZFS on Linux refuses filenames containing invalid UTF-8 sequences.
- APFS on macOS Ventura also refuses filenames containing invalid UTF-8 sequences.
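If you want to check how a particular mount point behaves, something like the following should do. It is only a minimal sketch, assuming a POSIX system where Python accepts `bytes` paths; the function name `accepts_invalid_utf8` and the test filename are made up for illustration and do not come from the linked article.

```python
import os
import tempfile


def accepts_invalid_utf8(directory: str) -> bool:
    """Return True if a file with a non-UTF-8 name can be created in `directory`.

    Hypothetical helper for illustration; assumes a POSIX system.
    """
    # The byte 0xFF never occurs in well-formed UTF-8.
    name = b"invalid-\xff-name"
    path = os.path.join(os.fsencode(directory), name)
    try:
        fd = os.open(path, os.O_CREAT | os.O_EXCL | os.O_WRONLY, 0o600)
    except OSError:
        # The filesystem (or the OS) refused the name.
        return False
    os.close(fd)
    os.unlink(path)  # clean up the probe file
    return True


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as scratch:
        print(scratch, accepts_invalid_utf8(scratch))
```

Point it at a directory on each filesystem you care about. The result describes that one mount only, which is the whole point.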
Python does not like tar archives whose member names contain invalid UTF-8 sequences.
And a little ugly detail: Apparently there is a function sys.getfilesystemencoding() that takes no parameters. Python seems to assume that all filesystems have the same encoding and that it is not path dependent.
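To illustrate: the encoding is a single process-wide setting, and undecodable bytes are handled by the error handler rather than by any per-path knowledge. A small sketch, assuming Python 3.6 or later on a POSIX system (where the error handler is normally `surrogateescape`); the sample filename is invented:

```python
import os
import sys

# Latin-1 encoded "café.txt"; the lone 0xE9 byte is not valid UTF-8.
raw = b"caf\xe9.txt"

# Both calls take no path argument: one answer for the whole process.
encoding = sys.getfilesystemencoding()
errors = sys.getfilesystemencodeerrors()

# os.fsdecode()/os.fsencode() apply that one encoding and error handler.
text = os.fsdecode(raw)
roundtrip = os.fsencode(text)

print(encoding, errors)
print(repr(text))
print(roundtrip == raw)
```

So Python can carry such names around and hand them back to the OS unchanged, but it has no idea that one mount might expect UTF-8 and another might not.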
This is at least conceptually similar to my pet problem with File::Spec, which assumes uniform behaviour across all mounted filesystems.
Alexander
--
Today I will gladly share my knowledge and experience, for there are no sweeter words than "I told you so". ;-)