in reply to Re^3: utf8 "\xD0" does not map to Unicode at /path/comparebin.pl line line_number, <STDIN> line line_number
in thread utf8 "\xD0" does not map to Unicode at /path/comparebin.pl line line_number, <STDIN> line line_number

The files were recovered from an NTFS file system to ext4 on Ubuntu.
I thought that ext4 and Ubuntu use UTF-8 by default, but I will try setting binmode on STDIN to :raw to see if it helps.

Re^5: utf8 "\xD0" does not map to Unicode at /path/comparebin.pl line line_number, <STDIN> line line_number
by ikegami (Patriarch) on Nov 23, 2014 at 02:07 UTC

    A Unix file name can be any sequence of bytes that doesn't contain 00 or 2F ("/" in ASCII). In this case, the name contains a piece of a UTF-8 sequence (the byte D0) that isn't valid UTF-8 on its own.

    Your GUI and terminal operate using UTF-8, but that doesn't mean you can't create a file name that's not UTF-8.
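
To illustrate the point, here is a minimal sketch that checks whether a byte string is well-formed UTF-8 by attempting a strict decode with the core Encode module. The helper name is_valid_utf8 and the sample byte strings are made up for illustration; they are not taken from comparebin.pl.

```perl
use strict;
use warnings;
use Encode qw(decode);

# Hypothetical helper: true if $bytes is well-formed UTF-8, false otherwise.
# FB_CROAK makes decode() die on malformed input instead of substituting
# U+FFFD; LEAVE_SRC keeps decode() from modifying the string passed in.
sub is_valid_utf8 {
    my ($bytes) = @_;
    my $ok = eval {
        decode('UTF-8', $bytes, Encode::FB_CROAK() | Encode::LEAVE_SRC());
        1;
    };
    return $ok ? 1 : 0;
}

# Illustrative byte strings (assumed, not real file names from the thread).
my @samples = (
    "report.txt",      # plain ASCII
    "\xD0\x94.txt",    # D0 94 is a complete two-byte UTF-8 sequence
    "\xD0.txt",        # D0 alone is a lead byte with no continuation byte
);

for my $name (@samples) {
    my $hex = join ' ', map { sprintf '%02X', ord } split //, $name;
    printf "%-28s %s\n", $hex, is_valid_utf8($name) ? 'valid UTF-8' : 'not valid UTF-8';
}
```

If the names come from STDIN, calling binmode(STDIN, ':raw') first should let the script read them as bytes, so a check like this can decide which ones are safe to decode.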