in reply to UTF-8 text files with Byte Order Mark
so I kinda assume that Perl will handle this kind of stuff for me.
Having Perl remove the BOM automatically would be bad. For example, print while <$fh>; would no longer reproduce a file exactly, and there would be no other way to get an exact copy either.
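To make that concrete, here is a minimal sketch of the exact-copy property. The temporary file and its contents are made up for illustration; the point is only that reading without a decoding layer hands back every byte, BOM included.

```perl
use strict;
use warnings;
use File::Temp qw(tempfile);

# Write a small UTF-8 file that starts with a BOM (bytes EF BB BF).
# The file name and contents are illustrative only.
my ($out, $path) = tempfile(UNLINK => 1);
binmode $out;
print {$out} "\xEF\xBB\xBFhello\n";
close $out or die "close: $!";

# Read it back line by line with no decoding layer.
open my $fh, '<:raw', $path or die "open: $!";
my $copy = '';
$copy .= $_ while <$fh>;
close $fh;

# The BOM bytes are still present, so the copy matches byte for byte.
print $copy eq "\xEF\xBB\xBFhello\n" ? "exact copy\n" : "copy differs\n";
```

If the read silently dropped the BOM, the comparison above would fail and nothing in the program could recover the missing bytes.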
However, if the file contains that BOM, my program does not understand the first line in the file
Patient: "Doctor, it hurts when I do this."
Doctor: "So don't do it!"
If your program doesn't accept BOMs, don't feed it any. UTF-8 doesn't require a BOM.
Alternatively, you could change your spec and your program to accept one.
# Assumes $fh was opened with a decoding layer such as :encoding(UTF-8).
while (<$fh>) { s/^\x{FEFF}// if $. == 1; ... }
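A slightly fuller sketch of the same idea, under a few assumptions: the input is decoded with an :encoding(UTF-8) layer (on raw bytes the BOM is the three bytes EF BB BF, so a pattern for U+FEFF would not match), the helper name strip_bom is hypothetical, and an in-memory filehandle stands in for a real file so the example is self-contained.

```perl
use strict;
use warnings;

# Hypothetical helper (not from the original post): remove one leading
# U+FEFF from a string that has already been decoded to characters.
sub strip_bom {
    my ($line) = @_;
    $line =~ s/^\x{FEFF}//;
    return $line;
}

# Demo data: UTF-8 bytes beginning with a BOM, read via an in-memory handle.
my $bytes = "\xEF\xBB\xBFfirst line\nsecond line\n";
open my $fh, '<:encoding(UTF-8)', \$bytes or die "open: $!";

my @lines;
while (my $line = <$fh>) {
    # Only the first line can carry a BOM, so leave the rest untouched.
    $line = strip_bom($line) if $. == 1;
    push @lines, $line;
}
close $fh;

print @lines;
```

Restricting the substitution to the first line leaves any U+FEFF that appears later in the data alone, which may or may not be what your spec wants.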
Replies are listed 'Best First'.
Re^2: UTF-8 text files with Byte Order Mark
by muba (Priest) on Feb 13, 2007 at 20:05 UTC
by ikegami (Patriarch) on Feb 13, 2007 at 20:36 UTC
by muba (Priest) on Feb 13, 2007 at 20:43 UTC

Re^2: UTF-8 text files with Byte Order Mark
by Anonymous Monk on Jul 24, 2019 at 20:56 UTC