in reply to Best Way To Parse Concordance DAT File Using Modern Perl?
If it's a UTF-8 file, shouldn't it have a 3-byte BOM (EF BB BF)? Your BOM indicates that it's a UTF-16 file, not UTF-8.
Anyway, the node "UTF-8 text files with Byte Order Mark" discussed this, and the comments there may be helpful.
See the File::BOM module, which was mentioned in that node as a way to open files that may contain a BOM.
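To check what the file really is before choosing a decoding layer, something along these lines might help. It is only a rough sketch using core Perl, not what File::BOM does internally; `sniff_bom` and `open_decoded` are hypothetical names made up for this example, and falling back to UTF-8 when no BOM is found is an assumption you may want to change.

```perl
use strict;
use warnings;

# Hypothetical helper: guess the encoding of a file from its byte order mark.
# Returns (encoding name, BOM length in bytes), or (undef, 0) if no BOM is found.
sub sniff_bom {
    my ($path) = @_;
    open(my $fh, '<:raw', $path) or die "Cannot open $path: $!";
    my $count = read($fh, my $head, 4);
    close $fh;
    die "Cannot read $path: $!" unless defined $count;
    $head = '' unless defined $head;

    # The UTF-32 marks are tested first because FF FE 00 00 also starts with
    # the UTF-16LE mark. (A UTF-16LE file whose first character is U+0000
    # would be misread as UTF-32LE; that is rare enough to ignore here.)
    return ('UTF-32LE', 4) if $head =~ /\A\xFF\xFE\x00\x00/;
    return ('UTF-32BE', 4) if $head =~ /\A\x00\x00\xFE\xFF/;
    return ('UTF-8',    3) if $head =~ /\A\xEF\xBB\xBF/;
    return ('UTF-16LE', 2) if $head =~ /\A\xFF\xFE/;
    return ('UTF-16BE', 2) if $head =~ /\A\xFE\xFF/;
    return (undef, 0);
}

# Hypothetical helper: open a file for reading, skip any BOM, and decode
# using the detected encoding (assumed default: UTF-8 when there is no BOM).
sub open_decoded {
    my ($path) = @_;
    my ($encoding, $bom_length) = sniff_bom($path);
    $encoding = 'UTF-8' unless defined $encoding;

    open(my $fh, '<:raw', $path) or die "Cannot open $path: $!";
    seek($fh, $bom_length, 0) or die "Cannot seek in $path: $!";
    binmode($fh, ":encoding($encoding)");
    return $fh;
}
```

Calling `sniff_bom` on the DAT file should tell you quickly whether you are dealing with the 3-byte UTF-8 mark or a 2-byte UTF-16 one, and `open_decoded` then gives you a handle that returns decoded characters without the BOM in the first field.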
Re^2: Best Way To Parse Concordance DAT File Using Modern Perl?
by Jim (Curate) on Dec 10, 2012 at 22:09 UTC
by Anonymous Monk on Jan 15, 2013 at 23:24 UTC