danjkool35 has asked for the wisdom of the Perl Monks concerning the following question:
Hi,
I'm trying to parse some xml files, some of which contain non-UTF-8 characters, using XML::Simple.
I get the following error message:
/Users/Dan/Documents/Corpora/HIV_Database/xml/1438278.xml:66: parser error : Input is not proper UTF-8, indicate encoding !
Does anyone how to tweak the options, so I can either exclude these characters from the file or better still read them in.
Thanks
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re: XML::Simple Non-UTF-8 characters won't read
by ikegami (Patriarch) on Feb 01, 2011 at 17:49 UTC | |
by danjkool35 (Initiate) on Feb 02, 2011 at 12:58 UTC | |
|
Re: XML::Simple Non-UTF-8 characters won't read
by grantm (Parson) on Feb 02, 2011 at 00:04 UTC | |
by danjkool35 (Initiate) on Feb 02, 2011 at 12:59 UTC |