in reply to Problem with quotes, speciao characters and so on, reading a xml file
Hi
How are you reading the XML file? It looks like you're just reading it in, not processing it as XML.
XML allows characters to be encoded as numeric character references, such as &#1234;. An XML parser converts these for you, but Perl does not understand them by itself.
I would suggest using an XML parser module, such as XML::LibXML; otherwise you are likely to encounter similar little problems to this one (for example: what if your hwAssetUserField3 element is split over multiple lines?).
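As a rough illustration of the parser route, here is a minimal sketch using XML::LibXML. The <asset> wrapper and the sample value are made up for the example; only the hwAssetUserField3 element name comes from the original question, and the module has to be installed from CPAN.

```perl
use strict;
use warnings;
use XML::LibXML;

# Hypothetical sample document: only the element name hwAssetUserField3
# comes from the original question, the rest is invented for illustration.
my $xml = <<'XML';
<?xml version="1.0" encoding="UTF-8"?>
<asset>
  <hwAssetUserField3>Se&#241;or &quot;Lopez&quot; &amp; Co</hwAssetUserField3>
</asset>
XML

# The parser decodes character references and predefined entities itself.
my $doc     = XML::LibXML->load_xml(string => $xml);
my $vNombre = $doc->findvalue('//hwAssetUserField3');

# The result is a Perl character string, so encode it on output.
binmode(STDOUT, ':encoding(UTF-8)');
print "$vNombre\n";
```

The value comes back already decoded, so no substitution on the raw text is needed, and it would not matter if the element's content were wrapped across several lines in the file.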
However, if you insist on doing it yourself, you could solve this particular problem with something like:
# not recommended! ... or tested ;-)
# the /e modifier is needed so that chr($1) is evaluated as code
$vNombre =~ s{&#([0-9]+);}{chr($1)}ge;
I would really suggest finding an XML parser, or at least an XML character reference converter that someone else has written, because you may also need to deal with hexadecimal references (&#x0A0A;) and named character entities (&aacute;).
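To give an idea of what a hand-rolled converter would have to cover, here is a sketch that handles decimal references, hexadecimal references and the five entities XML predefines. The decode_refs name and the sample string are my own, not from any module. Other named entities such as &aacute; are only defined by a DTD, so this sketch leaves them untouched, which is one more reason to prefer a real parser.

```perl
use strict;
use warnings;

# The five entities predefined by XML itself. Anything else (for example
# &aacute;) must be declared in a DTD and is left as-is by this sketch.
my %named = (amp => '&', lt => '<', gt => '>', quot => '"', apos => "'");

# decode_refs is a hypothetical helper name used only for this example.
sub decode_refs {
    my ($text) = @_;
    # One pass over the text, so "&amp;#65;" becomes "&#65;" and not "A".
    $text =~ s{&(?:#x([0-9A-Fa-f]+)|#([0-9]+)|([A-Za-z][A-Za-z0-9]*));}{
        defined $1          ? chr(hex($1))   # hexadecimal: &#x41;
      : defined $2          ? chr($2)        # decimal:     &#233;
      : exists $named{$3}   ? $named{$3}     # predefined:  &quot;
      :                       "&$3;"         # unknown name: keep unchanged
    }ge;
    return $text;
}

my $decoded = decode_refs('caf&#233; &#x41; &quot;x&quot; &aacute;');

binmode(STDOUT, ':encoding(UTF-8)');
print "$decoded\n";
```

Even this version ignores CDATA sections, entities declared in a DTD and invalid code points, all of which a parser already handles.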
FalseVinylShrub
Disclaimer: Please review and test code, and use at your own risk... If I answer a question, I would like to hear if and how you solved your problem.
Re^2: Problems with XML encoding
by Sombrerero_loco (Beadle) on Dec 29, 2009 at 11:49 UTC | |
by FalseVinylShrub (Chaplain) on Dec 29, 2009 at 12:14 UTC | |
by Jenda (Abbot) on Dec 30, 2009 at 11:14 UTC |