Anonymous Monk has asked for the wisdom of the Perl Monks concerning the following question:
I am getting files from various sources in different European languages. The requirement for these files is that everything must be ASCII compatible.
So for special characters in Danish, Finnish, etc., the UTF-8 codes (UTF codes) are typed in directly. So a line of text could contain:
"This line contains 0xC30x86n exotic character."
And this should be printed into HTML and PDF with the right fused AE character:
"This line contains Æn exotic character."
Changing the format of the files is not an option as it is easy for everyone to type the UTF codes for a character they do not even know.
My question is this: how should I read these text files and evaluate these special characters on the fly?
Some pointers would be much appreciated. Many thanks, Chandra
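
One possible way to approach this, sketched here as an illustration and not taken from any of the replies: find each run of typed-in `0xHH` tokens, pack the hex digits into raw bytes, and decode those bytes as UTF-8 with the core `Encode` module. The helper names `decode_run` and `expand_utf8_codes` are hypothetical, and the sketch assumes every code is written as `0x` followed by exactly two hex digits, as in the sample line.

```perl
use strict;
use warnings;
use Encode qw(decode);

# Hypothetical helper: turn one run such as "0xC30x86" into raw bytes,
# then decode those bytes as UTF-8 into Perl characters.
sub decode_run {
    my ($run) = @_;
    my @hex_pairs = $run =~ /0x([0-9A-Fa-f]{2})/g;
    my $bytes = pack('H*', join('', @hex_pairs));
    return decode('UTF-8', $bytes);
}

# Hypothetical helper: replace every run of 0xHH tokens in a line.
# Assumption: the text never contains a literal "0xHH" that should
# stay as typed; such text would be converted too.
sub expand_utf8_codes {
    my ($line) = @_;
    $line =~ s/((?:0x[0-9A-Fa-f]{2})+)/decode_run($1)/ge;
    return $line;
}

# Encode output as UTF-8 so the character survives into the HTML.
binmode(STDOUT, ':encoding(UTF-8)');
print expand_utf8_codes("This line contains 0xC30x86n exotic character."), "\n";
```

With the sample line, the two bytes C3 86 decode to U+00C6 (Æ). If a run does not form valid UTF-8, `decode` with its default settings substitutes U+FFFD for the malformed part, so stricter checking may be worth adding for real input.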
Re: Evaluating UTF codes in a file
by almut (Canon) on Nov 26, 2009 at 16:51 UTC

Re: Evaluating UTF codes in a file
by ikegami (Patriarch) on Nov 26, 2009 at 16:53 UTC

Re: Evaluating UTF codes in a file
by Anonymous Monk on Nov 26, 2009 at 20:54 UTC