in reply to Re^2: Using a variable with UTF8 content coming from XPATH findvalue
in thread Using a variable with UTF8 content coming from XPATH findvalue
In the meantime (just for grins, as we say ;), you might try running your sample XML file through this tool that I posted a while back: tlu -- TransLiterate Unicode
If the file really does contain any utf8 Russian character(s), a command line like this will tell you the exact unicode code point(s) and character name(s):
If there are Russian characters in your file, but they aren't really utf8-encoded (trust me, it happens!), then "tlu" will either report errors or else spit out "FFFD ... REPLACEMENT CHARACTER", and that is most likely the source of all your trouble -- you would need to convert the data from ... (whatever encoding it really is) into true and valid utf8.tlu -o uf test.xml | grep CYRILLIC
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^4: Using a variable with UTF8 content coming from XPATH findvalue
by inguanzo (Acolyte) on Sep 27, 2007 at 22:40 UTC |