nefertari has asked for the wisdom of the Perl Monks concerning the following question:
Here at our university i am the author of a perl-CGI-script, that searches for preprints on a preprintserver. Until last week it worked (with the exception of upgrades on the server and not getting the privilege to run CGIs). Since last week there is trouble with umlauts. It seems to be due to an update of the server from Debian Potato to Woody (i don't know any of these).
Today in the morning i changed it to die if the input didn't match /^((?:[a-zA-ZäöüÄÖÜß\s])*)$/, so that i could see, where our problems are. Now the funny part: "ö" matches every second time! (But in this case no matches with the preprint list are found, although we have two authors with ö in their names.)
To make it possible to search for people without umlauts on their keyboard we decided that oe should match ö in the search. So if i search for oe i again get no matches.
Does anybody of you have an idea what could be wrong?
Our data are in a XML-file, and umlauts are in a very ugly way encoded: <UL>o</UL> stands for ö. (I didn't design this part.) Then i parse them via XML::Parser and store only the matching preprints. One problem could be, that i write ö and the other umlauts directly in the perlscript. But i don't know another way to do this. If you know, i would be glad, if you could tell me how to achieve this.
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re: trouble with umlauts
by Biker (Priest) on Mar 19, 2002 at 16:31 UTC | |
|
perldoc perlunicode?
by RMGir (Prior) on Mar 19, 2002 at 16:25 UTC | |
|
Re: trouble with umlauts
by MZSanford (Curate) on Mar 19, 2002 at 16:56 UTC | |
by nefertari (Chaplain) on Mar 19, 2002 at 17:38 UTC | |
by Biker (Priest) on Mar 19, 2002 at 18:40 UTC | |
|
Re: trouble with umlauts
by mirod (Canon) on Mar 19, 2002 at 17:31 UTC | |
by nefertari (Chaplain) on Mar 19, 2002 at 17:35 UTC | |
|
Re: trouble with umlauts
by Anonymous Monk on Mar 19, 2002 at 16:59 UTC | |
by theorbtwo (Prior) on Mar 19, 2002 at 18:54 UTC | |
by nefertari (Chaplain) on Mar 20, 2002 at 08:45 UTC | |
by theorbtwo (Prior) on Mar 21, 2002 at 07:33 UTC | |
by nefertari (Chaplain) on Mar 21, 2002 at 08:28 UTC | |
|
Re: trouble with umlauts - update
by nefertari (Chaplain) on Mar 20, 2002 at 08:29 UTC |