in reply to Re^2: keeping diacritical marks in a string
in thread keeping diacritical marks in a string
Hmm
I can't see the obvious source of the problem. I think you need to dump out the result of the request before any processing and be sure exactly where the special characters are being lost. i.e. is it coming correctly out of LWP, is it the regex, could it be the MARC:: module, etc.
As graff said it shouldn't be losing these characters, but there are a number of places where things can go wrong.
It's all a bit complicated and I can't think of a good guide to it at the moment. On the other hand, I've never heard of Perl completely stripping special characters because of an encoding problem - normally, you would get a multi-byte utf-8 character treated as 2 or 3 characters if the encoding is not set correctly. So I suspect an error in some code somewhere - could it be that something is validating input and stripping out characters it doesn't think are "safe"...?
Sorry I can't be of more help. Try to narrow it down to where they disappear and it will be solved eventually.
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^4: keeping diacritical marks in a string
by Foxpond Hollow (Sexton) on Oct 09, 2009 at 06:51 UTC | |
by FalseVinylShrub (Chaplain) on Oct 09, 2009 at 10:42 UTC | |
by Foxpond Hollow (Sexton) on Oct 10, 2009 at 00:47 UTC | |
by FalseVinylShrub (Chaplain) on Oct 10, 2009 at 04:55 UTC |