And... why do you need to filter the input? XML::Twig (and expat and XML::Parser in the lower levels) is more than happy to accept that input.
Now if what you want is to get rid of all non-ASCII characters, then have a look at Text::Unidecode (US-ASCII transliterations of Unicode text). Use it in an output filter, or apply it to the result of your processing. Et voilà!
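Here is a minimal sketch of the output-filter route, assuming both XML::Twig and Text::Unidecode are installed. The sample document and variable names are made up for illustration; the input is written as raw UTF-8 bytes, which is what the parser expects when there is no encoding declaration.

```perl
use strict;
use warnings;
use XML::Twig;
use Text::Unidecode;    # exports unidecode()

# Hypothetical sample input: UTF-8 encoded XML ("\xc3\xa0" is a-grave).
my $xml = "<doc><p>Et voil\xc3\xa0</p></doc>";

# The output filter is called on whatever the print functions
# (print, sprint, flush) produce, so the transliteration happens
# on the way out and the parsed data stays intact.
my $twig = XML::Twig->new(
    output_filter => sub { unidecode( $_[0] ) },
);
$twig->parse($xml);

my $ascii = $twig->sprint;
print "$ascii\n";
```

If you would rather not set a filter, building the twig without `output_filter` and calling `unidecode( $twig->sprint )` on the result should give you the same kind of output. Note that either way the whole serialized string goes through unidecode, markup included, so this only makes sense when your element and attribute names are already plain ASCII.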
In reply to Re: XML::Twig and UTF-8
by mirod
in thread XML::Twig and UTF-8
by bobf