in reply to regex for utf-8

What Zaxo said.

I seem to getting a lot of milage out of this advice recently, but have you considered using iconv or recode for this task? These are not Perl anything, but tools designed to convert between character encodings (recode is available for GNU/Linux and Windows--see this link, iconv is distributed with RedHat GNU/Linux distros).

--
Allolex

Replies are listed 'Best First'.
Re: Re: regex for utf-8
by jjohhn (Scribe) on Feb 27, 2003 at 23:37 UTC
    I did come across recode in my searching. In a addition to doing the task itself (which I can do right now by pouring the text into, converting, and pouring out of either vim, textpad or MS Access 2000), I want to understand this at a deeper level than I do. I also want to know how those three tools are doing the task.