I don't know if you're tied to ICU, but I've had good experience with Text::Unidecode, which turns Unicode strings (back to) Roman text data.
| [reply] |
Thanks for the recommendation but I should have said that I need conversion between strict standards-based scripts like IAST and Devenagari and that module just (quite well apparently) lets you do a helpful ASCII transliteration. Specifically, I need to be able to do things like IAST Sanskrit -> Devanagari as this is the way to collate such languages.
| [reply] |
However, it's said to be alpha quality and has a lot of compiler warnings.
But does it work for you? BTW, I see only three deprecation warnings when build it on Ubuntu with libicu52
| [reply] |
I rebuilt and fixed the warnings and it does now appear to build cleanly - a real tribute to the backwards compat of ICU ... I have to wait until my Sanskrit source can verify if the transliteration looks ok ...
| [reply] |
Apparently not. It seems that ICU doesn't support IAST, only the more general and different ISO15919. Ah well, perhaps I will have to fight with Lingua::Translit.
| [reply] |
> So, does anyone know what happened to PICU - the "wrapper for ICU"?
I was the co-author of PICU back in 2002. The source is still online, but I don't believe you will be able to build it 2 decades later without significant effort. | [reply] |