I am trying to write a transliterator for converting roman into another language.
It is much easier to write using roman/keyboard but the output will show in the appropriate language font.
For example:
dny = character #225
kh = character #35
k = character #12
h = character #10
a = character #8
kha = character #35#8 and not #12#10#8
What pattern should I write to split a string in those
characters? For example khatos should split into kh, a, t,o,s
So triplecharacter pattern should match first and then
double character then single. It is guranteed that there
will be always some vowels in between but they can be
multcharcter always.
For example word mukharjee should split into
m,u, kh,a, rj,ee. Is it possible get first character that
is not a vowel, then get vowels then characters again?
Once I split it, all I need to do is to find associated
charcter from assoc array and print.
Thanks for your help.
Originally posted as a Categorized Question.
Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
Read Where should I post X? if you're not absolutely sure you're posting in the right place.
Please read these before you post! —
Posts may use any of the Perl Monks Approved HTML tags:
- a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
| |
For: |
|
Use: |
| & | | & |
| < | | < |
| > | | > |
| [ | | [ |
| ] | | ] |
Link using PerlMonks shortcuts! What shortcuts can I use for linking?
See Writeup Formatting Tips and other pages linked from there for more info.