Regexp is definitely not the place to post character sets, definitely ... except that it is :) Um, I changed my mind.
Well, actually UTF8 definitely belongs nowhere: character ranges aren't UTF8, and this module doesn't deal with UTF8. Sure, the OP wrote
$char = "..."; # some UTF8 string
but he meant some unicode string.
So after some more research:
Regexp::Cherokee - Regular Expressions Support for Cherokee Script
Regexp::Ethiopic - Regular Expressions Support for Ethiopic Script
Regexp::CharClasses - Provide character classes
Regexp::EN::NumVerbage - Regex pattern to match English number verbage in text
I propose the one new namespace to rule them all:
Regexp::CharProps - User Defined Character Properties like \p{InKona}
So for the OP:
Regexp::CharProps::Thai - \p{InThaiDigit} and other Thai language character property definitions
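For anyone who hasn't written one, here is a rough sketch of what a user-defined property like \p{InThaiDigit} could look like, using the mechanism described in perlunicode. The helper name is_thai_digit is made up for illustration, and the range U+0E50..U+0E59 is the Thai digits block; a real Regexp::CharProps::Thai would presumably export the property sub from its own package rather than define it in main, so treat this as an assumption about the shape, not the module's actual code.

```perl
use strict;
use warnings;

# User-defined property: the sub name must start with "In" or "Is".
# It returns lines of "start<TAB>end" code points in hex.
# Range used here: THAI DIGIT ZERO (U+0E50) .. THAI DIGIT NINE (U+0E59).
sub InThaiDigit {
    return "0E50\t0E59\n";
}

# Hypothetical helper, only for illustration: true if the string
# is exactly one Thai digit character.
sub is_thai_digit {
    my ($char) = @_;
    return $char =~ /\A\p{InThaiDigit}\z/ ? 1 : 0;
}

my $char = "\x{0E53}";    # THAI DIGIT THREE, a character string, not UTF8 bytes
print "matched a Thai digit\n" if is_thai_digit($char);
```

Note that the property matches against decoded characters, which is the point made above: the ranges are code points, and UTF8 only comes into it when the string is read or written.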
In reply to Re^4: Namespace/advice for new CPAN modules for Thai & Lao ( Regexp::CharProps::Thai )
by Anonymous Monk
in thread Namespace/advice for new CPAN modules for Thai & Lao
by Polyglot