in reply to Re^3: Namespace/advice for new CPAN modules for Thai & Lao
in thread Namespace/advice for new CPAN modules for Thai & Lao
Regexp is definitely not the place to post character sets, definitely ... except that it is :) um, I changed my mind
well actually UTF8 definitely belongs nowhere , characters ranges aren't UTF8, this module doesn't deal with UTF8 ... sure the OP wrote
but he meant some unicode string$char = "..."; # some UTF8 string
So after some more research
Regexp::Cherokee - Regular Expressions Support for Cherokee Script. Regexp::Ethiopic - Regular Expressions Support for Ethiopic Script. Regexp::CharClasses - Provide character classes Regexp::EN::NumVerbage - Regex pattern to match English number verbage + in text Regexp::CharClasses - Provide character classes
I propose the one new namespace to rule them all Regexp::CharProps - User Defined Character Properties like \p{InKona}
So for the OP Regexp::CharProps::Thai - \p(InThaiDigit} and other thai language character properties definitions
|
---|
Replies are listed 'Best First'. | |
---|---|
Re^5: Namespace/advice for new CPAN modules for Thai & Lao ( Regexp::CharProps - User Defined Character Properties )
by Anonymous Monk on Mar 24, 2015 at 00:06 UTC | |
by Polyglot (Chaplain) on Mar 24, 2015 at 05:00 UTC | |
by Polyglot (Chaplain) on Mar 24, 2015 at 09:32 UTC | |
|