I think you have to explicitly allow the mark, e.g.
/(\p{Lu}\p{M}*){2}/ or /(\p{Lu}\p{Mn}*){2}/ # "non-spacing" mark
Update: Also, there's \X which matches a general "combining character sequence", though I wouldn't know how to specify the desired "letter"/"uppercase" property in that case... -- but you probably knew this already.
In reply to Re: unicode combined characters in regular expressions
by almut
in thread unicode combined characters in regular expressions
by telcontar
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |