in reply to unicode combined characters in regular expressions
I think you have to explicitly allow the mark, e.g.
/(\p{Lu}\p{M}*){2}/ or /(\p{Lu}\p{Mn}*){2}/ # "non-spacing" mark
Update: Also, there's \X which matches a general "combining character sequence", though I wouldn't know how to specify the desired "letter"/"uppercase" property in that case... -- but you probably knew this already.
|
|---|