Re^3: Regex to remove generic accounts

Replies are listed 'Best First'.
Re^4: Regex to remove generic accounts by AnomalousMonk (Archbishop) on Oct 28, 2008 at 12:12 UTC
`(?!pattern)` is a negative look-ahead. Take a look at the sub-section Look-Around Assertions in the section on Extended Patterns in perlre.	[reply] [d/l]
Re^4: Regex to remove generic accounts by JavaFan (Canon) on Oct 28, 2008 at 07:58 UTC
They are certainly not equivalent. There are 10 characters that match `/[0-9]/`. The number of characters that match `/\d/` varies from Perl version to Perl version. There are more than 100 characters that match `/\d/` in 5.10, and that's only a proper subset of what is being matched in blead.	[reply] [d/l] [select]
Re^5: Regex to remove generic accounts by rovf (Priest) on Oct 28, 2008 at 09:36 UTC
There are more than 100 characters that match /\d/ in 5.10 Does this mean that digits from other languages are also considered as 'digit' by \d? For example, if I have a string consisting of Japanese kanji, would \d match the Kanji digits too? -- Ronald Fischer <ynnor@mm.st>	[reply]
Re^6: Regex to remove generic accounts by JavaFan (Canon) on Oct 28, 2008 at 10:26 UTC
Yes, and no. Digits from other languages are matched by \d, but not every language. I think, but I haven't studied the Unicode property database in detail, that if the language uses a strict base-10 system, its digits are matched by \d. But the existance of a "tens" or "hundreds" symbol exclude all its digits from being matched by \d. And it may very well be that the database isn't consistent in this aspect. I don't know what system Japanese uses, but AFAIK, Kanji digits aren't matched by \d.	[reply]
Re^7: Regex to remove generic accounts by rovf (Priest) on Oct 28, 2008 at 13:32 UTC
Re^8: Regex to remove generic accounts by JavaFan (Canon) on Oct 28, 2008 at 13:42 UTC
Some notes below your chosen depth have not been shown here