I have several complex regular expressions that act as natural language interpreters to identify such things as terms in link text that could be considered nondescript, to which the title of the target page should be appended (click here, more, etc). For myself and for user documentation, I'd like to have a list of terms that those regexes match. Some terms would include descriptions of things like when they must be preceded by x number of characters, but those descriptions would be in english, and whenever practical, a regex would be expanded to a list of terms that it could match. I'd probably edit the list for the documentation to eliminate nonsensical terms that wouldn't appear in the link text.