Well, without limiting scope on your uc/lc rules you are in for a world of hurt. Because, while you can put a large list of known acronyms in your code and special case them, the text could a Randomly Made Up Acronym (RMUA) that is new to you. This turns out to be a very hard problem to do well -- I know a few guys at MS who do some of the backend work on Word's grammar engine and they have spent a ton of time on this very issue. Either you settle for a known but limit use or you drop thousands of lines of code on the problem and still end up with something that is not perfect. I think in word they just assume and group of capped letters is an acronym (nix some special cases), and if all words in a sentence are Capped then bypass acronym rules altogether. An example for how ugly this can be is:
Some of the known acronyms for THE.
THE Technische Hogeschool Eindhoven
THE Technological Horizons in Education
THE Tennessee Hospitality Education (Council)
THE Teresina, Piaui, Brazil - Teresina (Airport Code)
THE The Humane Environment (Jef Raskin)
THE Theatre
THE Toronto Health Economics (Network)
THE Transportable Helicopter Enclosure
!-- Node text goes above. Div tags should contain sig only -->
-
Are you posting in the right place? Check out Where do I post X? to know for sure.
-
Posts may use any of the Perl Monks Approved HTML tags. Currently these include the following:
<code> <a> <b> <big>
<blockquote> <br /> <dd>
<dl> <dt> <em> <font>
<h1> <h2> <h3> <h4>
<h5> <h6> <hr /> <i>
<li> <nbsp> <ol> <p>
<small> <strike> <strong>
<sub> <sup> <table>
<td> <th> <tr> <tt>
<u> <ul>
-
Snippets of code should be wrapped in
<code> tags not
<pre> tags. In fact, <pre>
tags should generally be avoided. If they must
be used, extreme care should be
taken to ensure that their contents do not
have long lines (<70 chars), in order to prevent
horizontal scrolling (and possible janitor
intervention).
-
Want more info? How to link
or How to display code and escape characters
are good places to start.
|