Beefy Boxes and Bandwidth Generously Provided by pair Networks
laziness, impatience, and hubris

Re^8: Capitalize First Letter of Each Word

by waswas-fng (Curate)
on Jun 30, 2005 at 17:35 UTC ( #471402=note: print w/replies, xml ) Need Help??

in reply to Re^7: Capitalize First Letter of Each Word
in thread Capitalize First Letter of Each Word

Well, without limiting scope on your uc/lc rules you are in for a world of hurt. Because, while you can put a large list of known acronyms in your code and special case them, the text could a Randomly Made Up Acronym (RMUA) that is new to you. This turns out to be a very hard problem to do well -- I know a few guys at MS who do some of the backend work on Word's grammar engine and they have spent a ton of time on this very issue. Either you settle for a known but limit use or you drop thousands of lines of code on the problem and still end up with something that is not perfect. I think in word they just assume and group of capped letters is an acronym (nix some special cases), and if all words in a sentence are Capped then bypass acronym rules altogether. An example for how ugly this can be is:
Some of the known acronyms for THE. THE Technische Hogeschool Eindhoven THE Technological Horizons in Education THE Tennessee Hospitality Education (Council) THE Teresina, Piaui, Brazil - Teresina (Airport Code) THE The Humane Environment (Jef Raskin) THE Theatre THE Toronto Health Economics (Network) THE Transportable Helicopter Enclosure
!-- Node text goes above. Div tags should contain sig only -->


Log In?

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://471402]
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others perusing the Monastery: (4)
As of 2023-02-08 07:25 GMT
Find Nodes?
    Voting Booth?
    I prefer not to run the latest version of Perl because:

    Results (40 votes). Check out past polls.