Beefy Boxes and Bandwidth Generously Provided by pair Networks
P is for Practical
 
PerlMonks  

Re^8: Capitalize First Letter of Each Word

by waswas-fng (Curate)
on Jun 30, 2005 at 17:35 UTC ( [id://471402]=note: print w/replies, xml ) Need Help??


in reply to Re^7: Capitalize First Letter of Each Word
in thread Capitalize First Letter of Each Word

Well, without limiting scope on your uc/lc rules you are in for a world of hurt. Because, while you can put a large list of known acronyms in your code and special case them, the text could a Randomly Made Up Acronym (RMUA) that is new to you. This turns out to be a very hard problem to do well -- I know a few guys at MS who do some of the backend work on Word's grammar engine and they have spent a ton of time on this very issue. Either you settle for a known but limit use or you drop thousands of lines of code on the problem and still end up with something that is not perfect. I think in word they just assume and group of capped letters is an acronym (nix some special cases), and if all words in a sentence are Capped then bypass acronym rules altogether. An example for how ugly this can be is:
Some of the known acronyms for THE. THE Technische Hogeschool Eindhoven THE Technological Horizons in Education THE Tennessee Hospitality Education (Council) THE Teresina, Piaui, Brazil - Teresina (Airport Code) THE The Humane Environment (Jef Raskin) THE Theatre THE Toronto Health Economics (Network) THE Transportable Helicopter Enclosure
!-- Node text goes above. Div tags should contain sig only -->


-Waswas

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://471402]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others having a coffee break in the Monastery: (5)
As of 2024-04-19 04:04 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found