The problem with any/all solutions to this problem is that of acceptable usage. For example, there are technical documents who would refer to SCUBA but travel brochures that would refer to scuba. Similarly scientists work with LASER but companies sell laser devices.

You cant create rules about pronouncability because both 'laser' and 'scuba' are pronouncable. 'Sky' contains no vowels and so might/might not be an acronym. Qantas contains no 'u' after the Q, nor does Iraq. Qantas is the "Queensland and Northern Territory Aerial Service" however, noone uses that outside of a trivia game these days. Its considered a word. Iraq, of course, is from another language and doesn't follow English rules.

Databases like vera can help you with a list of known acronyms but in Real Estate advertising 'LUG' is a lock-up-garage whereas in a mechanical journal a 'lug' is a type of nut.

Given all this, when parsing user text I normally require that there be one or more lower-case letters in users input. That way I know they didn't just type it with the Caps-Lock button on. If it's all in capitals I'll ask them to change or confirm what they've entered.

"Get real! This is a discussion group, not a helpdesk. You post something, we discuss its implications. If the discussion happens to answer a question you've asked, that's incidental." -- nobull@mail.com in clpm

In reply to Re: Determining if a word is really an abbreviation by BigLug
in thread Determining if a word is really an abbreviation by Anonymous Monk

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.