A single word makes too small a sample, but for larger runs of text the frequency of coincident characters is a guide. I think Kahn's popular book on crypto describes the algorithm and its uses. It can detect key length for multiple cyphers, and can often identify the language of the plaintext.

Your character frequency and vowel placement ideas seem good to me. They may also be suffering from small sample size.

As it is, you might try tr// of 1337 digits for words matching /\d/, minimum Text::Levenshtein distance from words in a wordlist, and inclusion (through index or m//i) of words in the wordlist. Those will be time-consuming, but I don't see a way around that.

Added: To recognise language-like nonwords, you could check that all adjacent pairs of characters are high-probability ones in the language.

After Compline,
Zaxo


In reply to Re: recognizing URL text by Zaxo
in thread recognizing URL text by vacant

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.