in reply to Junk NOT words
Why not measure the letter distribution of the sample being tested. I.e, count the number of 'a's, 'b's and so on. You could then compare the calculated distribution with one appropriate for your language, tables of which could almost certainly be found by googling, and if the actual result is within acceptable bounds, then assume it's ok.
A little further explanation: if the input's random(ish), then the letter distribtion should be fairly close to random (ignoring keyboard patterns). Real, written english (at least) has a much higher proportion of 'e's, 't's, and 's's.
cheers
|
---|
Replies are listed 'Best First'. | |
---|---|
Re: Re: Junk NOT words
by FamousLongAgo (Friar) on Oct 30, 2002 at 17:56 UTC |