If you want to download and normalise the entire Gutenberg English language corpus, a good starting point might be http://www.monlp.com/2012/04/09/calculating-word-statistics-from-the-gutenberg-corpus/.
Translating the code into Perl from Slytherin is left as an exercise...
Update: linkified 'Slytherin'.
In reply to Re^3: Random phrases
by Not_a_Number
in thread Random phrases
by BrowserUk
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |