Re: a question about making a word frequency matrix


	PerlMonks


by l3v3l (Monk)

on Dec 08, 2005 at 00:16 UTC


in reply to a question about making a word frequency matrix

I know this is not a working solution to your query, but here is a one-liner that I often use to get this kind of information quickly for any block of text I am dealing with:

perl -nle '$c{$_}++ for split/\s/;}print map {"$_:$c{$_}\n"} sort{$c{$b}<=>$c{$a}}keys %c;{' file_of_WHATEVER_to_count.txt
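In case the stray braces look odd: -n wraps the code in while (<>) { ... }, so the leading } closes that loop early and the trailing { opens a bare block that the wrapper's own closing brace then finishes. A rough longhand sketch of the same idea follows. The sub names count_tokens and report and the sample lines are made up for illustration, and the tie-break in the sort is an addition of mine.

```perl
use strict;
use warnings;

# Hypothetical helper (name is mine): count whitespace-separated items.
sub count_tokens {
    my @lines = @_;
    my %c;
    for my $line (@lines) {
        chomp $line;                      # -l chomps each input line
        $c{$_}++ for split /\s/, $line;   # same split as the one-liner
    }
    return \%c;
}

# Hypothetical helper: "item:count" lines, most frequent first.
# The "or $a cmp $b" tie-break is an addition so equal counts print in a
# stable order; the one-liner leaves ties in hash order.
sub report {
    my ($c) = @_;
    return map  { "$_:$c->{$_}\n" }
           sort { $c->{$b} <=> $c->{$a} or $a cmp $b }
           keys %$c;
}

# Made-up sample input standing in for file_of_WHATEVER_to_count.txt.
my @sample = ("the cat and the dog\n", "the end\n");
print report(count_tokens(@sample));
```

For the two sample lines this prints the:3 first, followed by and:1, cat:1, dog:1 and end:1.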

For example, to count the number of instances of each word in a file, you could use split/\W/ in place of split/\s/, or just split if you want to keep punctuation, etc. intact. The output is ordered by frequency, with the most frequent items at the top of the "item:count\n" listing.
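A quick way to see how the split forms differ is to run them on one line. The sketch below uses a made-up sample string and joins the fields with | so empty ones show up. A single-character pattern such as \s or \W produces an empty string wherever two separators sit side by side, and the one-liner would count that empty string as an item too. The \W+ variant is my own suggestion, not part of the one-liner.

```perl
use strict;
use warnings;

# Made-up sample line; note the double space after "world!".
my $line = "Hello, world!  Hello again.";

my @by_space    = split /\s/,  $line;  # as in the one-liner
my @by_nonword  = split /\W/,  $line;  # words only, punctuation dropped
my @by_nonword2 = split /\W+/, $line;  # suggested variant: runs of separators
my @bare        = split ' ',   $line;  # what a bare split does on $_

print join('|', @by_space),    "\n";   # Hello,|world!||Hello|again.
print join('|', @by_nonword),  "\n";   # Hello||world|||Hello|again
print join('|', @by_nonword2), "\n";   # Hello|world|Hello|again
print join('|', @bare),        "\n";   # Hello,|world!|Hello|again.
```

If the empty items are a nuisance, \s+ or \W+ (or a bare split) gets rid of most of them, though a line that starts with a separator will still give one leading empty field with the pattern forms.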