Would've been nice of me to elaborate, sorry :-)
OK - Now I have about 10 meg, maybe more. I don't know
what I will have down the road. Maybe 100 meg next time
around?
And the data - it's a single number (a count) plus a
text descriptor. Descriptor may have whitespace, but
I can count on only one space between number and descriptor
Now, what I need to do is take all the lines that match
descriptor, add their counts, and print them.
i.e.,
my ($freq, $word) = split;
$freq_hash{$word} += $freq;
does the job in a rather unscalable way
Make sense?
Thanks again,
felonious
--