in reply to term weight
I see that better minds have not responded, and they usually do sooner than this, so I'll take a stab at it.
So, taking term weight to mean simply the ratio of the term frequency to the number of tokens in the text, basically you need to:
It would help us a lot if you posted your ideas about what you need to do as well and perhaps define "term weight" I'm hoping you mean my interpretation above, which is pretty well accepted as a general definition in linguistic circles.
Maybe someone a little smarter and a little more awake than I am can come up with a way to combine a few of these steps, but it looks like you may have to add stem tags to the text in order to accomplish your goal.
--
Allolex
|
|---|