Beefy Boxes and Bandwidth Generously Provided by pair Networks
Problems? Is your data what you think it is?
 
PerlMonks  

Re: How to count the vocabulary of an author?

by bliako (Monsignor)
on Jun 11, 2021 at 20:32 UTC ( #11133798=note: print w/replies, xml ) Need Help??


in reply to How to count the vocabulary of an author?

There are stemming (basically chopping off letters from the end of a word in order to arrive to a basis) and tagging (find out which part of speech a word is, e.g. verb) packages in cpan and specific to different languages. e.g. Lingua::*

Then ask uncle NSA and aunty CIA for the corpus, they keep meticulous records for all major european politicos' conversations.

  • Comment on Re: How to count the vocabulary of an author?

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://11133798]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others wandering the Monastery: (4)
As of 2022-08-08 12:56 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found

    Notices?