in reply to How do I read in a document, remove the stop words and then write the result to a new file?

toolic and grg answer your questions; your cross-post at stackoverflow has an alternate.

Since the Lingua::StopWords documentation provides an example of how to get the stopwords themselves - - eg, function getStopWords(en) - - I have to read your intent as creating a new file with the content of the original less the stopwords.

Do you intend to use the new file as part of an index; perhaps inserting the words into a database for the purpose of cross-referencing the source of certain words?

Or do you have some other more arcane intent? Won't the original, minus the stock set of stopwords provided by the module, be intelligble?

  • Comment on Re: How do I read in a document, remove the stop words and then write the result to a new file?