in reply to How do I read in a document, remove the stop words and then write the result to a new file?
toolic and grg answer your questions; your cross-post at stackoverflow has an alternate.
Since the Lingua::StopWords documentation provides an example of how to get the stopwords themselves - - eg, function getStopWords(en) - - I have to read your intent as creating a new file with the content of the original less the stopwords.Do you intend to use the new file as part of an index; perhaps inserting the words into a database for the purpose of cross-referencing the source of certain words?
Or do you have some other more arcane intent? Won't the original, minus the stock set of stopwords provided by the module, be intelligble?
|
|---|