in reply to Re: Hash versus substitutation efficiency?
in thread Hash versus substitutation efficiency?
Basically, I have a large set of documents and want to categorize them so they can be explored and searched by term. Terms would include root terms and their synonyms. For example, the root term "meat" could cover "ham", "beef", or "chicken". This could be stored in a hash so that every synonym leads to its root term:

    $termHash{ "ham" } = "meat";
    $termHash{ "beef" } = "meat";

In this way, different terms that mean or relate to basically the same thing will all be counted as occurrences of the root term. When searching for "ham", one might see documents that contain "beef" or "meat" as well as "ham".
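To make the counting idea concrete, here is a minimal sketch of how the synonym hash might be used on one document's text. The helper name `count_root_terms`, the sample sentence, and the choice to map each root term to itself are assumptions for illustration, not something settled in this thread.

```perl
use strict;
use warnings;

# Synonym-to-root map as described above. Mapping the root term to
# itself is an assumption, so one lookup handles every known term.
my %termHash = (
    meat    => 'meat',
    ham     => 'meat',
    beef    => 'meat',
    chicken => 'meat',
);

# Hypothetical helper: count root-term occurrences in a document's text.
sub count_root_terms {
    my ($text) = @_;
    my %count;
    # Lowercase the text, then walk over each run of word characters.
    for my $word ( lc($text) =~ /\w+/g ) {
        my $root = $termHash{$word} or next;    # skip words that are not terms
        $count{$root}++;
    }
    return \%count;
}

my $counts = count_root_terms('Ham and beef sandwiches; no chicken today.');
print "$_: $counts->{$_}\n" for sort keys %$counts;
```

Each word costs one hash lookup, so the work grows with the length of the document rather than with the number of synonyms. Multi-word terms would need a different tokenizing step, which this sketch does not attempt.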
The numbers are a bit overwhelming:
Re^3: Hash versus substitutation efficiency?
by Roy Johnson (Monsignor) on Oct 12, 2004 at 15:51 UTC
by bwelch (Curate) on Oct 12, 2004 at 16:05 UTC