in reply to Advice for optimizing lookup speed in gigantic hashes
I ran such a dictionary over a gigabyte of text on my computer, and it took five minutes to check every word. This works out to about 3.5 days on the whole 1 TB corpus,
It seems unlikely that your slow timings are due to the hash lookup. Far more likely to be down to how you are breaking your data into words?
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^2: Advice for optimizing lookup speed in gigantic hashes
by tobek (Novice) on Aug 23, 2011 at 02:47 UTC | |
by BrowserUk (Patriarch) on Aug 23, 2011 at 03:27 UTC | |
by BrowserUk (Patriarch) on Aug 23, 2011 at 03:10 UTC | |
by tobek (Novice) on Aug 23, 2011 at 03:56 UTC | |
by BrowserUk (Patriarch) on Aug 23, 2011 at 04:41 UTC | |
by tobek (Novice) on Aug 23, 2011 at 14:41 UTC | |
| |
| A reply falls below the community's threshold of quality. You may see it by logging in. |