in reply to Scaling Hash Limits
If you have to re-do the duplicate check a few times (say every-time new data is added to the set), perhaps now is the moment to store the data in a database and use the database-engine as your duplicate-detector.
CountZero
A program should be light and agile, its subroutines connected like a string of pearls. The spirit and intent of the program should be retained throughout. There should be neither too little or too much, neither needless loops nor useless variables, neither lack of structure nor overwhelming rigidity." - The Tao of Programming, 4.1 - Geoffrey James
My blog: Imperial Deltronics
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^2: Scaling Hash Limits
by Endless (Beadle) on Sep 19, 2013 at 13:32 UTC | |
by BrowserUk (Patriarch) on Sep 19, 2013 at 15:30 UTC | |
by Laurent_R (Canon) on Sep 20, 2013 at 22:13 UTC | |
by BrowserUk (Patriarch) on Sep 20, 2013 at 23:13 UTC | |
by Laurent_R (Canon) on Sep 21, 2013 at 09:11 UTC | |
by Anonymous Monk on Sep 20, 2013 at 23:16 UTC | |
by Laurent_R (Canon) on Sep 21, 2013 at 09:39 UTC | |
by AnomalousMonk (Archbishop) on Sep 19, 2013 at 13:36 UTC |