rob_au has asked for the wisdom of the Perl Monks concerning the following question:
The most direct method of course would be to use the escaped URL of the web page as the key (most likely that generated by URI::Escape), but I am wondering if there might exist a cleaner and more expansive (read, ordered) way to index such pages. I have also considered using a MD5 hash of either the URL or the page itself as the key for indexing, but this seems to be an overkill with the time involved in subsequently generating these MD5 hashes to perform a lookup. The onus here for ease and speed is not so much in the indexing but the subsequent matching and lookup of the data - It should be noted that subsequent lookup will be again derived from the location URL.
Should I stick with the idea of an escaped URL as the hash key or do other monks here have a more ordered approach that I can use to index this data?
Ooohhh, Rob no beer function well without!
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re: Data indexing in BerkeleyDB hashes
by blakem (Monsignor) on Sep 16, 2001 at 12:40 UTC | |
|
Re: Data indexing in BerkeleyDB hashes
by thpfft (Chaplain) on Sep 16, 2001 at 15:23 UTC | |
|
Re: Data indexing in BerkeleyDB hashes
by perrin (Chancellor) on Sep 16, 2001 at 19:43 UTC | |
|
Re: Data indexing in BerkeleyDB hashes
by shotgunefx (Parson) on Sep 17, 2001 at 00:54 UTC |