http://qs1969.pair.com?node_id=324921


in reply to Efficient ways of storing a data set for random access

Your approach is not bad at all: you are trading memory for an access speed that may be much higher than the actual generation requires. Using a data file also makes it simple to tweak the data distribution, just by altering the number of elements in the file.

I'd extend it by randomizing based on the actual number of items read from the file, and I'd load the file once at startup without touching it again; this way your program will be fast.
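Something along these lines is what I have in mind. It is only a rough sketch, written in Python for illustration. The names `load_items` and `pick` are made up, and I am assuming a data file with one item per line, where repeating a line makes that item proportionally more likely:

```python
import random


def load_items(path):
    """Read the data file once; one item per line, blank lines skipped.

    Assumed layout (hypothetical): repeating a line in the file makes
    that item proportionally more likely to be picked.
    """
    with open(path, encoding="utf-8") as handle:
        return [line.strip() for line in handle if line.strip()]


def pick(items, rng=random):
    """Return one item, indexing by the number of items actually loaded."""
    # len(items) is whatever was read from the file, so editing the file
    # changes the distribution without any change to the code.
    return items[rng.randrange(len(items))]
```

Call `load_items` once at startup and keep the list around; every later `pick(items)` is just an in-memory index lookup, with no further file access.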

Or of course you could use one of the statistical modules...