in reply to Re^5: Random sampling a variable length file.
in thread Random sampling a variable record-length file.
How certain are you that this is true? If there is no correlation between any characteristic of interest in a record
Simple. I cannot know what will be inside the file, because the user may apply the process to any file of their choosing.
A pollster might discover that the "random selection" they made of the populace happens, by coincidence, to consist of the entire membership of some extremist political organisation, but they cannot know that until they take the sample.
Put another way, there may be all manner of correlations, but none of them are known, and so cannot be utilised.
So, at this point, the problem is how to take a statistically valid random sample of records from any file without resorting to reading the entire file. I've described the inferences to be drawn elsewhere.
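To make the problem concrete, here is a minimal sketch of the obvious seek-based approach, assuming newline-terminated records. It is not the method settled on in this thread, and the names (`sample_records`, `path`, `k`, `rng`) are made up for illustration. It seeks to a random byte offset, throws away the record it lands in, and keeps the next one.

```python
import os
import random

def sample_records(path, k, rng=random):
    """Pick up to k distinct records by seeking to random byte offsets.

    Assumes newline-terminated records (an assumption, not from the thread).
    """
    size = os.path.getsize(path)
    if size == 0:
        return []
    picked = {}                      # start offset -> record, to skip duplicates
    attempts = 0
    with open(path, "rb") as fh:
        # Cap the attempts so a file with fewer than k records cannot loop forever.
        while len(picked) < k and attempts < 20 * k:
            attempts += 1
            fh.seek(rng.randrange(size))
            fh.readline()            # discard the record we landed in
            start = fh.tell()
            record = fh.readline()
            if not record:           # landed in the last record: wrap to the first
                fh.seek(0)
                start = 0
                record = fh.readline()
            picked.setdefault(start, record)
    return list(picked.values())
```

Note the catch: a record is chosen with probability proportional to the length of the record before it, not uniformly. So the sample is only unbiased for a given characteristic if that characteristic is uncorrelated with the preceding record's length, which is exactly the thing that cannot be known in advance for an arbitrary file.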
Replies are listed 'Best First'.
Re^7: Random sampling a variable length file.
by eye (Chaplain) on Dec 27, 2009 at 21:49 UTC
by BrowserUk (Patriarch) on Dec 27, 2009 at 22:04 UTC
by pajout (Curate) on Dec 30, 2009 at 11:29 UTC
by BrowserUk (Patriarch) on Dec 30, 2009 at 11:51 UTC