in reply to Reading HUGE file multiple times
Hi,
I deal with similar problems on huge data files almost every day, and I am fairly confident that it should be possible to read the file only once (or at most twice). However, you don't give enough information about the structure of the file.
Is my understanding correct that you first have a bunch of identifier lines (1000+), and then your data lines? And that the identifier lines somehow give the rules for what to do with the data lines? Or do you have one identifier line giving information about what to do with the next data line or the next several data lines?
Please tell us more about the identifiers: do they say on which data line numbers to do something? Or which field to extract in the data line?
In any case, I believe it should most probably be possible to read your file sequentially only once: record what you find in each identifier line and use that to process the data lines that follow. But I can't say more about how to do it without a better idea of your data format or, even better, a simplified sample of your file content together with some explanation of how the identifiers are used to analyze the data lines.
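To make the single-pass idea concrete, here is a minimal sketch in Python. Since the real file format is unknown, everything about the layout is an assumption made up for illustration: identifier lines are assumed to look like `ID <name> <field_index>`, data lines are assumed to be whitespace-separated fields, and the function name `process_once` is hypothetical. The only point is the shape of the loop: remember what each identifier says, then apply it to the data lines read afterwards.

```python
import io

def process_once(handle):
    """Read the input sequentially once, recording identifier rules
    and applying them to every data line that comes after them."""
    rules = {}    # identifier name -> 0-based field index (assumed rule format)
    results = {}  # identifier name -> values extracted from data lines
    for line in handle:
        line = line.rstrip("\n")
        if not line:
            continue  # skip blank lines
        if line.startswith("ID "):
            # Assumed identifier layout: "ID <name> <field_index>"
            _, name, index = line.split()
            rules[name] = int(index)
            results[name] = []
        else:
            # Assumed data layout: whitespace-separated fields
            fields = line.split()
            for name, index in rules.items():
                if index < len(fields):
                    results[name].append(fields[index])
    return results

# Tiny in-memory sample standing in for the huge file
sample = io.StringIO("ID first 0\nID third 2\na b c\nd e f\n")
print(process_once(sample))
```

Because each rule is applied only to the data lines read after it, the same loop covers both layouts asked about above: all identifiers up front, or identifiers interleaved with the data they describe. Memory use grows with the number of identifiers and the extracted results, not with the size of the file.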
Replies are listed 'Best First'.
Re^2: Reading HUGE file multiple times
by Anonymous Monk on Apr 28, 2013 at 12:42 UTC
by BrowserUk (Patriarch) on Apr 28, 2013 at 13:30 UTC
by Anonymous Monk on Apr 28, 2013 at 13:36 UTC
by BrowserUk (Patriarch) on Apr 28, 2013 at 13:45 UTC
by Anonymous Monk on Apr 28, 2013 at 14:16 UTC
by Anonymous Monk on Apr 28, 2013 at 14:06 UTC
Re^2: Reading HUGE file multiple times
by Anonymous Monk on Apr 28, 2013 at 12:45 UTC