I asked them this too, if I could just split the file by strings...that'd be easy....
Since I don't know what the data is for or from(they don't tell me, since I am not a permanent employee) they just said that these strings are too frequent among the file. They're something like START_HTML and END_HMTL is what I've heard. Maybe its a web crawler?
But they said that splitting based on this would make thousands and thousands of files. So I can't.
In reply to Re^2: File Processing...
by Anonymous Monk
in thread File Processing...
by Anonymous Monk
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |