in reply to Re^3: Possible to have regexes act on file directly (not in memory)
in thread Possible to have regexes act on file directly (not in memory)
Then, there is the question of the size of the input. On my server, processing a 10 GB (line-based) file with a relatively simple regex might take 5 to 10 minutes. It would probably be a bit faster if not line-based, reading chunks of say 1 MB. With a TB of data, it would take quite a bit of time, but that might still be relatively manageable. But that's assuming a simple regex with no need to backtrack. With a regex implying a lot of backtracking, it might very easily be completely unmanageable.
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^5: Possible to have regexes act on file directly (decompose regex)
by LanX (Saint) on May 02, 2014 at 19:28 UTC | |
by Laurent_R (Canon) on May 02, 2014 at 21:07 UTC | |
by LanX (Saint) on May 02, 2014 at 21:55 UTC |