in reply to Re^3: How to optimize a regex on a large file read line by line ?
in thread How to optimize a regex on a large file read line by line ?
Using my own 200 million record 2Gb file, it takes 25 secs to get a count of lines only and 50 seconds with the regex included. (win 10 i5 3.3GHz/8GB AS v5.16.1)
poj#!perl use strict; my $testfile = '200-million-combos.txt'; unless (-e $testfile){ open OUT,'>',$testfile or die "$!"; my $record = '890123456'; for (1..200_000_000){ print OUT $record."\n"; } close OUT; } my $counter1 = 0; my $counter2 = 0; my $t0 = time; open FH, '<', $testfile or die "$!"; while (<FH>) { ++$counter1; if (/123456$/){ ++$counter2; } } close FH; my $dur = time-$t0;; print "$counter1 read in $dur secs\n";
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^5: How to optimize a regex on a large file read line by line ?
by John FENDER (Acolyte) on Apr 16, 2016 at 18:01 UTC | |
by poj (Abbot) on Apr 16, 2016 at 18:14 UTC | |
by John FENDER (Acolyte) on Apr 16, 2016 at 18:24 UTC | |
by poj (Abbot) on Apr 16, 2016 at 20:28 UTC | |
by John FENDER (Acolyte) on Apr 16, 2016 at 22:42 UTC |