in reply to Re^3: Dictionary filter regex
in thread Dictionary filter regex

Thanks, interesting. I see you also have the issue I have - _differnet_ typos ~

next if $word ~=

Heh.

My original code produces:

time perl sh.pl > sh.txt real 0m3.770s user 0m3.692s sys 0m0.074s

...and using the great sort of 4 liner while loop:

time perl sh.pl > sh.txt real 0m4.192s user 0m4.170s sys 0m0.015s

seems slower.

Also, as to the error on open a file, I never bother when doing it in a terminal on a local machine as I know the file exists - in other circumstances I would, of course.

Thanks for your input!

Nick

P.S. The first word my dictionary file pulls up is abandon ship

Replies are listed 'Best First'.
Re^5: Dictionary filter regex
by Laurent_R (Canon) on Nov 27, 2016 at 13:19 UTC
    I have different results. With a words.txt file containing about 113,800 words, these are my timings. With an array:
    $ time perl -e 'open my $fh, "<", "words.txt" or die; my @w = <$fh>; > my $c; > foreach $line(@w) { > if ($line =~ /s.*h/i) { > if ( ($line =~ /s.*s/i) || ($line =~ /h.*h/i) ) { > next; > } > $c++; > } > } > print "$c \n"; > ' 2834 real 0m0.138s user 0m0.093s sys 0m0.016s
    Reading directly from the file:
    $ time perl -e 'open my $fh, "<", "words.txt" or die; > my $c; > foreach $line(<$fh>) { > if ($line =~ /s.*h/i) { > if ( ($line =~ /s.*s/i) || ($line =~ /h.*h/i) ) { > next; > } > $c++; > } > } > print "$c \n"; > ' 2834 real 0m0.120s user 0m0.093s sys 0m0.015s
    I have used exactly the same code except for the use of the array, so that the comparison should be relatively significant.

    But the main point is that if the input file gets huge, then you sometimes simply can't load it into an array, because you'll run out of memory.