in reply to Re^4: removing duplicate lines
in thread removing duplicate lines

That's very interesting. I am sure I read somewhere that it was good practice to factor out system overheads such as i/o when benchmarking algorithms, which is why I have laid out the scripts that way. However, this now appears to be a flawed approach to real world problems.

I will mend my ways :-)

Thank you,

JohnGG

Replies are listed 'Best First'.
Re^6: removing duplicate lines
by revdiablo (Prior) on Apr 11, 2006 at 16:24 UTC
    I read somewhere that it was good practice to factor out system overheads such as i/o when benchmarking algorithms

    That may be a good rule of thumb, but -- as should now be obvious -- it doesn't apply all the time. In this case, the I/O is part of what we're testing. Factoring it out alters the algorithm significantly.

      I will definitely bear this in mind in future. It had never occurred to me that you could do seeks on the DATA filehandle; that's really neat.

      Thank you for the instruction,

      JohnGG