comment on

Yes, I fixed two significant bugs in your code
That is an outright lie. There simply aren't "two significant bugs" in that code. Neither of those trivia you've identified make the slightest bit of difference to the reproducible data posted.

1) Since I can't reproduce any of the "data" posted in the node with the bugs (since I still don't know what your input file was, other than its size), I wonder what "reproducible data posted" you are talking about.

2) The "trivia" of leaving off /g had a huge impact on performance for me. If I could reproduce your performance data, then I could determine by how much the lack of /g changed it for your secret input file. After making my own input file, the lack of /g made a 10x change in performance (for your code).

3) I was repeatedly quite clear that the two other bugs did not significantly impact performance (which is the data you posted). They were significant in that they caused incorrect output to be produced which was not acceptable to me since I didn't want to do a bunch of benchmarks and then notice that I wasn't actually doing the same work (like you did).

I don't know how you came up with your "2x slower" [....] Since you won't post code

I used your code. The only code I didn't fully post was trivia such as for generating the input and gathering the timings. If you can't figure that out without me spoon-feeding it to you, then here:

time perl -le'print "a"x1024 for 1..100_000' > 1kLines.txt
time perl -pe's/a/A/g' 1kLines.txt > output.cmp
time perl bukQueue.pl 1kLines.txt > output.tmp
diff -sq output.cmp output.tmp
[download]

and real timings he (and anyone; 'cept maybe you) can verify for themselves.

Oh, I didn't realize you had distributed your input file to the OP (and everyone except me). The change in performance is drastically impacted by the input used. For the stupidest-possible benchmark, you can use this instead:

time perl -le'print "z"x1024 for 1..100_000' > 1kLines.txt
[download]

which gives me 10x CPU consumption difference (still nowhere near your original "reproducible data" of "less than 2 minutes" vs "~4 hours").

The more realistic but still silly comparison gave me results similar to:

time perl -le'print "a"x1024 for 1..100_000' > 1kLines.txt
real      0m00.685s
user      0m00.228s
sys       0m00.460s
cpu       0m00.688s

time perl -pe's/a/A/g' 1kLines.txt > output.cmp
real      0m44.254s
user      0m43.323s
sys       0m00.924s
cpu       0m44.247s

rm -f output.tmp
time perl tyeQueue.pl 1kLines.txt
real      0m52.719s
user      1m32.618s
sys       0m02.692s
cpu       1m35.310s
diff -sq output.cmp output.tmp
Files output.cmp and output.tmp are identical

time perl bukQueue.pl 1kLines.txt > output.tmp
real      0m49.997s
user      1m35.598s
sys       0m01.696s
cpu       1m37.294s
diff -sq output.cmp output.tmp
Files output.cmp and output.tmp differ
[download]

CPU consumption goes from just under 45 seconds to a bit over 90 seconds. 2.2x more CPU used.

Feel free to scale up to 3.5GB, change the line contents, and reproduce your "2 minutes vs 4 hours" data but this time in a way that anyone with hours to waste on rather meaningless benchmarking can actually reproduce.

When you do that, also show some benchmarks for your chosen line contents but a small fraction of the 3.5GB so people with less copious time to waste can chip in. And then also explain your guess as to why the smaller file gives 2x..10x CPU consumption difference while your 3.5GB file produces a 120x difference (or whatever it is).

Since you keep emphasizing "120x" as being "my" number, perhaps you could also tell me what number you come up with for "<2 minutes vs ~4 hours". You said "my original figures are based upon actual cpu usage". The first performance figure I see from you is "That will take ~4 hours to process a 3.5 GB file". That sounds like elapsed time to me. So I bet the "120x" is not accurate. It was convenient short-hand and should be in the general ballpark based on "Neither of those trivia you've identified make the slightest bit of difference to the reproducible data posted".

It'd be nice if rather than just hinting that "120x" is an unreasonable interpretation that you would actually provide what the clear performance differences are for your original runs (the "<2 minutes" and "~4 hours" runs). If it was actually "~100x CPU usage" or "~50x elapse time (using two cores)" or whatever, just state it clearly.

And, yes, I haven't posted the code for tyeQueue.pl. As you can see, the performance differences compared to bukQueue.pl are trivial. The output from it is correct, however. I played around with a bunch of things in tyeQueue.pl and they made no difference in performance. So there is no point getting distracted with one particular version of that trivia.

- tye

In reply to Re^11: (Innuendo and guesswork) by tye
in thread Using kernel-space threads with Perl by aberman

Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!

Titles consisting of a single word are discouraged, and in most cases are disallowed outright.

Read Where should I post X? if you're not absolutely sure you're posting in the right place.

Please read these before you post! —

Posts may use any of the Perl Monks Approved HTML tags:

a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr

You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)

	For:		Use:
	&		`&`
	<		`<`
	>		`>`
	[		`[`
	]		`]`

Link using PerlMonks shortcuts! What shortcuts can I use for linking?

See Writeup Formatting Tips and other pages linked from there for more info.