or download this
Benchmark: timing 5000000 iterations of with_my, without_my...
with_my: 11.0969 wallclock secs ( 6.75 usr + 3.42 sys = 10.17 CPU)
+ @ 491642.08/s (n=5000000)
...
with_my: 11.0938 wallclock secs ( 7.35 usr + 3.70 sys = 11.05 CPU)
+ @ 452488.69/s (n=5000000)
without_my: 11.018 wallclock secs ( 7.29 usr + 3.69 sys = 10.98 CPU)
+@ 455373.41/s (n=5000000)