I am just running it through 5000 iterations. I did comment out the "new" part of the benchmark, since I don't find that bit interesting. I am just looking at the "output" times.
i added some more tests in later versions. current version ist 0.380.39. i think the interesting part is new+param+output, and with that and a 18k template i get: