in reply to Re^2: Fast algorithm for 2d array queries
in thread Fast algorithm for 2d array queries
See the referenced thread. I later implemented a partial SSE version as well (merge using SSE, scan not optimized). Result:
Update:
Total 301068 elements in 30 vectors
timethis for 5: 5 wallclock secs ( 5.31 usr + 0.00 sys = 5.31 CPU) @ 439.36/s (n=2333)
Update2: Right you are, BrowserUk, I was only considering the small-n case.
Total 200752 elements in 4 vectors
timethis for 5: 6 wallclock secs ( 5.28 usr + 0.00 sys = 5.28 CPU) @ 1326.33/s (n=7003)
Total 50063728 elements in 1000 vectors
timethis for 5: 6 wallclock secs ( 6.32 usr + 0.00 sys = 6.32 CPU) @ 0.63/s (n=4)
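
As a rough cross-check of the figures above: the rate Benchmark prints is the iteration count n divided by the CPU seconds, and multiplying that rate by the element total gives an approximate elements-per-second figure that is easier to compare across the three runs. The small Python sketch below only redoes that arithmetic. The names `RUNS` and `throughput` are made up for illustration, and the elements-per-second number assumes that every iteration processes all elements exactly once, which the post does not state.

```python
# Benchmark figures copied from the timethis lines above:
# (total elements, vectors, iterations n, CPU seconds)
RUNS = [
    (301068, 30, 2333, 5.31),
    (200752, 4, 7003, 5.28),
    (50063728, 1000, 4, 6.32),
]


def throughput(total_elements, iterations, cpu_seconds):
    """Return (iterations per CPU second, elements per CPU second)."""
    rate = iterations / cpu_seconds
    # Assumption: every iteration processes all elements exactly once.
    return rate, total_elements * rate


if __name__ == "__main__":
    for elements, vectors, n, cpu in RUNS:
        rate, eps = throughput(elements, n, cpu)
        print(f"{vectors:5d} vectors: {rate:9.2f} iter/s, "
              f"{eps / 1e6:8.1f} M elements/s")
```

Comparing the per-element figures rather than the raw iteration rates is what makes the small-n and large-n runs comparable at all, since the three runs differ in total size by more than two orders of magnitude.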
Re^4: Fast algorithm for 2d array queries
by BrowserUk (Patriarch) on Feb 08, 2014 at 00:56 UTC

Re^4: Fast algorithm for 2d array queries
by BrowserUk (Patriarch) on Feb 08, 2014 at 09:02 UTC