in reply to Millions of line segment intersection calcs: Looking for speed tips

I suspect that there's probably a way to optimize the algorithm, but I really can't think of what it would be. However, there are numerous little changes that might help your performance a bit (though probably nothing too dramatic. All small things, but they might help a little.

Replies are listed 'Best First'.
Re^2: Millions of line segment intersection calcs: Looking for speed tips
by Anonymous Monk on Aug 03, 2005 at 16:37 UTC
    Might also try inlining Determinant and some common sub-expression elimination...
    sub SegmentIntersection { my @points = @{$_[0]}; my @p1 = @{$points[0]}; # p1,p2 = segment 1 my @p2 = @{$points[1]}; my @p3 = @{$points[2]}; # p3,p4 = segment 2 my @p4 = @{$points[3]}; my $a = $p2[0] - $p1[0]; my $b = $p3[1] - $p4[1]; my $c = $p2[1] - $p1[1]; my $d = $p3[0] - $p4[0]; my $det = $a*$b - $c*$d; return 0 if (abs($det) < $delta); # parallel my $e = $p3[1]-$p1[1]; my $f = $p3[0]-$p1[0]; my $n1 = $f*$b - $e*$d my $n2 = $a*$e - $c*$f; if ($det > 0) { return $n1 < $det && $n2 < $det && $n1 > 0 && $n2 > 0; } else { return $n1 > $det && $n2 > $det && $n1 < 0 && $n2 < 0; } }
Re^2: Millions of line segment intersection calcs: Looking for speed tips
by drewhead (Beadle) on Aug 03, 2005 at 20:11 UTC
    Thanks for all your suggestions. I've considered all of them and here's what I found.
    Why are $points and $neighborEdges hashrefs mapping integers (as strings) to values?
    Big picture implementation: what we are talking about is one method in a larger OO style pm I'm building. $points gets populated elsewhere and it's eaiser to pass this around as a ref. Obvioulsy this isn't apparent in my example. However I really have no reason to keep neighborEdges that way and can change it. Impact here was marginal if any.
    but it looks like you don't actually care about the real distance between points. Instead, you care about relative distances. So you can store the square of the distances (by not doing the sqrt) and get the same results.
    I do care about the actual distances, but only for those edges I identify as neighboors. So this point is quite valid, I don't really need to sqrt until after I identify these. This isn't going to impact things sine the caculation of the edges takes a second vs 17ish minutes for the intersect loop.
    Since Determinant is called so many times, I wonder whether you'd get any improvement over rewriting it without the temporary variables.
    Tested, the answer appears to be no.
    I'd probably try rethinking SegmentIntersection.
    Ahhh! I love coming here and having people see all these things that make perfect send yet never occured to me. Ofcourse! This only has marginal impact. The test data leaves only 2240 Determinant calls out... so 1120 get dropped early.