Re^3: Divide array of integers into most similar value halves

Replies are listed 'Best First'.
Re^4: Divide array of integers into most similar value halves by gone2015 (Deacon) on Sep 02, 2008 at 00:46 UTC
So... 2100 is ~ 1030. If the computer can try one combination in (just for the sake of argument) ~3 micro-secs, it can try ~ 1013 combinations per annum. So a result may be expected in 1017 years. Perhaps we should spend more on the hardware ? Actually, it's not as bad as 2n. 2n - 1 is the number of different combinations of n things. However, we have two partitions, A & B, and there is some symmetry, which reduces the number of combinations. Consider: there are n possible partition A's containing 1 number -- but this is the same arrangement as the n possible partition A's containing n-1 numbers (but with A & B exchanged). So we don't count both of A with 1 number and A with n-1 numbers, and we don't count both A with 2 numbers and A with n-2 numbers, and so on. The effect of this is to halve the number of possible arrangements[1]. Also: we can discount the 1 possible partition A with all the numbers in it. Phew. We only have wait half the time we expected. There, that's made a difference :-) Mind you, we're not taking into account the doubling of processor speed every 18 months or so. Which makes this sort of problem a kind of inverted Xeno's paradox. Let me see, if in 18 months I can buy a machine that will do twice as much work as the one I have: in 36 months time I can have done the same amount of work by waiting 18 months, and starting then with the faster machine. Also, that machine will be cheaper than one I can buy now ! Of course, in 36 months time I will be able to buy an even cheaper machine which is four times as fast. For maximum efficiency, it is clear: I should do nothing, indefinitely ! [1] If n is odd, the number of distinct partitions is exactly 2*(n-1) - 1. If n is even it's a bit trickier to work out, you have to add (1/2) (n!/(2*(n/2)!) to that.	[reply]
Re^5: Divide array of integers into most similar value halves by BrowserUk (Patriarch) on Sep 04, 2008 at 00:23 UTC
Could you discuss the effect of duplicates in the dataset? Is it as simple as making n = no of unique values? Examine what is said, not who speaks -- Silence betokens consent -- Love the truth but pardon error. "Science is about questioning the status quo. Questioning authority". In the absence of evidence, opinion is indistinguishable from prejudice. "Too many [] have been sedated by an oppressive environment of political correctness and risk aversion."	[reply]
Re^4: Divide array of integers into most similar value halves by Pepe (Sexton) on Sep 01, 2008 at 20:06 UTC
An approximation is more than enough. Actually i could skip the big lists and do it only with lists with less than 10 numbers, if that would help. The real motivation for the problem (sorry for not explaining before) is that I need both subarrays to have a sum over a not-so-big threshold (easily achieved with 2 or 3 numbers per array). So if the big array is really big I can assume that the two small ones will pass the threshold. Thanks for your help.	[reply]
Re^4: Divide array of integers into most similar value halves by ikegami (Patriarch) on Sep 01, 2008 at 20:11 UTC
I'm pretty sure the brute force method (find all divisions of all permutations) is O(N³), much better than O(2^N)	[reply]
Re^5: Divide array of integers into most similar value halves by moritz (Cardinal) on Sep 01, 2008 at 20:16 UTC
Would you care to elaborate? I admit that I only picked up the "NP complete" that Skeeve mentioned earlier. On thinking a bit more it seems that the problem is roughly equivalent with calculating the sum of each item of the power set, which is O(2N), and I see no obvious way to reduce that to O(N3).	[reply]
Re^6: Divide array of integers into most similar value halves by ikegami (Patriarch) on Sep 01, 2008 at 20:42 UTC
It's obviously not the best algorithm, and I rushed the implementation. use strict; use warnings; use Algorithm::Loops qw( NextPermuteNum ); use List::Util qw( sum ); my @nums = @ARGV; my $sum = sum @nums; my @best = @nums; my $best; @nums = sort { $a <=> $b } @nums; do { my $p = $sum; my $q = 0; for my $i (0..$#nums) { my $x = $nums[$i]; $p -= $x; $q += $x; my $diff = abs($p-$q); if (!defined($best) \|\| $best > $diff) { $best = $diff; @best = ( [ @nums[ 0 .. $i ] ], [ @nums[ $i+1 .. $#nums ] ], ); } } } while ( NextPermuteNum(@nums) ); my @p = @{ $best[0] }; print(sum(@p), ": ", join('+', @p), "\n"); my @q = @{ $best[1] }; print(sum(@q), ": ", join('+', @q), "\n"); [download] `>perl script.pl 8 14 32 29 40: 8+32 43: 14+29 >perl script.pl 7 10 12 15 40 44: 7+10+12+15 40: 40` [download] I'm not clear on the complexity of NextPermuteNum — I have a cold that is making me very tired — so maybe my analysis is wrong.	[reply] [d/l] [select]
Re^7: Divide array of integers into most similar value halves by moritz (Cardinal) on Sep 01, 2008 at 20:50 UTC