Re: Testing if an array contains a value and then deleting it in most efficient way

Replies are listed 'Best First'.
Re^2: Testing if an array contains a value and then deleting it in most efficient way by parv (Parson) on Feb 18, 2008 at 16:21 UTC
In code (while using array) ... `use List::MoreUtils qw/ firstidx /; for my $val ( generate() ) { my $i; # Deal w/ "uninitialized variable" warning as appropriate. while ( -1 < ( $i = firstidx { $_ == $val } @array ) ) { delete $array[ $i ]; } } # Later... # exists() here would go better with earlier delete(), but # then would need to generate list of indices. my @save = grep { defined $_ } @array;` [download]	[reply] [d/l]
Re^3: Testing if an array contains a value and then deleting it in most efficient way by ikegami (Patriarch) on Feb 18, 2008 at 16:30 UTC
ow! You transformed a O(N) problem into an O(N²) solution. `firstidx` starts at the start of the array every loop pass. Not only did you half the speed, you doubled the memory requirements. "`firstidx`" copies the entire array before processing it.	[reply] [d/l] [select]
Re^4: Testing if an array contains a value and then deleting it in most efficient way by parv (Parson) on Feb 18, 2008 at 16:40 UTC
How can you avoid O(N^2) cost if using unsorted array? So far there is no indication whether it would be sorted. (Addendum: I wasn't thinking so just quoted the O(N^2) cost, which really should have been O(N). Time to sleep belatedly, I suppose.) I thought that List::MoreUtils::firstidx would use XS magic (to avoid copying). No? (I would have looked inside the C code myself but am not familiar with XS yet.) Later ... I see now that array might be already sorted (and first element would be the interesting one).	[reply]
Re^5: Testing if an array contains a value and then deleting it in most efficient way by ikegami (Patriarch) on Feb 18, 2008 at 16:50 UTC
Re^6: Testing if an array contains a value and then deleting it in most efficient way by ikegami (Patriarch) on Feb 19, 2008 at 03:59 UTC
Some notes below your chosen depth have not been shown here
Re^2: Testing if an array contains a value and then deleting it in most efficient way by karpatov (Beadle) on Feb 18, 2008 at 16:30 UTC
In fact the values to be looked-up will come numerically sorted (probably if the record IDs in db are really ordered by their numeric ID), so sorting the values in the array would mean that only first element is deleted. So it seems that I can only ask whether the value to be looked up equals just the first item of the array! How only could miss it? Tx for pointing the possibility of ordering. Undefs are ignored during the search, so their presence doesnt slowdown the procedure?	[reply]
Re^3: Testing if an array contains a value and then deleting it in most efficient way by ikegami (Patriarch) on Feb 18, 2008 at 16:37 UTC
Fortunately for you, deleting the first (or last) element of an array (`shift(@a)`) is very fast (O(1)) in Perl.	[reply] [d/l]
Re^4: Testing if an array contains a value and then deleting it in most efficient way by karpatov (Beadle) on Feb 18, 2008 at 16:45 UTC
There is one problem though, the values to be check against the array come from a db which is splitted into several files, so their names something0001,something1000 should be ordered but @files = sort { $a <=> $b } @files causes the IDE to freeze. What can be wrong? tx.	[reply]
Re^3: Testing if an array contains a value and then deleting it in most efficient way by parv (Parson) on Feb 18, 2008 at 16:43 UTC
A side note ... The records may be sorted but will not be returned as such when not asked, if database server is Sybase 15.0.x.	[reply]
Re^4: Testing if an array contains a value and then deleting it in most efficient way by mpeppler (Vicar) on Feb 19, 2008 at 11:03 UTC
Even more of a side note - one should never rely on the inherent ordering of data in a SQL database. Any ordering is a side effect of the storage mechanism, and can change (for example due to parallel query engines). Always use an ORDER BY clause if the order of the returned data is significant in any way! Michael PS - for Sybase it's not only ASE 15.0.x that has this behavior - it's any table that has row-level locking (DOL), or if you have partitioned tables, or if you use parallel queries.	[reply]
OT: Sybase by parv (Parson) on Feb 19, 2008 at 11:24 UTC