Database Table to Hash of Hash of Array

This is useful for reproducing a database with an unknown schema into a hash of a hash of an array.

my $sqlstr1 ="select table_name, column_name".
                    " from Dba_tab_columns".
                    " where owner = '$schema'".
                    " order by table_name, column_name";
        my $sth=$db->prepare($sqlstr1) or print STDOUT "Query:GetData 
+Failed";
        $sth->execute;
        my $table_column = $sth->fetchall_arrayref;
        $sth->finish;
        
        for $table_column_r (@{$table_column})
        {
             if ($table_column_r->[0] eq "EMP")
            {
                 my $sqlstr2 =" select $table_column_r->[1]".
                            " from $schema.$table_column_r->[0]";
                my $sth=$db->prepare($sqlstr2) or print STDOUT "Query:
+GetTablesColumnsData Failed";
                $sth->execute;
                warn print STDOUT "\n***Error*** Table: $table_column_
+r->[0]  Column: $table_column_r->[1]      \n", $sth->errstr(), "\n"
                     if $sth->err();
                warn print SUMMARY "\n***Error*** Table: $table_column
+_r->[0]  Column: $table_column_r->[1]      \n", $sth->errstr(), "\n"
                     if $sth->err();
                my $column_data = $sth->fetchall_arrayref;
                $sth->finish;
                
                # This for-loop create a hash of a hash of an array wi
+th the keys {table name} and {column name}
                for $row (@{$column_data})
                {
                    push (@{$tables{$table_column_r->[0]}{$table_colum
+n_r->[1]}}, $row->[0]);
                    # To print value of hash: p @{$tables{$table_colum
+n_r->[0]}{$table_column_r->[1]}}
                }
                undef @data_array;
            }
        }
[download]

Comment on Database Table to Hash of Hash of Array Download Code

Replies are listed 'Best First'.
(ar0n) Re: Database Table to Hash of Hash of Array by ar0n (Priest) on Nov 30, 2001 at 08:32 UTC
To tell you the truth, your code is horrible. You're not using placeholders You're warning the return value of a print statement. Why? You're undefining `@data_array`, yet `@data_array` is neither used nor declared in your code. What does this code do, anyway? Your description wasn't very clear, and from the looks of your code its use appears to be restricted to a very small scope. Here's a rewrite: # "RaiseError => 1" tells DBI to die on errors. my $dbh = DBI->connect($DSN, $username, $password, { RaiseError => 1 } +); my $sth; $sth = $dbh->prepare(" SELECT table_name, column_name FROM Dba_tab_columns WHERE owner = ? ORDER BY table_name, column_name "); $sth->execute($schema); my $table_column = $sth->fetchall_arrayref; $sth->finish; for my $tcr (@{$table_column}) { if ($tcr->[0] eq "EMP") { my $sth = $dbh->prepare("SELECT " . $tcr->[1] . " FROM " . $schema + . $tcr->[0]); $sth->execute; my $column_data = $sth->fetchall_arrayref; $sth->finish; # This for-loop create a hash of a hash of an array with the keys +{table name} and {column name} for my $row (@{$column_data}) { push @{ $tables{$tcr->[0]}{$tcr->[1]} }, $row->[0]; + } } } [download] [ ar0n -- want job (boston) ]	[reply] [d/l]
Re: (ar0n) Re: Database Table to Hash of Hash of Array by timo (Novice) on Dec 01, 2001 at 05:32 UTC
Thanks alot for the feedback. I've haven't been writing Perl for very long and actually thought what I developed was pretty cool and portable to any Oracle db. I'm going to apply your comments to my code and continue to improve. Once again, thanks.	[reply]
Re: Database Table to Hash of Hash of Array by davorg (Chancellor) on Nov 30, 2001 at 15:33 UTC
In addition of other criticisms, you're also using a system-specific method of getting data about tables, instead of using DBI's more portable `table_info` method. -- <http://www.dave.org.uk> "The first rule of Perl club is you don't talk about Perl club."	[reply]
Re: Database Table to Hash of Hash of Array by runrig (Abbot) on Nov 30, 2001 at 05:25 UTC
So you're basically creating an in-memory copy of every table and creating an in-memory index of every table by every column (an equivalency index only, not good for '>' or '<' or 'like' comparisions). This is bound to suck up huge amounts of memory on a database of any significant size. But interesting :-)	[reply]
Re: Re: Database Table to Hash of Hash of Array by timo (Novice) on Dec 01, 2001 at 05:30 UTC
A full comparison of every piece of data in the database is my tasking. Is there a better way to approach this task w/o consuming so much memory? I've investigated an SQL command 'MINUS' that does this very thing but requires each database to have a dblink established. I wanted a program that would work on any Oracle db.	[reply]
Re: Re: Re: Database Table to Hash of Hash of Array by gmax (Abbot) on Dec 04, 2001 at 17:45 UTC
First a few small points: 1. Is it really necessary to call `$sth->finish` after a `fetchall_arrayref` ? All the records are already fetched by this time, so there is nothing to "finish". (I've learned that at my expenses. ;-) 2. A "comparison of every piece of data in the database" sounds like a job for a database engine. Exporting everything into memory would deprive you of the powerful querying system of your server. IMO, if you want to compare data, you are better off within the database engine itself (which you can wonderfully drive around with Perl/DBI of course). Comparing hashes of arrays is not an easy task, while comparing arrays of hashes ( = database tables) is everyday's task for any DB engine. 3. If what you want to achieve is comparing data across databases in different hosts, it would be enough to read one table at the time (hence, an array of hashes) provided that you have enough memory. If memory is an issue, here is the skeleton of a script that I use to compare my production db with the development db (lying in different hosts): #!/usr/bin/perl -w use strict; use DBI; # just configuration -- change according to your needs, # or use a config file my $DBD = "mysql"; # change to your DBD driver my $local_database = "monks"; my $local_host ="IP_development_DB"; my $remote_database = "monks"; my $remote_host="IP_production_DB"; my $local_password = "locsecret"; my $remote_password = "remsecret"; my $local_username = "locuser"; my $remote_username = "remuser"; my $local_DSN = "DBI:$DBD:$local_database;host=$local_host"; my $remote_DSN = "DBI:$DBD:$remote_database;host=$remote_host"; # end configuration my $local_dbh=DBI->connect($local_DSN, $local_username, $local_password, {RaiseError => 1}); my $remote_dbh=DBI->connect($remote_DSN, $remote_username, $remote_password, {RaiseError => 1}); my @local_tables = $local_dbh ->tables(); my @remote_tables = $remote_dbh->tables(); foreach my $table (@local_tables) { if (grep {$table eq $_} @remote_tables) { my $query = qq{ SELECT COUNT() FROM $table}; my $local_sth = $local_dbh->prepare( $query ); my $remote_sth = $remote_dbh->prepare( $query ); $local_sth->execute(); $remote_sth->execute(); my ($local_result) = $local_sth->fetchrow_array(); my ($remote_result) = $remote_sth->fetchrow_array(); if ($local_result != $remote_result) { print "$table: \tL ($local_result)\t", "R ($remote_result)\n"; } } else { print "missing $table\n" ; } } $local_dbh->disconnect(); $remote_dbh->disconnect(); [download] This simple code will just compare which tables are present in database 1 but missing in the database 2, and which ones have different number of records. You can change the portion between my $query = qq{...}* and if ($local_result ...) to suit your purposes. Notice that, when the comparison by COUNTing gives back the same number, then you need to compare further by specific fields, which you either know in advance (if all your tables have a "telling" field with a CRC, for instance) or you have to get the field list from the engine and carry out a more detailed comparison. A workable idea is to identify sensible grouping fields and then compare queries with a GROUP BY clause, in order to avoid unnecessary record by record comparison. The bottom line is that you should not get crude data from the database, but first organize it and process the results with Perl.	[reply] [d/l] [select]