Tree Structure and Db

InfiniteLoop has asked for the wisdom of the Perl Monks concerning the following question:

Replies are listed 'Best First'.
Re: Tree Structure and Db by ikegami (Patriarch) on Jul 05, 2005 at 21:31 UTC
If you're primarily going to read the tree, it might be advantageous to store pointers to every ancestor instead of just the direct parent. You can query for all the ancestors in one query, and then build the path in Perl. For example, Nodes ===== id \| node_name \| parent ----+------------------+-------- 1 \| Parent Node1 \| 0 2 \| Child Node 1 \| 1 3 \| Sub Child Node 1 \| 2 4 \| Leaf1 \| 3 HasAncestor =========== child \| ancestor -------+---------- 2 \| 1 3 \| 2 3 \| 1 4 \| 3 4 \| 2 4 \| 1 SELECT Nodes.* FROM Nodes LEFT JOIN HasAncestor ON Nodes.id = HasAncestor.child WHERE Nodes.id = ? Returns for args (3) ==================== id \| node_name \| parent ----+------------------+-------- 1 \| Parent Node1 \| 0 2 \| Child Node 1 \| 1 3 \| Sub Child Node 1 \| 2 [download] This way, you can easily fetch an entire subtree. For example, `SELECT Nodes.* FROM Nodes LEFT JOIN HasAncestor ON Nodes.id = HasAncestor.child WHERE HasAncestor.ancestor = ? Returns for args (2) ==================== id \| node_name \| parent ----+------------------+-------- 3 \| Sub Child Node 1 \| 2 4 \| Leaf1 \| 3` [download] You can still fetch only the immediate children. For example, `SELECT Nodes.* FROM Nodes WHERE Nodes.parent = ? Returns for args (2) ==================== id \| node_name \| parent ----+------------------+-------- 3 \| Sub Child Node 1 \| 2` [download] Similar for the immediate parent.	[reply] [d/l] [select]
Re^2: Tree Structure and Db by pg (Canon) on Jul 06, 2005 at 01:38 UTC
As the author stated, this structure is good for trees that are mostly static. In case the tree can be updated, performance becomes an issue. Even a simple operation as to insert a leaf node, could cause you to insert multiple rows (average number of rows inserted equals average depth of all pathes, and in the worst case, the number of rows get inserted is the length of the deepest path.) Look at one more operation: to move a subtree to be under a new parent. if you only store the immediate parent-child relationship, you only need to modify 1 row, but if you store all ancestor, you will need to modify multiple rows for each node in the subtree (basically to modify all relationships towards a node that is above the root of the subtree.) Data integraty could also be an issue, a simple coding mistake can spread dirty data all over the place, not just an isolated spot.	[reply]
Re^3: Tree Structure and Db by ikegami (Patriarch) on Jul 06, 2005 at 03:00 UTC
Thanks for expanding on this. I intended to do so, but didn't get get a chance 'til now. That is exactly what I meant by my comment.	[reply]
Re^2: Tree Structure and Db by InfiniteLoop (Hermit) on Jul 05, 2005 at 21:38 UTC
Thnx ikegami. I did think of a lookup table, but was the opposite of your design. This is a great help, thnx again.	[reply]
Re: Tree Structure and Db by dbwiz (Curate) on Jul 05, 2005 at 21:32 UTC
You may look at the following sources: storing tree-like structures in tables Recommended modules for Parent-Child trees? Here is a (practical) alternative data structure: DBIx::Tree::NestedSet (CPAN) Storing Hierarchical Data in a Database (off site) Information on Tree data (off site)	[reply]
Re: Tree Structure and Db by sharkey (Scribe) on Jul 06, 2005 at 02:08 UTC
This probably won't help you, but... If you happen to be using Oracle or DB2, those databases have SQL extensions to do tree structured queries, based on a table like you describe. The syntaxes are radically different, so your best bet is to look it up in your database documentation. For Oracle the keyword is "CONNECT BY", and for DB2 the keyword is "WITH". Here's a nice overview: http://www.oreilly.com/catalog/sqlpr/chapter/ch01.pdf	[reply]
Re^2: Tree Structure and Db by hartwig (Sexton) on Jul 07, 2005 at 11:21 UTC
CONNECT BY is not working very well in oracle 9.2 - in oracle 8.X its OK. This is a bug in oracle 9.2 (also admited by oracle ;)) - What kind of DB are you using? Best regards Hartwig	[reply]
Re^3: Tree Structure and Db by InfiniteLoop (Hermit) on Jul 07, 2005 at 19:33 UTC
I use MySQL 4.x	[reply]
Re: Tree Structure and Db by devnul (Monk) on Jul 06, 2005 at 04:19 UTC
... and if you are using Postgres there is ltree which works quite well - dEvNuL	[reply]
Re: Tree Structure and Db by DrHyde (Prior) on Jul 06, 2005 at 09:39 UTC
Take a look at DBM::Deep. The source code for that module is a good read too.	[reply]
Re: Tree Structure and Db by bart (Canon) on Jul 06, 2005 at 16:46 UTC
1. is there an optimal way to design a database table to represent an tree structure ? I think there's a good candidate. I have to thank demerphq for drawing my attention to it, a few months ago. Look up Joe Celko's article trees in SQL. There's several copies of this (and a follow-up) article floating around on the internet — some even with user comments; he also wrote an entire book about it.	[reply]
Re: Tree Structure and Db by vyach (Novice) on Jul 06, 2005 at 07:21 UTC
If your mind is open and you don't fear to change the usual way, you can try something different then relational database. I suppose that XML is more appropriate for hierarcic tree-like structures. You can use XPath for querying them - it was designed exactly for that. Look at the Berkeley DB XML. But, frankly, I had no time to test it with Perl.	[reply]
Re: Tree Structure and Db by anonymized user 468275 (Curate) on Jul 06, 2005 at 13:35 UTC
1) The standard solution is to use a link table from the main table to itself. The link table contains two foreign keys from the same main table, i.e. the main table has TWO one-to-many relationships to the link table, one for parent relation and the other for child relation. 2) But to get performance out of this, it is best to write some access stored procedures (and in some cases perhaps views) which include a GetChild and a GetParent along with any procedures or triggers for insert, update and delete that may be required to simplify and unify access, but which otherwise hide (or rather make it unnecessary to expose) the link-table implementation to the database user ("user" includes any perl code that transacts with it). One world, one people	[reply]
Re^2: Tree Structure and Db by simonm (Vicar) on Jul 06, 2005 at 16:26 UTC
In a tree-wise data structure, with only one parent per child, what is the perceived advantage of using this linking table rather than just adding the parent ID to the main table?	[reply]
Re^3: Tree Structure and Db by anonymized user 468275 (Curate) on Jul 07, 2005 at 11:18 UTC
A link table is the normal way to enforce referential integrity in many to many relationships (0 or 1 counts as many for these purposes) and in this case prevents orphans; it also enables you to define different types of relationship without putting more illegal or awkwardly-implemented constraints (and adding maybe-null foreign keys for them) on the master table. One world, one people	[reply]
Re: Tree Structure and Db by Argel (Prior) on Jul 06, 2005 at 23:00 UTC
. . . is there an optimal way to design a database table to represent an tree structure ? You are storing a tree -- which is a hierarchical data structure -- so why not use a hierarchical database such as OpenLDAP or SUN's Directory Server 5.2? Both seem like a good fit for what you are trying to do (though you would have to learn all the jargon, etc.). I've played with SUN's Directory Server 5.2 a bit but you would probably want to go with OpenLDAP since it is free. There is Net::LDAP on CPAN but if performance is a serious concern you should look into something tailored specificaly to the LDAP server you are using. SUN has perldap (which you can get as part of the directory server resource kit). I have not used OpenLDAP so I am not sure what is availalbe for it. It might be easier to go with a relational database that you are familiar with instead. But since no one else had mentioned it I thought I would toss the LDAP idea out there for consideration. -- Argel	[reply]


Your skill will accomplish what the force of many cannot
	PerlMonks