Dotted hash access

by sfink (Deacon) on Nov 25, 2004 at 09:40 UTC

field names

values

$cost = $h->{$location}{$building} wouldn't work for me. The structure contains many more things than just Locations, and a location has much more data associated with it than a set of Buildings. So I don't want to accidentally get the wrong value back if I have a location name that happens to coincide with some other field name. (Perhaps I should give a more detailed dump of a data set?)

My approach differs from Perl4's because mine can be used hierarchically, unlike $;. So I am free to do

  $locinfo = $h->{"Locations.$location"};
  count_totals($locinfo->{"Resources"});
  compute_upkeep($locinfo->{"Buildings"});
[download]

Re^3: Dotted hash access

by brian_d_foy (Abbot) on Nov 25, 2004 at 19:24 UTC

$location = $thingy->get_location( $l );
[download]

$building = $location->get_building( $b );
[download]

$building = $thingy->get_location( $l )->get_building( $b );

#or

$building = $thingy->get_building_by_location( $l, $b );
[download]

$cost = $thingy->get_location( $l )->get_building( $b )->cost;
$totals = $location->get_totals;
$upkeep = $building->get_upkeep;
[download]

foreach my $l ( $thingy->all_locations )
    {
    foreacn my $b ( $b->all_buildings )
        {
        $b->set_cost( $b->cost + 1 );
        }
    }
[download]

The Perl Review 0.5

while( my $location = $thingy->next_location )
    {
    ...
    }
[download]

--
brian d foy <bdfoy@cpan.org>

Re: Dotted hash access
by diotalevi (Canon) on Nov 24, 2004 at 23:50 UTC

Locations/Earth/Buildings/PowerPlant/cost

by sfink (Deacon) on Nov 25, 2004 at 09:52 UTC

That's not a bad idea. I've used XPath -- well, actually I haven't, but I implemented something very similar to it (and borrowed as much syntax and semantics as I could in the process.) It may be overkill here, but it seems like a nice middle ground between straight Perl data structure access and the generality of SQL.

Re: Dotted hash access
by Zaxo (Archbishop) on Nov 25, 2004 at 03:47 UTC

The advantage of perl's notation is that it is consistent and, with practice, it tells you everything about the structure above the data.

The usual way to make that briefer would be to make a real data accessor,

sub cost {
    my ($self, $location, $b) = @_;
    $self->{Locations}{$location}{Buildings}{$b}{cost};
}
# . . .
my $cost = $h->cost($location, $b)
[download]

cost

It probably would help to break the big data structure into subobjects to which the whole has a 'has-a' relation.

I agree that big deep data structures full of bits of everything are awkward to handle. I don't think notation is the real problem. It's a design matter.

After Compline,
Zaxo

Re: Dotted hash access
by castaway (Parson) on Nov 25, 2004 at 06:34 UTC

Oops, strange, that's how I'd do it were I using a database..

Nothing says you have to put all your data in one huge data structure. Your idea sounds like a nice one, until you realise the restrictions, and that its an elaborate way of getting around not using a DB or a DB-like structure. Theres no need to use a huge DB just use SQLite or something?

by sfink (Deacon) on Nov 25, 2004 at 09:48 UTC

I guess what I'm saying is that I periodically run across cases where the data is fundamentally deeply nested, and I need to start over from a fairly high level frequently (so I can't just grab the nested hash out and pass it into the other functions, avoiding long lookup chains.) Admittedly, it's not a frequent occurrence, but configuration files and simulations seem to be typical examples.

Re^3: Dotted hash access

by castaway (Parson) on Nov 25, 2004 at 13:39 UTC

Re: Dotted hash access
by dimar (Curate) on Nov 25, 2004 at 00:10 UTC

Additionally, who else runs into this problem? Is there some alternative approach that I should be considering?

It depends on how you define "problem" ... I would suspect you aren't the only one who has had this consideration, but there may not be enough "momentum" behind this idea to motivate or inspire something different. I asked a similar question a while back ... Perl complex data structure ... how to get more flexible access? ... outside of using XPATH, or rolling your own code, or looking into the suggestions in the thread cited previously, I have not been able to find exactly what you are asking for.

In other words ... me too!

Re: Dotted hash access
by hardburn (Abbot) on Nov 25, 2004 at 04:06 UTC

It's just syntax. Nothing I would spend too much effort on.

"There is no shame in being self-taught, only in not trying to learn in the first place." -- Atrus, Myst: The Book of D'ni.

Re: Dotted hash access
by Jenda (Abbot) on Nov 25, 2004 at 16:35 UTC

Please don't. At best you save one character between each level of the structure. What you loose is clarity and speed. Think about the poor guy who inherits your programs. He'll have no idea what the heck are you doing there with those dots. Especialy since dots already do have a meaning for strings. Dot is the concatenation operator, and suddenly it should be a separator? And how come this hash behaves in this strange way?

Jenda
We'd like to help you learn to help yourself
Look around you, all you see are sympathetic eyes
Stroll around the grounds until you feel at home
-- P. Simon in Mrs. Robinson

Re: Dotted hash access
by Ctrl-z (Friar) on Nov 25, 2004 at 19:22 UTC

similar question

time was, I could move my arms like a bird and...

Re: Dotted hash access
by Juerd (Abbot) on Nov 25, 2004 at 21:51 UTC

As you can read in another node by me in this thread, I also dislike the typing exercise that one needs to practice every time a deep HoHoHoH element is needed. However, I don't like joining all keys together, because that makes iterating or assigning a reference to a deeper hash hard, or impossible, depending on the time available for hacking up ugly solutions.

Still, if I would join keys together, I'd do so with Perl's own built-in mechanism for that. Supply a list as a hash key and perl automatically joins it with $;. It'd be nice if there was an interpolating qw. :)

Juerd # { site => 'juerd.nl', plp_site => 'plp.juerd.nl', do_not_use => 'spamtrap' }

by sfink (Deacon) on Nov 28, 2004 at 19:08 UTC

(I replied this privately a minute ago, but then it sparked an idea.)

But that gives me an idea, or perhaps it's what you already meant: it would be much better to use the same underlying implementation, but switch to using $; rather than a period as a separator, so that the example would be:

$cost = $h->{'Locations',$location,'Buildings',$building,'cost'};
[download]

No new syntax that way, although it does use an unfamiliar one in these post-Perl4 days. But also no smushing of keys together into one string, even if it's only temporary.

And the more I think about it, the more it looks like this is what you meant -- since my immediate reaction to typing in the example was that an interpolating qw would be really nice! But your point about it being difficult to iterate over or assign to a deeper level isn't a problem with my current implementation.

I think I'll go change my code to take an optional separator string parameter, defaulting to $;, so that you can do it either way. I'm not sure yet which I'll use; the lack of interpolation with the $; approach defeats much of the benefit.

As for Perl6 -- we could always add in more than one interpolating context. You'd still need syntax to select them, of course. How about

  $code = %h{''Locations $location Buildings $building cost''};
[download]

  $code = %h{''Locations'$location'Buildings'$building'cost''};
[download]

  $P1['Locations';$S1;'Buildings';$S2;'cost']
[download]

Re^3: Dotted hash access

by demerphq (Chancellor) on Dec 02, 2004 at 16:16 UTC

The more I read this thread the more I dont understand why you dont maintain a hash that is structured with only two levels, location and then building. Then your code looks like:

my $code=$locations{$location}{$building}{code};
[download]

Also something to keep in mind (although its not hugely critical) each deref takes time, each hash lookup takes time, each unique key takes space. So in some circumstances your dotted approach would result in considerably more memory being taken up by the keys. Not only that but determinisitc traversal of your dotted form of the tree would be quite expensive as compared to the non dotted form. Overall I wouldnt go this route unless i had really strong justification to do so. And style isnt a strong justification IMO :-)

---
demerphq

Re^4: Dotted hash access

by sfink (Deacon) on Dec 03, 2004 at 04:24 UTC

Re: Dotted hash access
by zentara (Cardinal) on Nov 25, 2004 at 12:51 UTC

I'm not really a human, but I play one on earth. flash japh

by TimToady (Parson) on Nov 25, 2004 at 20:57 UTC

    %hash�Foo��Bar��Baz�
[download]

    use supersubscripts;
[download]

    %hash�Foo/Bar/Baz�
[download]

    %hash{'Foo/Bar/Baz'}
[download]

:-)

So the answer to your question is probably "no" for now...

by Juerd (Abbot) on Nov 25, 2004 at 21:44 UTC

Isn't this problem supposed to be corrected in Perl6?

For several reasons, the dot cannot be used safely for hash access. I proposed backticks for this purpose, but it's not going to happen, because many find it too ugly (since when is that reason to not do something in Perl?), and the powers that be have decided. See also http://groups.google.com/groups?selm=20040414121848.GJ3645%40c4.convolution.nl;

Still, I think it elegantly solves the problem.

$cost = $h->{Locations}{$location}{Buildings}{$b}{cost};
[download]

$cost = $h`Locations`$location`Buildings`$b`cost;
[download]

I want this for two reasons:

Typing { and } repeatedly is hard, at least for my hands
Typing {'key'} or «key» is even harder

$cost = $h->{Locations}{$location}{Buildings}{$b}{cost};
[download]

$cost = $h{'Locations'}{$location}{'Buildings'}{$b}{'cost'};
[download]

$cost = $h�Locations�{$location}�Buildings�{$b}�cost�;
[download]

IMO, the OP has a good point. Hash access is nice, and the syntax is certainly doable, but it gets tedious for accessing an deep element in a HoHoHoH. And even though many things are made much easier by Perl 6, this specific thing is IMHO made much worse.

(Please, let's not start another "write your own grammar" subthread.)

Juerd # { site => 'juerd.nl', plp_site => 'plp.juerd.nl', do_not_use => 'spamtrap' }

Re: Dotted hash access
by TomDLux (Vicar) on Nov 25, 2004 at 17:34 UTC

I like the idea of your extended each to iterate through deeply nested structures. It especially simplifies printing data, or makign minor alterations.

More generally, though, my approach is to isolate each layer. Ideally, each level should be an array of objects or a hash of objects, which in turn is an array or hash of objects. Especially so if you have a number of operations carried out on the objects.If there are only one or two operations on the structures, I use a function for each layer:

sub process_frumptions { my ( $frumption_set ) = @_; for my $frumption ( keys %$frumption_set ) { process_one_frumption $frumption; } } sub process_one_frumption { my ( $frumption ) = @_; for my $barfloon ( keys %{$frumption->{'barfloon_gallery'}} ) { process_one_barfloon $barfloon; } }
[download]

Of course, you may have to profile and optimize, but most of the time I find the code runs fast enough, for some definition of "fast enough".

--
TTTATCGGTCGTTATATAGATGTTTGCA