Storing multiple blocks of text in the __DATA__ section

Re^2: Storing multiple blocks of text in the __DATA__ section

by blindluke (Hermit) on Jan 02, 2015 at 13:54 UTC

Thanks, the module looks interesting.

- Luke

Re: Storing multiple blocks of text in the __DATA__ section
by Athanasius (Archbishop) on Jan 02, 2015 at 13:40 UTC

Hello blindluke,

Have you looked through the thread multiple __DATA__ && __END__?

Hope that helps,

Athanasius <°(((>< contra mundum Iustus alius egestas vitae, eros Piratica,

Re^2: Storing multiple blocks of text in the __DATA__ section

by blindluke (Hermit) on Jan 02, 2015 at 14:49 UTC

Now I have. Thank you for linking to the thread, I found this reply by Coruscate especially interesting.

- Luke

Re: Storing multiple blocks of text in the __DATA__ section
by LanX (Saint) on Jan 02, 2015 at 16:11 UTC

<<ENDS

<<"ENDS"

<<'ENDS'
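For reference, the three here-doc forms above differ only in how the body is quoted: a bare terminator behaves like the double-quoted form and interpolates, while the single-quoted form leaves the text untouched. A minimal sketch (the variable `$name` and the sample text are made up for illustration):

```perl
use strict;
use warnings;

my $name = 'world';   # hypothetical variable, only here to show interpolation

# Bare terminator: interpolates, exactly like the double-quoted form.
my $bare = <<ENDS;
hello $name
ENDS

# Double-quoted terminator: interpolation made explicit.
my $double = <<"ENDS";
hello $name
ENDS

# Single-quoted terminator: no interpolation, text is taken literally.
my $single = <<'ENDS';
hello $name
ENDS

print $bare, $double, $single;
```

For blocks of descriptive text that may contain `$` or `@`, the single-quoted form is usually the safer choice.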

2. you are free to use multiple here-docs in the same line, so if the number of blocks is too small for concerns about being DRY, you can write

use Data::Dump;

my %desc;
init_desc();

dd \%desc;

sub init_desc {
    @desc{ONE,TWO}= (<<'__ONE__',<<'__TWO__');
one
__ONE__
two
__TWO__
}

3. Please note that I've moved the population part into a sub, init_desc(); this way you can keep several such initializations hidden at the end of your code.

4. The aforementioned solution isn't as DRY as you wanted, but your split solution actually wasn't too bad, though using grep to ignore the first line is dangerous:

my %desc = init_desc();

dd \%desc;

sub init_desc {
    (undef,my %hash) =                                      #  ignore first line
      split /^ \[ (\w+) \] \s* $/xm,  <<'__ENDS__';
[ONE]
one

[TWO]
two
__ENDS__

    return %hash;
}

(potential trimming of leading and trailing "\n" is left as an exercise).
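One possible take on that exercise, as a sketch rather than a definitive answer: after the split, strip the newline left over from each header line and collapse trailing blank lines to a single newline. The sub name `parse_sections` is made up for this example.

```perl
use strict;
use warnings;

# Hypothetical helper: split "[NAME]" sections and trim the values.
sub parse_sections {
    my ($text) = @_;

    # The capture group keeps the section names in the result list;
    # the field before the first header is discarded via undef.
    my (undef, %hash) = split /^ \[ (\w+) \] \s* $/xm, $text;

    for my $value (values %hash) {   # values() aliases the hash entries
        $value =~ s/\A\n+//;         # drop the newline(s) after the header
        $value =~ s/\n+\z/\n/;       # collapse trailing blank lines to one "\n"
    }
    return %hash;
}

my %desc = parse_sections(<<'__ENDS__');
[ONE]
one

[TWO]
two
__ENDS__

print "$_ => $desc{$_}" for sort keys %desc;
```

Blank lines inside a section are preserved; only the leading and trailing ones are touched.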

5. please note that the last approach can also be used to parse a slurped __DATA__ section.
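A minimal sketch of that idea, assuming the same "[NAME]" header syntax. An in-memory filehandle stands in for DATA here so the snippet is self-contained; in a real script you would pass \*DATA instead. The sub name `slurp_sections` is made up for this example.

```perl
use strict;
use warnings;

# Stand-in for the DATA handle (assumption: a real script would use \*DATA).
my $source = "[ONE]\none\n\n[TWO]\ntwo\n";
open my $fh, '<', \$source or die "cannot open in-memory file: $!";

# Hypothetical helper: slurp a handle and split it into named sections.
sub slurp_sections {
    my ($handle) = @_;

    # With $/ undefined, a single read returns the whole remaining input.
    my $text = do { local $/; <$handle> };

    # Same split as above: keep the names, drop the field before the first header.
    (undef, my %hash) = split /^ \[ (\w+) \] \s* $/xm, $text;
    return %hash;
}

my %desc = slurp_sections($fh);
print join(', ', sort keys %desc), "\n";
```

The values still carry the newline that followed each header line, so some trimming is likely wanted afterwards.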

Cheers Rolf

_{(addicted to the Perl Programming Language and ☆☆☆☆ :)}

Re^2: Storing multiple blocks of text in the __DATA__ section

by blindluke (Hermit) on Jan 03, 2015 at 14:35 UTC

Thank you for taking the time to reply, and providing all those suggestions.

Ad 1: Noted. I expect that due to this behavior, even if I wanted interpolation, it would still be better (in terms of style) to write <<"ENDS" to make it explicit that I am aware of it.

Ad 2&3: Excellent stuff, I was not aware of this possibility (multiple here-docs in the same line). Thank you!

Ad 4: Very interesting. I assume that the danger associated with grep is the fact that it will remove both undef (as intended) and anything that evaluates as non-true (could pose a problem). Is there any other danger involved? Not that it would matter much, as your solution is definitely better, but I'm just curious.

Ad 5: Slurping __DATA__ is exactly what I had in mind when asking about possible split usage.
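To make the concern from Ad 4 concrete, here is a small sketch. It assumes the filter in question was something like grep { $_ } over the split result (the exact original expression is not shown in this thread), and it uses a made-up input where one section is empty:

```perl
use strict;
use warnings;

# Hypothetical input in which section ONE has no text at all.
my $text = "[ONE]\n[TWO]\ntwo\n";

# split keeps the captured names and yields an empty leading field.
my @fields = split /^\[(\w+)\]\n/m, $text;   # ('', 'ONE', '', 'TWO', "two\n")

# Assumed grep-style filter: drops every false field, including ONE's empty value.
my @filtered = grep { $_ } @fields;          # ('ONE', 'TWO', "two\n")

# Dropping only the first field keeps names and values paired.
my (undef, @kept) = @fields;                 # ('ONE', '', 'TWO', "two\n")

printf "grep: %d fields, positional: %d fields\n", scalar @filtered, scalar @kept;
```

With the grep variant the list ends up with an odd number of elements, so assigning it to a hash would pair ONE with the string 'TWO'. A section whose whole text is the single character 0 would be dropped the same way.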

- Luke

Re: Storing multiple blocks of text in the __DATA__ section
by gnosti (Chaplain) on Jan 02, 2015 at 20:38 UTC

Data::Section::Simple

Re: Storing multiple blocks of text in the __DATA__ section
by graff (Chancellor) on Jan 02, 2015 at 16:44 UTC

#!/usr/bin/perl

use strict;
use warnings;

my %structure;

{
    local $/ = "";  # input record separator = empty string for "paragraph mode"
    while (<DATA>) {
        s/^(.*)\n//; # first line is key string
        $structure{$1} = $_;
    }
}

print "key: $_ / value:\n$structure{$_}\n----\n" for ( sort keys %structure );

__DATA__
first_key
Here's some data to
go with the first key

key_3
Third key gets
this part

key number 2
This element of %structure
has spaces in the hash key.

UPDATED to localize the use of paragraph-mode.

Re^2: Storing multiple blocks of text in the __DATA__ section

by LanX (Saint) on Jan 02, 2015 at 17:29 UTC

> but - oddly enough - I didn't see anyone mention ... use "paragraph mode"

The OP wanted to allow multiple paragraphs in one section, and IMHO this isn't easily done with $/ .

E.g., using multiple newlines, as in "\n\n", is a bit too error-prone, and other separators would become part of the sections and would need to be filtered out again.

Cheers Rolf

_{(addicted to the Perl Programming Language and ☆☆☆☆ :)}

update

use Data::Dump;

my %desc=init_data();

dd \%desc;


sub init_data {
    my $sep = "\n=====\n";
    local $/ = $sep; 
    my %hash;
    while (<DATA>) {
        s/$sep$//;     # kill separator
        s/^(.*)\n//;   # first line is key string
        $hash{$1} = $_;
    }
    return %hash;
}

    

__DATA__
ONE
one


=====
TWO
two

two


=====
THREE

  Three

  three

Re^3: Storing multiple blocks of text in the __DATA__ section

by graff (Chancellor) on Jan 02, 2015 at 17:55 UTC

$/ = "";
while (<DATA>) {
    s/^(.*)\n//;
    $key = $1;
    s/\n==(?=\n)/\n/g;
    $structure{$key} = $_;
}

__DATA__
key1
Here's a text block including blank lines ("encoded" as "==" in the perl script):
==
and here's a part of the block that's enclosed within "blank lines"
==
and here's the last part of the value for key1.

key2
blah blah
etc.

Re^4: Storing multiple blocks of text in the __DATA__ section

by blindluke (Hermit) on Jan 03, 2015 at 13:53 UTC

Re: Storing multiple blocks of text in the __DATA__ section
by LanX (Saint) on Jan 02, 2015 at 16:28 UTC

> Does anyone know of a Config:: module that would accept such syntax?

Searching CPAN for Config:: turned up several candidates, such as Config::IniFiles, Config::Simple and Config::General.

It's always a question of which extra features you need or want to avoid.

Cheers Rolf

_{(addicted to the Perl Programming Language and ☆☆☆☆ :)}

Re^2: Storing multiple blocks of text in the __DATA__ section

by blindluke (Hermit) on Jan 03, 2015 at 11:59 UTC

I already use (and adore) Config::IniFiles, but it does not accept such simple syntax. It does, however, accept multiline values for the params, but then the config would have to look like this:

[general]
Room=<<EOT
A simple multiline text
description
EOT

Wall=<<EOT
Another multiline 
wall description

With two paragraphs.
EOT

In recent versions of Config::IniFiles, you can specify a default section, so the first line of the above example could be omitted by doing:

$cfg = Config::IniFiles->new( -file => *DATA, -default => "general" );

Still, this is the same heredoc syntax which I was trying to avoid in the first place.

Fortunately, gnosti has found the Data::Section::Simple module that seems to do exactly what I was searching for. His recommendation, and your excellent first reply, add to the reasons why I love our Monastery. Thank you.

- Luke

Re: Storing multiple blocks of text in the __DATA__ section
by RMGir (Prior) on Jan 02, 2015 at 13:43 UTC

#!/usr/bin/env perl

use strict;
use warnings;

use Data::Dumper;

my $data = join "",<DATA>;
my $config = eval "{$data}" or die "eval failed, $@";

print Dumper($config);

__DATA__
    foo => "This is foo's data"
  , bar => qq{this is bar's data
it includes a newline
and other stuff}
  , baz => { bazfoo => "baz is more complex"
           , bazbar => "it contains a sub-hash"
           }

Mike

Re^2: Storing multiple blocks of text in the __DATA__ section

by blindluke (Hermit) on Jan 02, 2015 at 14:40 UTC

Thanks for the reply and your time, but this solution just moves the variable assignments from within the code to the DATA section at the end of it.

What would be the use of such a thing? My point was never moving the text to a specific place in my code, and placing it in __DATA__ was never an end unto itself.

The point is making the code easier to read by putting the text descriptions as far away from the code syntax as possible. That way, someone can open the file, ignore all the Perl code, and edit the descriptions as any text document.

- Luke

Re: Storing multiple blocks of text in the __DATA__ section
by thargas (Deacon) on Jan 02, 2015 at 15:24 UTC

Data::Embed

Re: Storing multiple blocks of text in the __DATA__ section
by RonW (Parson) on Jan 06, 2015 at 00:04 UTC

Since you indicated an interest in ultimately moving the data to a separate file, I suggest YAML::Tiny.

Using YAML::Tiny your data file would look like:

room: >
    some text here, probably something
    a few lines long

wall: |
    another text, here, this time
    pre-formatted (but must be indented)

However, I'm not sure it will handle multiple paragraphs. The pre-formatted syntax is more likely to, because it uses indentation.

That said, your parser for your syntax might just be the best choice for your application.