Re^3: Parsing by indentation

Same code, with the regex expanded and commented:

#!/usr/bin/perl

# https://perlmonks.org/?node_id=1224600

use strict;
use warnings;
use Data::Dump 'dd';

my $data = <<'END';
interface XYZ
  given param1 -> child of "interface XYZ"
  given param2 -> child of "interface XYZ"
    given param2.1 -> child of "given param2"
      given param2.1.1 -> child of "given param2.1"
      given param2.1.2 -> child of "given param2.1"
    given param2.2 -> child of "given param2"
  given param3 -> child of "interface XYZ"
  given param4 -> child of "interface XYZ"
interface SECOND
  given param5 -> child of "interface SECOND"
END

my $struct = buildstruct($data);
dd $struct;

sub buildstruct
  {
  my $block = shift;
  my @answers;
  while( $block =~ /^ # make sure to start at beginning of a line (with m)
      (\ *)           # match leading spaces of header line
      (.*)            # match rest of line, save as head
      \n              # and match the newline
      (
      (?:             # match all following lines with
        \1            # same whitespace as head
        \ +           # plus at least one more space ( i.e. indented )
        .*\n          # contents to be looked at later
      )*              # as many as possible
      )               # save as rest
      /gmx )          # global, multiline, and extra whitespace
    {
    my ($head, $rest) = ($2, $3);
    $head =~ s/ ->.*//;
    push @answers, $rest ? { $head => buildstruct($rest) } : $head;
    }
  \@answers;
  }

I hope this helps :)

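The regex matches a header line and captures its leading indentation, then also captures every following line that is indented by that amount plus at least one more space. Those captured lines are the header's children, and they are parsed by the recursive call to buildstruct.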
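To see what the three capture groups hold, here is a minimal sketch using the same regex written on one line (without /x). The helper name top_level_blocks and the four-line sample are made up for illustration; they are not part of the code above.

```perl
use strict;
use warnings;

# Hypothetical helper (not in the original post): return one
# [indent, head, rest] triple per top-level block the regex finds.
sub top_level_blocks {
    my $text = shift;
    my @blocks;
    while ( $text =~ /^(\ *)(.*)\n((?:\1\ +.*\n)*)/gm ) {
        push @blocks, [ $1, $2, $3 ];
    }
    return @blocks;
}

# Made-up sample: "a" owns "b" and "c"; "d" has no children.
my @blocks = top_level_blocks("a\n  b\n    c\nd\n");

for my $blk (@blocks) {
    my ( $indent, $head, $rest ) = @$blk;
    my $child_lines = () = $rest =~ /\n/g;    # count lines captured in $rest
    printf "head=%s indent=%d child_lines=%d\n",
        $head, length($indent), $child_lines;
}
```

Only "a" and "d" show up as heads at this level. The lines for "b" and "c" sit inside the rest of "a" until the recursive call looks at them.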


Re^4: Parsing by indentation by llarochelle (Beadle) on Oct 25, 2018 at 02:14 UTC
Oh yes, it does help. Time to read that regex book one more time... This is just absolutely fantastic! So much power in a handful of lines!
Re^4: Parsing by indentation by llarochelle (Beadle) on Oct 25, 2018 at 14:13 UTC
/msg tybalt89 After all, I thought I understood the line with the ternary operator, but it seems you're doing some other sort of magic with the double arrow operator. If you don't mind explaining this one too ^_^ ?
Re^5: Parsing by indentation by hippo (Archbishop) on Oct 25, 2018 at 15:16 UTC
If you are referring to the Here Document, search for '<<EOF' in perlop for more on that. They also get a mention at the end of the quotes in Perl tutorial (and in the replies).
Re^6: Parsing by indentation by llarochelle (Beadle) on Oct 26, 2018 at 13:41 UTC
Oh, I was talking about the =>, since it seems to have many uses. But in this case it seems to be simply used to assign a value to a key in the hash structure. Have a great day!
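A minimal sketch of that reading, assuming a made-up $head and $rest and a placeholder ['kids'] standing in for what buildstruct($rest) would return: the => is a comma that also quotes a bareword on its left, and the braces around it build an anonymous hash.

```perl
use strict;
use warnings;

# Made-up values for illustration; ['kids'] is only a placeholder
# for the array ref that buildstruct($rest) would return.
my ( $head, $rest ) = ( 'given param2', "    given param2.1\n" );

# "=>" and "," build the same one-key anonymous hash here.
my $with_arrow = { $head => ['kids'] };
my $with_comma = { $head, ['kids'] };

# Same shape as the ternary in buildstruct: a hash ref when there
# are child lines, otherwise just the header string.
my $node = $rest ? { $head => ['kids'] } : $head;

my $no_rest = '';
my $leaf = $no_rest ? { $head => ['kids'] } : $head;

print ref($node) || 'plain string', "\n";    # HASH
print ref($leaf) || 'plain string', "\n";    # plain string
```

So a header with children becomes a one-key hash whose value is the parsed children, and a header without children stays a plain string.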
Re^7: Parsing by indentation by haukex (Archbishop) on Oct 26, 2018 at 14:00 UTC
Re^5: Parsing by indentation by AnomalousMonk (Archbishop) on Oct 25, 2018 at 17:02 UTC
Also see Here document for a general discussion. Give a man a fish: `<%-{-{-{-<`