use strict; use warnings; my $templateformat = 'w<NM>b<NM>cm<CH>sw<SW>'; my $inputexample = 'w8b8cm512swno'; my %type_matchers = ( NM => qr/\d+/, CH => qr/\d+/, SW => qr/yes|no/, ); my @field_names; my @field_types; while ($templateformat =~ /([^<]+)<([^>]+)>/g) { push(@field_names, $1); push(@field_types, $2); } # my $re = '^'; # $re .= "\Q$field_names[$_]\E((?:(?!\Q$field_names[$_+1]\E).)*)" # for 0..$#field_names-1; # $re .= "\Q$field_names[-1]\E(.*)\\z"; # $re = qr/$re/s; my $re = '^'; $re .= "\Q$field_names[$_]\E($type_matchers{$field_types[$_]})" for 0..$#field_names; $re = qr/$re/s; my @field_values = $inputexample =~ $re or die("Input \"$inputexample\" doesn't match the format defined by +template \"$templateformat\"\n"); local $, = "\t"; local $\ = "\n"; print(@field_names); print(@field_values);

outputs

w b cm sw 8 8 512 no

Updated: Replaced the commented paragraph with the one that follows due to a better understanding of the question. Both give the same answer.


In reply to Re: text extraction question by ikegami
in thread text extraction question by echoangel911

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.