Your description has a mismatch between the order of data specified and the order captured:

DATA: "joe(lots of spaces) 0.0000E(one space)000 (spaces) 9.0720E-001 (lots of spaces) d23 ..."

CAPTURE: "... the first three "words" ("joe", "0.00E 000", and "d23") ..."

Your description of the spaces separating the data fields is inconsistent. The following assumes "(amount of spacing between each "group of characters" is irregular, but always more than just a single space)" is more accurate than, for instance, the highly vague "a whole bunch spaces".

My best guess is that the spaces separating the data fields match /\s{2,}/. On this basis, you can simply use split:

#!/usr/bin/env perl -l use strict; use warnings; while (<DATA>) { chomp; print "Line $.: $_"; print for split /\s{2,}/; } __DATA__ joe 0.00E 000 9.0720E-001 d23 9.0208E-001 joe2 0.00E-000 9.0720E 001 d23 9.0208E 001 joe3 0.00E 000 9.0720E-001 d23 9.0208E 001 joe4 0.00E-000 9.0720E 001 d23 9.0208E-001

Output:

Line 1: joe 0.00E 000 9.0720E-001 d23 9.0208E-001 joe 0.00E 000 9.0720E-001 d23 9.0208E-001 Line 2: joe2 0.00E-000 9.0720E 001 d23 9.0208E 001 joe2 0.00E-000 9.0720E 001 d23 9.0208E 001 Line 3: joe3 0.00E 000 9.0720E-001 d23 9.0208E 001 joe3 0.00E 000 9.0720E-001 d23 9.0208E 001 Line 4: joe4 0.00E-000 9.0720E 001 d23 9.0208E-001 joe4 0.00E-000 9.0720E 001 d23 9.0208E-001

— Ken


In reply to Re: How to extract these groups of characters? by kcott
in thread How to extract these groups of characters? by Anonymous Monk

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.