
Here's my start on the code:

#!/usr/bin/perl
open ORIGFILE, "noc.060202_13";
#print ORIGFILE;
push(@DATA,<ORIGFILE>);
foreach my $LINE (@DATA){
    open FH, "$LINE[0]";
    print FH $LINE;
    close FH;
}

After setting up the foreach loop, I can print $LINE to STDOUT and see each line, but I can't seem to pull out the first element as $LINE[0] to name the file.

Re^3: Seperating individual lines of a file
by chargrill (Parson) on Feb 02, 2006 at 22:16 UTC

    First, that's an interesting use of push - you could just do @DATA=<ORIGFILE>;, but I don't think that's a big deal.

    Second, to open your file for writing you'll want the open FILEHANDLE, ">filename" form.

    Third, as written, your code will open a file called "host1 BUNCHOFDATAINALINE". You might want to split( /pattern/, expression ) each line of the file so you can separately refer to the separate pieces of data, like ( $HOSTNAME, $SOMEDATA ).
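A minimal sketch of that split, assuming each line holds a host name, some whitespace, and then the rest of the data. The sample line and the split_line helper are made up for illustration:

```perl
use strict;
use warnings;

# Hypothetical helper: split a line into the host name and everything after it.
# The limit of 2 keeps any whitespace inside the data portion intact.
sub split_line {
    my ($line) = @_;
    chomp $line;
    my ( $hostname, $somedata ) = split ' ', $line, 2;
    return ( $hostname, $somedata );
}

# Made-up sample line; real input would come from the list file.
my ( $HOSTNAME, $SOMEDATA ) = split_line("host1 BUNCH OF DATA IN A LINE\n");
print "file name: $HOSTNAME\n";
print "contents:  $SOMEDATA\n";
```

Only $HOSTNAME would then go into the open call, and $SOMEDATA is what gets printed to that file.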



    --chargrill
    $/ = q#(\w)# ; sub sig { print scalar reverse join ' ', @_ } sig map { s$\$/\$/$\$2\$1$g && $_ } split( ' ', ",erckha rlPe erthnoa stJu" );
      OK, on the split function: can I split $line into two sections, one being the first 16 characters, and then name the file using that string of characters?

        If you're specifically interested in just the first 16 characters, you could also look into substr(EXPR, OFFSET, LENGTH). When I saw "host1 BUNCHOFDATAINALINE" I assumed that splitting on whitespace was what you were looking for, but when you phrase it as "one being the first 16 characters", substr comes to mind.

        You may wish to examine the differences between split and substr to see which would suit you better.
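To make the difference concrete, here is a small sketch. It assumes the name sits in a 16-character, space-padded field at the start of the line; that layout, the sample data, and both helper names are assumptions for illustration:

```perl
use strict;
use warnings;

# Hypothetical helper: exactly the first 16 characters, padding included.
sub first_16_chars {
    my ($line) = @_;
    return substr( $line, 0, 16 );
}

# Hypothetical helper: everything up to the first run of whitespace.
sub first_word {
    my ($line) = @_;
    my ($word) = split ' ', $line;
    return $word;
}

# Made-up sample: a name padded to 16 characters, followed by the data.
my $line = sprintf( "%-16s%s\n", 'host1', 'some data here' );

print 'substr: [', first_16_chars($line), "]\n";
print 'split:  [', first_word($line),     "]\n";
```

With substr the padding comes along and would end up in the file name, so it would need trimming; split drops it but breaks if the name itself can contain spaces.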



        --chargrill
        OK, on the split function: can I split $line into two sections, one being the first 16 characters, and then name the file using that string of characters?
        Yes, you can, but chances are that substr or unpack is better suited to the task. Split works best for splitting on a pattern. Well, more precisely, it is exactly for splitting on a pattern!
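For the unpack route, a minimal sketch, again assuming a fixed 16-character name field followed by the data (the split_fixed helper and the sample line are hypothetical):

```perl
use strict;
use warnings;

# Hypothetical helper: 'A16' takes the first 16 characters and strips
# trailing spaces; 'A*' takes whatever is left on the line.
sub split_fixed {
    my ($line) = @_;
    chomp $line;
    return unpack 'A16 A*', $line;
}

# Made-up sample: a name padded to 16 characters, followed by the data.
my $line = sprintf( "%-16s%s\n", 'host1', 'some data here' );
my ( $name, $rest ) = split_fixed($line);

print "name: [$name]\n";
print "rest: [$rest]\n";
```

Compared with substr, the 'A' format trims the padding for you, which is handy when the field becomes a file name.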
Re^3: Seperating individual lines of a file
by graff (Chancellor) on Feb 03, 2006 at 04:13 UTC
    Something that chargrill didn't mention, but might be an issue:

    Opening and closing a file for every line can get really expensive and time-consuming if there happen to be thousands of lines of input.

    Perl allows you to store file handles in a hash, so you can open a new file each time you see a new "hostname" string, and just re-use that handle whenever you see the same name again:

    # set $listfile to some constant, or to $ARGV[0] (and supply the file name
    # as a command-line arg when you run the script)
    my %outfh;  # hash to hold output file handles
    open ORIGFILE, $listfile or die "$listfile: $!";
    while ( <ORIGFILE> ) {
        my ( $host, $data ) = split " ", $_, 2;
        if ( ! exists( $outfh{$host} )) {
            open( $outfh{$host}, ">", $host ) or die "$host: $!";
        }
        print $outfh{$host} $data;
    }
    # perl will flush and close output files when done
    Of course, if there are lots of different host names in the input file (or if there is something really wrong and unexpected in the list file contents), the script would die when it tries to open too many file handles.
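If the number of distinct host names is a worry, one way around the handle limit (a different technique from the hash of handles above, and only a sketch) is to collect each host's data in memory and then write every file once, with a single handle open at a time. The group_by_host helper, the sample lines, and the scratch directory are all assumptions for the demo:

```perl
use strict;
use warnings;
use File::Temp qw(tempdir);

# Hypothetical helper: collect the data for each host in memory.
sub group_by_host {
    my (@lines) = @_;
    my %data_for;
    for my $line (@lines) {
        my ( $host, $data ) = split ' ', $line, 2;
        next unless defined $host && defined $data;    # skip blank or one-word lines
        $data_for{$host} .= $data;
    }
    return %data_for;
}

# Made-up sample input; a real script would read these lines from the list file.
my @lines    = ( "host1 first\n", "host2 second\n", "host1 third\n" );
my %data_for = group_by_host(@lines);

# Write into a scratch directory so the demo leaves nothing behind.
my $dir = tempdir( CLEANUP => 1 );
for my $host ( sort keys %data_for ) {
    open my $out, '>', "$dir/$host" or die "$dir/$host: $!";
    print {$out} $data_for{$host};
    close $out or die "$dir/$host: $!";
}
```

The trade-off is memory: the whole input is held at once, so this only suits list files that fit comfortably in RAM.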
      I am trying out your code; I replaced the $listfile with $ARGV[0].
      #!/usr/bin/perl
      # set $listfile to some constant, or to $ARGV[0] (and supply the file name
      # as a command-line arg when you run the script)
      my %outfh;  # hash to hold output file handles
      open ORIGFILE, $ARGV[0] or die "$ARGV[0]: $!";
      while ( <ORIGFILE> ) {
          my ( $host, $data ) = split " ", $_, 2;
          if ( ! exists( $outfh{$host} )) {
              open( $outfh{$host}, ">", $host ) or die "$host: $!";
          }
          print $outfh{$host} $data;
      }
      # perl will flush and close output files when done
      But this produces a syntax error:
      Scalar found where operator expected at ./nocsplit.pl line 17, near "} $data"
      (Missing operator before $data?)
      syntax error at ./nocsplit.pl line 17, near "} $data"
      Execution of ./nocsplit.pl aborted due to compilation errors.
      It doesn't seem to like the line
      print $outfh{$host} $data;

        There may be a more elegant solution, but graff's code works if you change the line:

        print $outfh{$host} $data;

        to:

        my $fh = $outfh{$host};
        print $fh $data;
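For what it's worth, the syntax error arises because print only accepts a bareword, a plain scalar variable, or a block as its filehandle; a hash element such as $outfh{$host} has to be wrapped in braces. A small sketch of the block form, writing to an in-memory string so that it runs without touching the disk (the helper name and the 'host1' key are made up):

```perl
use strict;
use warnings;

# Hypothetical helper: print through a handle stored in a hash element.
sub print_via_hash_handle {
    my ($text) = @_;
    my %outfh;
    my $buffer = '';

    # Demo only: the handle writes into $buffer instead of a real file.
    open( $outfh{host1}, '>', \$buffer ) or die "in-memory handle: $!";

    # The braces tell print that the expression inside is the filehandle.
    print { $outfh{host1} } $text;

    close $outfh{host1} or die "close: $!";
    return $buffer;
}

print print_via_hash_handle("some data\n");
```

Both spellings do the same thing; the block form just avoids the temporary variable.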

        dave

Re^3: Seperating individual lines of a file
by blazar (Canon) on Feb 03, 2006 at 12:31 UTC

    People generally do

    use strict;
    use warnings;

    nowadays, and that's the single best piece of advice I can give you!
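As an illustration of why this matters here: under use strict, the $LINE[0] from the original script is rejected at compile time, because it refers to element 0 of an undeclared array @LINE rather than to the scalar $LINE. The sketch below compiles a snippet in a string eval so that the error can be printed instead of aborting; the strict_error_for helper is hypothetical:

```perl
use strict;
use warnings;

# Hypothetical helper: compile a snippet under strict and return the
# error text, or an empty string if it compiled and ran cleanly.
sub strict_error_for {
    my ($code) = @_;
    my $ok = eval "use strict; use warnings; $code; 1";
    return $ok ? '' : $@;
}

# $LINE is declared, but $LINE[0] needs an array @LINE, which is not.
my $error = strict_error_for('my $LINE = "host1 data"; my $name = $LINE[0]');
print $error;
```

Without strict the same expression quietly evaluates to undef, which is why the file name in the original script came out empty.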

    Also, people do

    my @DATA=<ORIGFILE>;

    but then they also prefer to avoid slurping in files all at once, and they iterate on the lines instead with a while loop rather than with a for one:

    while (my $line=<ORIGFILE>) {
        # ...
    }

    In any case you have to specify '>' mode in open for writing (and '>>' for appending). More generally, I recommend that you stick with the three-argument form of open and lexical filehandles, and always check the return value:

    open my $in, '<', "whatever" or die "can't open `whatever': $!\n";
    open my $out, '>', "whatever" or die "can't open `whatever': $!\n";
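Putting those pieces together, here is a minimal sketch of a line-by-line copy with three-argument open, lexical handles, and checked return values. The copy_lines helper, the file names, and the scratch directory are assumptions for the demo:

```perl
use strict;
use warnings;
use File::Temp qw(tempdir);

# Hypothetical helper: copy a file line by line and return the line count.
sub copy_lines {
    my ( $from, $to ) = @_;
    open my $in,  '<', $from or die "can't open `$from': $!\n";
    open my $out, '>', $to   or die "can't open `$to': $!\n";

    my $count = 0;
    while ( my $line = <$in> ) {    # one line at a time, no slurping
        print {$out} $line;
        $count++;
    }

    close $out or die "can't close `$to': $!\n";
    close $in;
    return $count;
}

# Demo in a scratch directory with made-up file names and contents.
my $dir = tempdir( CLEANUP => 1 );
open my $seed, '>', "$dir/in.txt" or die "can't open `$dir/in.txt': $!\n";
print {$seed} "host1 a\n", "host2 b\n";
close $seed or die "can't close `$dir/in.txt': $!\n";

my $copied = copy_lines( "$dir/in.txt", "$dir/out.txt" );
print "copied $copied lines\n";
```

Note that the low-precedence or is what makes the unparenthesised open safe; with || the die would bind to the file name instead of to the result of open.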