Re^2: Seperating individual lines of a file

Replies are listed 'Best First'.
Re^3: Seperating individual lines of a file by chargrill (Parson) on Feb 02, 2006 at 22:16 UTC
First, that's an interesting use of push - you could just do `@DATA=<ORIGFILE>;`, but I don't think that's a big deal. Second, you'll find that you'll want to open your file for writing by using the `open, FILEHANDLE, ">filename"` nomenclature. Third, as written, your code will open a file called "host1 BUNCHOFDATAINALINE". You might want to `split( /pattern/, expression )` each line of the file so you can separately refer to the separate pieces of data, like `( $HOSTNAME, $SOMEDATA )`. --chargrill `$/ = q#(\w)# ; sub sig { print scalar reverse join ' ', @_ } + sig map { s$\$/\$/$\$2\$1$g && $_ } split( ' ', ",erckha rlPe erthnoa stJu +" );` [download]	[reply] [d/l] [select]
Re^4: Seperating individual lines of a file by tgrossner (Novice) on Feb 02, 2006 at 22:24 UTC
ok, on the split function, can i split $line into two sections, one being the first 16 characters, then name the file via this string of characters?	[reply]
Re^5: Seperating individual lines of a file by chargrill (Parson) on Feb 02, 2006 at 23:20 UTC
If you're specifically interested in just the first 16 characters, you could also look into `substr( EXPR, OFFSET, LENGTH)`. When I saw "`host1 BUNCHOFDATAONALINE`" I assumed that splitting on whitespace was what you were looking for, but when you phrase it as "one being the first 16 characters", `substr` comes to mind. You may wish to examine the differences between split and substr to see which would suit you better. --chargrill `$/ = q#(\w)# ; sub sig { print scalar reverse join ' ', @_ } + sig map { s$\$/\$/$\$2\$1$g && $_ } split( ' ', ",erckha rlPe erthnoa stJu +" );` [download]	[reply] [d/l] [select]
Re^5: Seperating individual lines of a file by blazar (Canon) on Feb 03, 2006 at 12:38 UTC
ok, on the split function, can i split $line into two sections, one being the first 16 characters, then name the file via this string of characters? Yes, you can, but then chances are that substr or unpack are better suited for the tast. Split works best for splitting on a pattern. Well, more precisely it is exactly for splitting on a pattern!	[reply]
Re^3: Seperating individual lines of a file by graff (Chancellor) on Feb 03, 2006 at 04:13 UTC
Something that chargrill didn't mention, but might be an issue: Doing a file open and file close for every line can get really expensive and time consuming if there happen to be thousands of lines of input. Perl allows you to store file handles in a hash, so you can open a new file each time you see a new "hostname" string, and just re-use that handle whenever you see the same name again: `# set $listfile to some constant, or to $ARGV[0] (and supply the file +name # as a command-line arg when you run the script) my %outfh; # hash to hold output file handles open ORIGFILE, $listfile or die "$listfile: $!"; while ( <ORIGFILE> ) { my ( $host, $data ) = split " ", $_, 2; if ( ! exists( $outfh{$host} )) { open( $outfh{$host}, ">", $host ) or die "$host: $!"; } print $outfh{$host} $data; } # perl will flush and close output files when done` [download] Of course, if there are lots of different host names in the input file (or if there is something really wrong and unexpected in the list file contents), the script would die when it tries to open too many file handles.	[reply] [d/l]
Re^4: Seperating individual lines of a file by tgrossner (Novice) on Feb 03, 2006 at 16:25 UTC
I am trying out your code; I replaced the $listfile with $ARGV[0]. `#!/usr/bin/perl # set $listfile to some constant, or to $ARGV[0] (and supply the file #+name # as a command-line arg when you run the script) my %outfh; # hash to hold output file handles open ORIGFILE, $ARGV[0] or die "$ARGV[0]: $!"; while ( <ORIGFILE> ) { my ( $host, $data ) = split " ", $_, 2; if ( ! exists( $outfh{$host} )) { open( $outfh{$host}, ">", $host ) or die "$host: $!"; } print $outfh{$host} $data; } # perl will flush and close output files when done` [download] But this produces a syntax error of `Scalar found where operator expected at ./nocsplit.pl line 17, near "} + $data" (Missing operator before $data?) syntax error at ./nocsplit.pl line 17, near "} $data" Execution of ./nocsplit.pl aborted due to compilation errors.` [download] It seems to not like the `print to $outfh{$host} $data;` [download]	[reply] [d/l] [select]
Re^5: Seperating individual lines of a file by Not_a_Number (Prior) on Feb 03, 2006 at 20:14 UTC
There may be a more elegant solution, but graff's code works if you change the line: `print $outfh{$host} $data;` to: `my $fh = $outfh{$host}; print $fh $data;` [download] dave	[reply] [d/l] [select]
Re^5: Seperating individual lines of a file by blazar (Canon) on Feb 05, 2006 at 07:58 UTC
`print $outfh{$host} $data;` [download] In addition to Not_a_Number's solution by means of assigning to a temporary variable, another possible one is that given in perldoc -f print: `print { $outfh{$host} } $data;` [download]	[reply] [d/l] [select]
Re^3: Seperating individual lines of a file by blazar (Canon) on Feb 03, 2006 at 12:31 UTC
People generally do `use strict; use warnings;` [download] nowadays, and that's the single best piece of advice I can give you! Also, people do `my @DATA=<ORIGFILE>;` [download] but then they also prefer to avoid slurping in files all at once, and they iterate on the lines instead with a `while` loop rather than with a `for` one: `while (my $line=<ORIGFILE>) { # ...` [download] In any case you have to specify `'>'` mode in open for writing (and `'>>'` for appending). More generally I recommend you to stick with the three args form of open and lexical handles, and always check the return value: open my $in, '<', "whatever" or die "can't open `whatever': $!\n"; open my $out, '>', "whatever" or die "can't open `whatever': $!\n"; [download]	[reply] [d/l] [select]