Re^3: Textfile to csv with a small twist

Replies are listed 'Best First'.
Re^4: Textfile to csv with a small twist by jZed (Prior) on Aug 25, 2005 at 19:43 UTC
I am still not understanding what output you want. If your data is this: H1: T1.1 T1.2 H2: T2.1 T2.2 Do you want this: H1,H2 "T1.1\nT1.2\n","T2.1\nT2.2\n" Or this: H1,H2 "T1.1\n","T2.1\n" "T1.2\n","T2.2\n" Or something else? And another ambiguity: you haven't mentioned whether headings can repeat in your input data (for example more than one section labeled heading1).	[reply]
Re^5: Textfile to csv with a small twist by Bentov (Novice) on Aug 25, 2005 at 20:15 UTC
Closer to the first one, what I'm exactly looking for is `H1,T1.1 CRLF T1.2 crlf,H2,T2.1 CRLF T2.2 CRLF` [download] While the headings are supposed to be same in all of my files, I.e. they will always appear in the same order, and all of the headings will appear, I know they arn't. So I'm figuring that I'll need to have the heading as part of the record, that way if for example I have something like `H1: T1.1 T1.2 H3: T3.1 T3.2 T3.3` [download] I can generate something like: `H1,T1.1 CRLF T1.2 CRLF,H3,T3.1 CRLF T3.2 CRLF T3.3 CRLF` [download] and from that generate... `H1,T1.1 CRLF T1.2 CRLF,H2,,H3,T3.1 CRLF T3.2 CRLF T3.3 CRLF` [download] Since the H2 heading didn't appear in the data, I will have to force in in the final file. I planned this to be a two steep process. I hope that explains things better. Bentov	[reply] [d/l] [select]
Re^6: Textfile to csv with a small twist by jZed (Prior) on Aug 25, 2005 at 21:17 UTC
You say you want this H1,T1.1 CRLF T1.2 crlf,H2,T2.1 CRLF T2.2 CRLF How will you ever retrieve the data from that? If instead you have this: H1,"T1.1 CRLF T1.2 CRLF" H2,"T2.1 CRLF T2.2 CRLF" You will then be able to retrieve the text for any heading, for example if you print out the text for H2, it will look like this: T2.1 T2.2 Is that what you want? If so, here's how I would do it: #!/usr/bin/perl -w use strict; use IO::Scalar; # not needed if input and output are from files use Text::CSV_XS; my $csv = Text::CSV_XS->new( {binary=>1} ); my @input_data = <DATA>; # turn the input data into a CSV string with records and fields # my $csv_str = text_to_csv( @input_data ); # as a test, turn the CSV string back into a text string # my $output_data = csv_to_text( $csv_str ); # check that the text string created from the CSV is the same # as the original # print "ok!\n" if $output_data eq join '', @input_data; sub text_to_csv { my (@new_row,$new_text,$output_csv,$heading); for my $line (@_) { if ($line =~ /^(\w+:)$/) { $heading = $1; if (@new_row) { $output_csv .= make_row(@new_row,$new_text); } @new_row = ($heading); $new_text = ''; } else { $new_text .= $line; } } $output_csv .= make_row(@new_row,$new_text); } sub csv_to_text { my($input_str)=@_; my $output_str = ''; my $fh = IO::Scalar->new(\$input_str); while (my $cols = $csv->getline($fh)) { last unless @$cols; $output_str .= sprintf "%s\n%s", @$cols; } return $output_str; } sub make_row { my $success = $csv->combine(@_); die "Coulnd't parse '@_'\n" unless $success; return $csv->string . "\n"; } __DATA__ H1: T1.1 T1.2 H2: T2.1 T2.2 [download] Note that only the text_to_csv() sub is needed to do what you asked, the other subs are there as tests and illustrations.	[reply] [d/l]