perlmad has asked for the wisdom of the Perl Monks concerning the following question:
Hi Monks
I have a CSV file and I need to parse its data and write it to a separate file, filtered by type.
    input_parse_and_output("Bill");
    input_parse_and_output("Bond");
    input_parse_and_output("Note");

    # Subroutine has two arguments 1.Input file name 2. Output file directory 3. Output file name all of these are scalar
    sub input_parse_and_output{
        foreach my $input_data_each_line(@input_data_in_array){
            chomp($input_data_each_line);
            # if the current line does not contain cusip then the program will move to the next line
            next unless $input_data_each_line=~ /cusip/;
            next unless $input_data_each_line=~ /"securityType":"$_[0]"/;
            if(defined $input_data_each_line){
                foreach my $output_header_names_temp(@output_header_names){
                    chomp($output_header_names_temp);
                    if(defined $output_header_names_temp){
                        # regex to replace the double quotes to blank string
                        #input_data_each_line=~ s/"//g;
                        $input_data_each_line=~ /$output_header_names_temp:([\w\d\-\$\%\:\!\@\&\*\.]+)/;
                        if(defined $1){
                            my $temp=$1;
                            if($temp=~ /\:/){
                                # regex to get exactly the first ten characters, each a digit or hyphen
                                $temp=~ /^([\d\-]{10})/;
                                print OUTPUT_FILE_WRITE "$1\t";
                            }
                            else {
                                print OUTPUT_FILE_WRITE "$1\t";
                            }
                        }
                        else {
                            print OUTPUT_FILE_WRITE "NULL\t";
                        }
                    }
                }
                print OUTPUT_FILE_WRITE "\n";
            }
        }
    }
In the code above, @input_data_in_array contains 100 lines, and the subroutine is called 3 times so that the output comes out ordered by "Bill", "Bond", "Note".
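One way to avoid the three calls, sketched here under assumptions rather than as a drop-in fix: capture the securityType once per line, push the line into a per-type bucket, and print the buckets in the wanted order afterwards. The sample lines below are invented for illustration (the real input format is not shown in the post), and %lines_by_type, @type_order and @grouped are hypothetical names.

```perl
use strict;
use warnings;

# Hypothetical sample lines, invented for illustration only.
my @input_data_in_array = (
    '{"cusip":"912828Q86","securityType":"Note"}',
    '{"cusip":"912796HD4","securityType":"Bill"}',
    '{"cusip":"912810RS9","securityType":"Bond"}',
    '{"noid":"line without the required key"}',
);

# The order in which the groups should be written out.
my @type_order = qw(Bill Bond Note);

# One bucket (array reference) per wanted type.
my %lines_by_type = map { $_ => [] } @type_order;

for my $line (@input_data_in_array) {
    # Same first filter as the original subroutine.
    next unless $line =~ /cusip/;

    # Capture the type once instead of testing one type per pass.
    my ($type) = $line =~ /"securityType":"(\w+)"/ or next;

    # Ignore types that were not asked for.
    next unless exists $lines_by_type{$type};

    push @{ $lines_by_type{$type} }, $line;
}

# Flatten the buckets in the required order; input order is kept within a type.
my @grouped = map { @{ $lines_by_type{$_} } } @type_order;

print "$_\n" for @grouped;
```

The per-field extraction from the original subroutine could then run over @grouped once, so the 100 lines are scanned a single time.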
A sample of the input file data is given below:
    912828Q86  Note  1-Year   2016-05-25  2016-05-27  2018-04-30  100.003850  NULL
    912796HD4  Bill  4-Week   2016-01-26  2016-01-28  2016-02-25  99.977056   NULL
    912810RS9  Bond  30-Year  2016-05-12  2016-05-16  2046-05-15  97.619462   2.500000
    912810RQ3  Bond  29-Year  2016-04-14  2016-04-15  2046-02-15  98.011430   2.500000
    912796HD4  Bill  4-Week   2016-01-26  2016-01-28  2016-02-25  99.977056   NULL
    912828N71  Note  9-Year   2016-05-19  2016-05-31  2026-01-15  103.533587  0.625000
In this program the array contents are read 3 times, so it takes longer to complete. I need it done in a single pass over the data, with the output still grouped by "Bill", "Bond", "Note".
Expected output
    912796HD4  Bill  4-Week   2016-01-26  2016-01-28  2016-02-25  99.977056   NULL
    912796HD4  Bill  4-Week   2016-01-26  2016-01-28  2016-02-25  99.977056   NULL
    912810RS9  Bond  30-Year  2016-05-12  2016-05-16  2046-05-15  97.619462   2.500000
    912810RQ3  Bond  29-Year  2016-04-14  2016-04-15  2046-02-15  98.011430   2.500000
    912828Q86  Note  1-Year   2016-05-25  2016-05-27  2018-04-30  100.003850  NULL
    912828N71  Note  9-Year   2016-05-19  2016-05-31  2026-01-15  103.533587  0.625000
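If the rows are already in the tabular form shown above, the expected ordering can also be produced with a single sort. The following is only a sketch: it assumes the fields are whitespace-separated with the type in the second column, and that every row carries one of the three known types. %rank, @rows and @sorted are hypothetical names. The original row index is used as a tie-breaker so that rows of the same type keep their input order, as in the expected output.

```perl
use strict;
use warnings;

# Sample rows copied from the question (assumed whitespace-separated).
my @rows = (
    '912828Q86 Note 1-Year 2016-05-25 2016-05-27 2018-04-30 100.003850 NULL',
    '912796HD4 Bill 4-Week 2016-01-26 2016-01-28 2016-02-25 99.977056 NULL',
    '912810RS9 Bond 30-Year 2016-05-12 2016-05-16 2046-05-15 97.619462 2.500000',
    '912810RQ3 Bond 29-Year 2016-04-14 2016-04-15 2046-02-15 98.011430 2.500000',
    '912796HD4 Bill 4-Week 2016-01-26 2016-01-28 2016-02-25 99.977056 NULL',
    '912828N71 Note 9-Year 2016-05-19 2016-05-31 2026-01-15 103.533587 0.625000',
);

# Position of each type in the wanted output order.
my %rank = (Bill => 0, Bond => 1, Note => 2);

# Decorate each row with [type rank, original index, row], sort on rank
# and then on index, and finally strip the decoration again.
my @sorted =
    map  { $_->[2] }
    sort { $a->[0] <=> $b->[0] or $a->[1] <=> $b->[1] }
    map  { [ $rank{ (split ' ', $rows[$_])[1] }, $_, $rows[$_] ] }
    0 .. $#rows;

print "$_\n" for @sorted;
```

Each row is split only once, before the sort, so the comparison itself works on precomputed numbers.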
Your suggestions are much appreciated.
Replies are listed 'Best First'.

Re: File content sorting based on type
by Corion (Patriarch) on May 31, 2016 at 11:12 UTC

Re: File content sorting based on type
by Corion (Patriarch) on May 31, 2016 at 11:01 UTC
by perlmad (Sexton) on May 31, 2016 at 11:06 UTC

Re: File content sorting based on type
by Marshall (Canon) on May 31, 2016 at 12:08 UTC
by johngg (Canon) on May 31, 2016 at 18:04 UTC
by Marshall (Canon) on Jun 01, 2016 at 17:25 UTC

Re: File content sorting based on type
by Anonymous Monk on May 31, 2016 at 10:45 UTC