perlmad has asked for the wisdom of the Perl Monks concerning the following question:
Hi Monks
I have a problem with nested div tags.
My code:
#!/usr/bin/perl
use strict;
use HTML::TokeParser::Simple;

my $parser = HTML::TokeParser::Simple->new(handle => \*DATA);
my @dnldLinks;
my @month_year;
my @date;
my @data;
my $index;
my $contract;

while ( my $div = $parser->get_tag('div') ) {
    if($div->is_start_tag('div')){
        if($div->[1]{class} =~ 'Cell month-year') {
            my $time = $parser->get_trimmed_text;
            push(@month_year,$time);
        }
        if($div->[1]{class} =~ 'Cell Release Date') {
            my $time = $parser->get_trimmed_text;
            push(@date,$time);
            $index=$time;
        }
        if($div->[1]{class} =~ 'Mortgage Contract Rate') {
            my $time = $parser->get_trimmed_text;
            push(@date,$time);
            $contract=$time;
        }
        print "date : $index, data : $contract\n";
    }
}
#use Data::Dumper;
#print Dumper \@dnldLinks;
__DATA__
<div class='historicalChartTable'>
<div class="Row Jun-2015">
<div class="Cell month-year "> Jun-2015 </div>
<div class="Cell Release Date "> 2015-07-30 </div>
<div class="Cell National Mortgage Contract Rate "> 3.850 </div>
</div>
class="Row May-2015">
<div class="Cell month-year even"> May-2015 </div>
<div class="Cell Release Date even"> 2015-06-25 </div>
<div class="Cell National Mortgage Contract Rate even"> 3.750 </div>
</div>
</div>
Getting output:
date : , data :
date : , data :
date : , data :
date : 2015-07-30, data :
date : 2015-07-30, data : 3.850
date : 2015-07-30, data : 3.850
date : 2015-06-25, data : 3.850
date : 2015-06-25, data : 3.750
My code parses every div tag one by one, not recursively, and prints a line for each of them. Kindly help me extract the data from the nested div tags recursively, so that each row gives one line.
Expected output:
date : 2015-07-30, data : 3.850
date : 2015-06-25, data : 3.750
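The extra lines appear because the print statement runs for every div start tag, including the table, row, and month-year divs. One possible way to get a single line per row is to remember the release date and emit a record only when the contract-rate cell is reached. The following is a minimal sketch, not a definitive fix: the helper name extract_rows and the inline $html sample (a well-formed variant of the posted data) are assumptions made for illustration, and it relies on each row listing its release date before its contract rate.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use HTML::TokeParser::Simple;

# Hypothetical helper (not from the original post): returns one
# [release_date, contract_rate] pair per row.
sub extract_rows {
    my ($html) = @_;
    my $parser = HTML::TokeParser::Simple->new( string => $html );
    my ( @rows, $date );

    # get_tag('div') stops at each <div ...> start tag in document order.
    while ( my $div = $parser->get_tag('div') ) {
        my $class = $div->get_attr('class') // '';

        if ( $class =~ /Release Date/ ) {
            # Remember the date; nothing is emitted yet.
            $date = $parser->get_trimmed_text;
        }
        elsif ( $class =~ /Mortgage Contract Rate/ ) {
            # The rate cell completes the row, so record the pair here.
            push @rows, [ $date, $parser->get_trimmed_text ];
            undef $date;
        }
    }
    return @rows;
}

# Sample data modeled on the post (assumed to be well-formed here).
my $html = <<'END_HTML';
<div class='historicalChartTable'>
  <div class="Row Jun-2015">
    <div class="Cell month-year "> Jun-2015 </div>
    <div class="Cell Release Date "> 2015-07-30 </div>
    <div class="Cell National Mortgage Contract Rate "> 3.850 </div>
  </div>
  <div class="Row May-2015">
    <div class="Cell month-year even"> May-2015 </div>
    <div class="Cell Release Date even"> 2015-06-25 </div>
    <div class="Cell National Mortgage Contract Rate even"> 3.750 </div>
  </div>
</div>
END_HTML

for my $row ( extract_rows($html) ) {
    my ( $date, $rate ) = @$row;
    printf "date : %s, data : %s\n", $date // '', $rate;
}
```

With the sample above, this prints one line per row, matching the expected output. Nothing here is actually recursive: the parser still walks the div tags in order, and only the moment of printing changes.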
Replies are listed 'Best First'.
Re: Nested div tag
by haukex (Archbishop) on Jul 20, 2016 at 14:30 UTC
Re: Nested div tag
by Anonymous Monk on Jul 20, 2016 at 20:33 UTC
by perlmad (Sexton) on Jul 21, 2016 at 05:51 UTC
by Anonymous Monk on Jul 21, 2016 at 06:14 UTC
by perlmad (Sexton) on Jul 21, 2016 at 06:18 UTC
by Anonymous Monk on Jul 21, 2016 at 07:07 UTC