Re: Extracting Text After <pre> tag in HTML

In case you haven't been given enough Ways To Do It, my first thought was HTML::TokeParser::Simple:

use HTML::TokeParser::Simple;
my $p = HTML::TokeParser::Simple->new( file => 'test_data.html' );
my $t; #token;
my @text; 

#get all text between pre tags
while ($t = $p->get_token) {
   next unless $t->is_start_tag('pre');
   
   my $content;
   while ($t = $p->get_token) {
      last if $t->is_end_tag('pre');
      $content .= $t->as_is;
   }
   push @text, $content;
}
[download]

I'm guessing this isn't the fastest approach... but hey, TMTOWTDI.

<–radiant.matrix–>
A collection of thoughts and links from the minds of geeks
The Code that can be seen is not the true Code
I haven't found a problem yet that can't be solved by a well-placed trebuchet

Comment on Re: Extracting Text After <pre> tag in HTML Download Code