ChuckP has asked for the wisdom of the Perl Monks concerning the following question:

Howdy!

I'm having an issue using the HTML5 section tag with the HTML::TreeBuilder::XPath package. It doesn't seem to recognize section as a valid tag. If I simply replace it with a div tag, voilà, it works! Here's the HTML code:

<!DOCTYPE html>
<html>
<head>
<style>
.center {text-align:center;}
</style>
<title>DIV Example</title>
</head>
<body>
<section class=center>
    <h1>WWF</h1>
    <p>The World Wide Fund for Nature (WWF) is....</p>
</section>
<section class=center>
    <h1>M4P</h1>
    <p>The Money for People Fund (MFP) is....</p>
</section>
<section class=center>
    <h1>WW2</h1>
    <p>The World Wide Fund for People (WWFP) is....</p>
</section>
</body>
</html>

Now here's the code snippet:

use strict;
use HTML::TreeBuilder::XPath;
use Data::Dumper;

my $file_name = qq~/usr/local/apache/htdocs/jspelkey/html5_section.html~;
my $tree = HTML::TreeBuilder::XPath->new; # empty tree
$tree->parse_file($file_name);

for my $result ($tree->findnodes(q{/html/body/div[@class="center"]})) {
    my $x = HTML::TreeBuilder::XPath->new;
    $x->parse($result->as_HTML);
    print $x->findvalue(q{/h1});
    print "<br>".("-" x 17)."<br>\n";
}

$tree = $tree->delete;

Anybody got any ideas? Many thanks in advance!

Replies are listed 'Best First'.
Re: Issues using HTML::TreeBuilder::XPath and the HTML5 <section> tag ( ignore_unknown => 0 )
by Anonymous Monk on Sep 09, 2013 at 02:15 UTC
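
The subject line above points at the ignore_unknown option that HTML::TreeBuilder::XPath inherits from HTML::TreeBuilder. That option is on by default, so tags the parser's tag tables don't know about can be dropped silently, which would explain why section disappears while div works. Here is a minimal sketch of turning it off, not necessarily the exact code the reply had in mind. Assumptions: the inline $html string is a trimmed stand-in for the file in the question, @titles is just an illustrative name, and the relative findvalue call replaces the re-parse of each node in the original snippet.

```perl
use strict;
use warnings;
use HTML::TreeBuilder::XPath;

# Stand-in for the file from the question (assumption: trimmed to two sections).
my $html = <<'HTML';
<!DOCTYPE html>
<html>
<head><title>DIV Example</title></head>
<body>
<section class="center"><h1>WWF</h1><p>The World Wide Fund for Nature (WWF) is....</p></section>
<section class="center"><h1>M4P</h1><p>The Money for People Fund (MFP) is....</p></section>
</body>
</html>
HTML

my $tree = HTML::TreeBuilder::XPath->new;
$tree->ignore_unknown(0);    # keep tags the parser doesn't know; set before parsing
$tree->parse($html);
$tree->eof;                  # signal end of input when using parse()

# Query each matched node directly with a relative XPath
# instead of re-parsing its as_HTML output.
my @titles = map { $_->findvalue('./h1') }
             $tree->findnodes('/html/body/section[@class="center"]');

print "$_\n" for @titles;

$tree->delete;
```

With parse_file, as in the original snippet, the same ignore_unknown(0) call would go right after new and before parse_file.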