Howdy!

I'm having an issue using the HTML5 section tag with the HTML::TreeBuilder::XPath package. It doesn't seem to recognize the section tag as a valid tag. I simply replace it with a div tag and Wha-La! Here's the HTML code:

<!DOCTYPE html> <html> <head> <style> .center {text-align:center;} </style> <title>DIV Example</title> </head> <body> <section class=center> <h1>WWF</h1> <p>The World Wide Fund for Nature (WWF) is....</p> </section> <section class=center> <h1>M4P</h1> <p>The Money for People Fund (MFP) is....</p> </section> <section class=center> <h1>WW2</h1> <p>The World Wide Fund for People (WWFP) is....</p> </section> </body> </html>

Now here's the code snippet:

use strict; use HTML::TreeBuilder::XPath; use Data::Dumper; my $file_name = qq~/usr/local/apache/htdocs/jspelkey/html5_section.htm +l~; my $tree = HTML::TreeBuilder::XPath->new; # empty tree $tree->parse_file($file_name); for my $result ($tree->findnodes(q{/html/body/div[@class="center"] +})) { my $x = HTML::TreeBuilder::XPath->new; $x->parse($result->as_HTML); print $x->findvalue(q{/h1}); print "<br>".("-" x 17)."<br>\n"; } $tree = $tree->delete;

Anybody got any ideas? Much Thanks in Advance!!


In reply to Issues using HTML::TreeBuilder::XPath and the HTML5 <section> tag by ChuckP

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.