comment on

my $xpath = '//div[@class="ccc"]/node()';

The way I understand that expression is "match any node of any kind that is a direct child of any <div> elements with a class attribute equal to ccc." If I remove all the whitespace from the XML, the matching <div> has three children: <img src="">, <h2>Hello</h2>, and <div class='s'>. And since node() matches any kind of nodes, including text nodes, that's what it's matching when you put the whitespace back in. You can see all of this in action if you put print "[[",$superCat->as_XML,"]]\n"; as the first thing in your loop. In other words, your XPath is behaving correctly. If you only want to match the <div class="ccc">, change the expression to //div[@class="ccc"].

(Also note that your XML is not valid, the <img> tag isn't closed.)

In reply to Re: HTML::TreeBuilder::LibXML creates multiple copies of the same result by haukex
in thread HTML::TreeBuilder::LibXML creates multiple copies of the same result by password

Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!

Titles consisting of a single word are discouraged, and in most cases are disallowed outright.

Read Where should I post X? if you're not absolutely sure you're posting in the right place.

Please read these before you post! —

Posts may use any of the Perl Monks Approved HTML tags:

a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr

You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)

	For:		Use:
	&		`&`
	<		`<`
	>		`>`
	[		`[`
	]		`]`

Link using PerlMonks shortcuts! What shortcuts can I use for linking?

See Writeup Formatting Tips and other pages linked from there for more info.