Beefy Boxes and Bandwidth Generously Provided by pair Networks
Your skill will accomplish
what the force of many cannot
 
PerlMonks  

Re^3: Avoiding escaped child elements with HTML::TreeBuilder::XPath or HTML::Element

by haukex (Bishop)
on Nov 15, 2021 at 12:50 UTC ( #11138833=note: print w/replies, xml ) Need Help??


in reply to Re^2: Avoiding escaped child elements with HTML::TreeBuilder::XPath or HTML::Element
in thread Avoiding escaped child elements with HTML::TreeBuilder::XPath or HTML::Element

However, I am not sure how to apply a substitution, s///, to an element containing more that just text.

The documentation of HTML::Element's content_refs_list gives an example of how to modify text nodes contained in an element and the documentation of HTML::Element::traverse shows how to use a recursive function to walk the tree. Putting those together:

sub html_trim { my $elem = shift; for my $itemref ($elem->content_refs_list) { if ( ref $$itemref ) { html_trim($$itemref) } # remove this for non-recursive else { $$itemref =~ s/^\s+|\s+$//g } } } for my $elem ($xhtml->findnodes('//div/ul/li')) { html_trim($elem) }

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://11138833]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others examining the Monastery: (5)
As of 2022-01-24 20:54 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?
    In 2022, my preferred method to securely store passwords is:












    Results (65 votes). Check out past polls.

    Notices?