I'm new to Perl.
OBJECTIVE
I want to automate editing of textual parts of HTML files.
APPROACH
At the abstract level, an HTML document is a tree of nodes. I want to traverse the tree and edit its text nodes.
PROBLEM
Unfortunately, I can't give a simple demonstration because this website translates HTML tags and special characters in posts. I have to verbalize an example.
Suppose in an HTML file to be edited I have
(a) “ampersand” + “lt;” string to represent a left angle bracket “<”, and
(b) “--” string.
Problem 1
After I've built the tree using HTML::TreeBuilder, the “ampersand” + “lt;” string is printed as an ordinary “<” sign. This is not acceptable.
Problem 2
After I've built the tree using HTML::TreeBuilder, I edit “--” to “M dash”, i.e., “ampersand” + “mdash;”. The printed output is “ampersand” + “amp;” + “mdash;”. This is not acceptable either.
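A minimal sketch of the traversal described under APPROACH, for illustration only. It assumes that HTML::TreeBuilder stores text nodes as already-decoded plain strings and that as_HTML re-encodes them on output, which would explain both problems. The helper name edit_text_nodes and the sample input are hypothetical, not part of any module.

```perl
use strict;
use warnings;
use HTML::TreeBuilder;

# Hypothetical helper (not part of HTML::Element): apply $edit to every
# text node below $node. Text nodes are plain strings; elements are objects.
sub edit_text_nodes {
    my ($node, $edit) = @_;
    # content_array_ref returns the live child array, so assigning to the
    # aliased loop variable changes the tree in place.
    for my $child (@{ $node->content_array_ref }) {
        if (ref $child) {
            edit_text_nodes($child, $edit);   # descend into child element
        }
        else {
            $child = $edit->($child);         # rewrite the text node
        }
    }
    return;
}

# Sample input, made up for illustration.
my $html = '<html><body><p>a &lt; b -- c</p></body></html>';

my $tree = HTML::TreeBuilder->new_from_content($html);

# Insert the dash character itself (U+2014), not the string "&mdash;",
# whose "&" would be escaped a second time on output.
edit_text_nodes($tree, sub {
    my ($text) = @_;
    $text =~ s/--/\x{2014}/g;
    return $text;
});

# Called without arguments, as_HTML encodes unsafe characters as entities.
my $out = $tree->as_HTML;
print $out, "\n";

$tree->delete;
```

If the assumption holds, serializing with as_HTML (rather than printing the text nodes directly or using as_text) should give the angle bracket back as an entity, and the inserted dash character should come out as a single entity instead of a doubly escaped one.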
QUESTION
Could you, pray, help me get correct output?
In reply to processing text nodes using HTML::Element by Anonymous Monk