Anonymous Monk has asked for the wisdom of the Perl Monks concerning the following question:
I'm new to Perl.
OBJECTIVE
I want to automate editing of textual parts of HTML files.
APPROACH
At the abstract level, an HTML document is a tree of nodes. I want to traverse the tree and edit its text nodes.
PROBLEM
Unfortunately, I can't give a simple demonstration because this website translates HTML tags and special characters in posts. I have to verbalize an example.
Suppose in an HTML file to be edited I have
(a) the string “ampersand” + “lt;”, which represents a left angle bracket “<”, and
(b) the string “--”.
Problem 1
After I've built the tree using HTML::TreeBuilder, the “ampersand” + “lt;” string is printed as an ordinary “<” sign.
This is not acceptable.
Problem 2
After I've built the tree using HTML::TreeBuilder, I edit “--” to “M dash”, i.e., “ampersand” + “mdash;”. The printed output is “ampersand” + “amp;” + “mdash;”.
This is not acceptable either.
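To make the verbal description concrete, here is a minimal sketch of the situation. It assumes the tree is serialized with `as_HTML` and its default entity handling, which may not match how the output was actually printed above. The helper names `replace_dashes` and `edit_html` and the sample fragment are hypothetical, made up for illustration only.

```perl
use strict;
use warnings;
use HTML::TreeBuilder;

# Hypothetical helper: recursively rewrite "--" in every text node under $elem.
sub replace_dashes {
    my ($elem, $replacement) = @_;
    for my $ref ($elem->content_refs_list) {
        if (ref $$ref) {
            replace_dashes($$ref, $replacement);   # child element: recurse
        }
        else {
            $$ref =~ s/--/$replacement/g;          # plain text node: edit in place
        }
    }
}

# Hypothetical helper: parse a fragment, edit its text nodes, serialize it again.
sub edit_html {
    my ($html, $replacement) = @_;
    my $tree = HTML::TreeBuilder->new_from_content($html);
    replace_dashes($tree, $replacement);
    my $out = $tree->as_HTML;   # no arguments: unsafe characters are entity-encoded
    $tree->delete;              # free the tree explicitly
    return $out;
}

my $source = '<p>a &lt; b -- c</p>';            # assumed sample input
print edit_html($source, '&mdash;'), "\n";      # inserts the entity as literal text
print edit_html($source, "\x{2014}"), "\n";     # inserts the em dash character itself
```

Under these assumptions, the first call should reproduce Problem 2: the text node holds a literal ampersand, so serialization escapes it again. The second call stores the character U+2014 in the text node and leaves the escaping to `as_HTML`, which should then emit the named entity, and the left angle bracket should come back out escaped as well. Text nodes hold decoded characters after parsing, so printing them directly (or via `as_text`) would show the bare “<” described in Problem 1.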
QUESTION
Could you, pray, help me get the correct output?
Replies are listed 'Best First'.

Re: processing text nodes using HTML::Element
by hossman (Prior) on Mar 08, 2003 at 01:32 UTC
by Anonymous Monk on Mar 08, 2003 at 04:00 UTC
by hossman (Prior) on Mar 08, 2003 at 22:09 UTC

Re: processing text nodes using HTML::Element
by demerphq (Chancellor) on Mar 08, 2003 at 12:59 UTC
by Anonymous Monk on Mar 09, 2003 at 21:28 UTC

Re: processing text nodes using HTML::Element
by bbfu (Curate) on Mar 09, 2003 at 01:07 UTC
by Anonymous Monk on Mar 09, 2003 at 21:38 UTC
by bbfu (Curate) on Mar 09, 2003 at 23:58 UTC