in reply to substitution in textual area of HTML file
"I am parsing the output of pdftohtml", so this seems unlikely.