But of course any decent XML parser (such as XML::Parser) will choke on this "bad" XML. Technically it is not even XML: a document containing unescaped 'forbidden' characters such as & and < is not well-formed.
Therefore I suggest that you catch these 'forbidden' characters before they enter your XML. Can't you run the encode_entities function of HTML::Entities on the incoming data before it is XML-ized?
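A minimal sketch of that idea, assuming the incoming data is available as a plain string before it is wrapped in tags. The helper name `xml_escape` and the `<note>` element are made up for illustration. Note that `encode_entities` without a second argument also produces HTML-only named entities (such as `&eacute;`), which an XML parser does not know, so the sketch restricts it to the characters XML reserves:

```perl
use strict;
use warnings;
use HTML::Entities qw(encode_entities);

# Hypothetical helper: escape only the characters that XML reserves.
sub xml_escape {
    my ($text) = @_;
    # The second argument limits encoding to these characters, so no
    # HTML-only named entities end up in the XML.
    return encode_entities($text, q{<>&"'});
}

my $raw = 'AT&T <fast> "quoted"';                     # made-up incoming data
my $xml = '<note>' . xml_escape($raw) . '</note>';    # escape first, then wrap
print "$xml\n";
```

Escaping has to happen on the raw data only, before the markup is added; running it over the finished document would also escape the tags themselves.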
CountZero
"If you have four groups working on a compiler, you'll get a 4-pass compiler." - Conway's Law
In reply to "Re: regex on XML" by CountZero, in thread "regex on XML" by bear0053