Hi, I am transforming HTML files into XML data sheets, and I am finding that certain characters break the script. I am trying to filter these out, but I fear I am being reactive rather than proactive: in practice I am just waiting for the next character to break the script. Is there a generic approach or module that would trap these characters before they crop up?
$content =~ s/ís//gi;
$content =~ s/ìH//gi;
$content =~ s/Ùt//gi;
$content =~ s/ía//gi;
$content =~ s/∫s//gi;
$content =~ s/íX//gi;
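One generic alternative to listing offending sequences one by one is to decode the input once and then whitelist what XML allows. The following is only a minimal sketch under stated assumptions: `xml_safe` is a hypothetical helper name, the input is assumed to be UTF-8 bytes (pass a different encoding name if your files use one), and `$content` is assumed to be plain text with HTML tags and entities already removed, since otherwise the `&` escaping would double-escape existing entities. It uses the core `Encode` module.

```perl
use strict;
use warnings;
use Encode qw(decode);

# Hypothetical helper: turn raw bytes into XML-safe, ASCII-only text.
# Assumption: the input is UTF-8 unless another encoding name is given.
sub xml_safe {
    my ($bytes, $encoding) = @_;
    $encoding ||= 'UTF-8';

    # Decode bytes into Perl characters; malformed input becomes U+FFFD.
    my $text = decode($encoding, $bytes);

    # Remove characters that XML 1.0 does not permit (most control characters).
    $text =~ s/[^\x{9}\x{A}\x{D}\x{20}-\x{D7FF}\x{E000}-\x{FFFD}\x{10000}-\x{10FFFF}]//g;

    # Escape the characters that are markup in XML.
    $text =~ s/&/&amp;/g;
    $text =~ s/</&lt;/g;
    $text =~ s/>/&gt;/g;

    # Replace anything outside printable ASCII with a numeric character reference.
    $text =~ s/([^\x{9}\x{A}\x{D}\x{20}-\x{7E}])/sprintf('&#x%X;', ord $1)/ge;

    return $text;
}

# Usage: the UTF-8 bytes for "e acute" become a character reference.
print xml_safe("caf\xC3\xA9 & <b>"), "\n";   # prints: caf&#xE9; &amp; &lt;b&gt;
```

The point of the whitelist is that new, unexpected characters are handled by the same rule instead of needing another hand-written substitution. If the odd sequences come from a wrong guess about the source encoding, changing the second argument is likely to be a better fix than stripping them.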
In reply to stripping characters from html by jonnyfolk