Sigmund asked a very similar question here. The advice here was to use HTML::Tokeparser. It seems to have done the trick!
I would urge you to consider the same approach. (Sigmund did it without commenting out strict and warnings too!)
Using regexs to parse HTML is a nightmare. Don't do it!
Update: fixed typo.
In reply to Re: Broken News- Reg. Exp.
by wfsp
in thread Broken News- Reg. Exp.
by Anonymous Monk
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |