If you apply several regexpes to your HTML text, you could first translate all nbsp to real spaces, i.e.
$text =~ tr/\xA0/ /;
In reply to Re: \s and non-breaking spaces
by rovf
in thread \s and non-breaking spaces
by Ratazong
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |