in reply to regexp for stripping tables

If all you need is to strip the table tags you can use Regexp::Token (look in: Regexp::Token -- Use regular expressions to match tokens). If you want to strip the tabels together with the content of them there are two possibilities: Some more discussion about those techniques you'll find in: Scraping HTML: orthodoxy and reality, Regexps to change HTML tags/attributes, Parsing nested HTML with just regex