in reply to LWP::Simple question

LWP::Simple, or other LWP modules, only download the source, they don't do any parsing. You can use other modules in the HTML namespace for parsing, but getting only viewable text doesn't seem like an easy problem.