Does anybody know a Perl module that replaces an HTML entity with the character it represents, as it would be displayed in a web browser?
I have already extracted the text within the <body> tags of an HTML page using HTML::TagParser, and what I get are strings like
"interdisziplin&auml;r", "gr&ouml;&szlig;eren".
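
A minimal sketch of the conversion being asked about, assuming the HTML::Entities module (part of the HTML-Parser distribution on CPAN) is installed. The variable names are made up for illustration, and the sample strings are written with their entities spelled out:

```perl
use strict;
use warnings;
use HTML::Entities qw(decode_entities);

# Emit the decoded characters as UTF-8 when printing.
binmode STDOUT, ':encoding(UTF-8)';

# Hypothetical sample input: text as extracted from the page,
# still containing named HTML entities.
my @extracted = ('interdisziplin&auml;r', 'gr&ouml;&szlig;eren');

# In non-void context decode_entities() returns a decoded copy;
# called in void context it would modify its argument in place.
my @decoded = map { decode_entities($_) } @extracted;

print "$_\n" for @decoded;
# prints:
#   interdisziplinär
#   größeren
```

The result is a Perl character string, so an output encoding layer (as set with binmode above) is still needed when writing it to a file or terminal.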