Tommy1981 has asked for the wisdom of the Perl Monks concerning the following question:

Hello everybody,

does anybody know a Perl module that replaces a HTML Entity by its representation as you can see it in a web browser?

I already extracted the text within the <body> tags of an HTML formatted website using HTML::TagParser, and what I get are things like

"interdisziplin&#228;r", "gr&#246;&#223;eren"

instead of

"interdisziplinär", "größeren".

Thank you very much for your help in advance,

Thomas

Replies are listed 'Best First'.
Re: Replacement of HTML Entites
by Corion (Patriarch) on Jul 12, 2010 at 16:43 UTC
      Hi Corion,

      Thank you very much for your answer. I didn't know that HTML::Entities is contained in HTML::Parser, and I couldn't find "HTML-Entities" via the Active Perl's "Perl Package Manager".

      Now it works fine.

      Thomas