usr345 has asked for the wisdom of the Perl Monks concerning the following question:

I have an HTML file in UTF-8 containing Portuguese text. After I parse it with HTML::TreeBuilder, the accented letters come out corrupted. For example, the word 'Acrelândia' becomes 'Acrelândia'.

I am totally confused. Can anyone help?

Re: HTML::TreeBuilder incorrect encoding
by choroba (Cardinal) on Nov 09, 2010 at 14:55 UTC
      I searched, but didn't find this. I will be working with these suggestions. Thx!