in reply to Re: TreeBuilder and encoding
in thread TreeBuilder and encoding

Now that I've looked, you also need to read https://metacpan.org/module/HTML::TreeBuilder#parse_file because TreeBuilder is interpreting those UTF-8-encoded bytes as Latin-1

I had read the Latin-1 issue as having to do with file opening. Since I am passing data to parse() as a string, I thought that wouldn't apply. Thinking about it, that may be a poor assumption, but I do note that the documentation for parse() does not mention charsets at all.

This works

Hmmm.... I'll have to spend some time looking at the changes you made.

Re^3: TreeBuilder and encoding
by Anonymous Monk on Jul 15, 2013 at 23:24 UTC
    You're using content, which is bytes; mech's content is chars, i.e. decoded_content.

    Mechanize is for you :)
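
For anyone following along, here is a minimal sketch of the bytes-versus-chars difference, using only the core Encode module. The sample string is made up for illustration and is not from the original page; the point is that a string of undecoded UTF-8 bytes gets read as Latin-1 characters unless it is decoded first, which is presumably what happens to a byte string handed to parse().

```perl
use strict;
use warnings;
use Encode qw(decode);

# Hypothetical sample: the UTF-8 encoded bytes of "café",
# as they might arrive from an undecoded HTTP response body.
my $bytes = "caf\xC3\xA9";

# Decode the bytes into Perl characters before handing them to a parser.
my $chars = decode('UTF-8', $bytes);

# Five bytes become four characters once decoded.
printf "bytes: %d, chars: %d\n", length($bytes), length($chars);

# Encode output as UTF-8 so both strings print predictably.
binmode STDOUT, ':encoding(UTF-8)';

# Undecoded, each byte is taken as one Latin-1 character: "cafÃ©".
print "undecoded: $bytes\n";

# Decoded, the string prints as intended: "café".
print "decoded:   $chars\n";
```

If that is the cause, decoding first (or using a method that already returns characters, such as decoded_content) should give the parser the right text.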