in reply to Re: Matching ampersands that are NOT part of an HTML entity?
in thread Matching ampersands that are NOT part of an HTML entity?

I may be missing something, but it looks like you're printing and then discarding the utf8-encoded text, then continuing on with the non-utf8 text. Shouldn't it be something like this?
my $utf8 = HTML::Entities::decode($line); $utf8 = Encode::encode_utf8($utf8); my $proper = HTML::Entities::encode($utf8); print $proper;
Update: Ah, nevermind, I misunderstood what you were saying initially.

__________
Systems development is like banging your head against a wall...
It's usually very painful, but if you're persistent, you'll get through it.

Replies are listed 'Best First'.
Re^3: Matching ampersands that are NOT part of an HTML entity?
by Your Mother (Archbishop) on Aug 07, 2008 at 15:18 UTC

    Uh... no. Did you run it? The print Encode::encode_utf8($utf8); is just there to see the intermediary step. Encode::encode_utf8 makes the output "safe" for the terminal: no "wide character" warnings.