in reply to Re: How to Remove Junk Characters
in thread How to Remove Junk Characters

Hi Monks,

Here are some sample junk characters Downloaded files Input -- Original Output =========================================== 1. jury trial. For his -- jury trial. For his 2. Börries Ahrens -- Börries Ahrens 3. Aldejohann’s main -- Mr. Aldejohann’s 4. University of MĂĽnster -- University of Münster 5. the €625 million senior and €130 -- €625 million senior and €1 +30 6. acquisition of a properties’ -- acquisition of a properties’ 7. Westfield College – University -- Westfield College – University + 8. TelĂ©fonos -- Teléfonos 9.(CelumĂłvil S -- (Celumóvil S 10. Dr. jur., 1990, with a dissertation on “Die Unabhängigkeit des +genossenschaftlichen PrĂĽfungsverbandes” (“The Independence of th +e Cooperative Inspection Association”) --- Dr. jur., 1990, with a dissertation on "Die Unabhängigkeit des genosse +nschaftlichen Prüfungsverbandes" ("The Independence of the Cooperativ +e Inspection Association")

Thanks,
Rajesh.K

Replies are listed 'Best First'.
Re^3: How to Remove Junk Characters
by wfsp (Abbot) on Jan 06, 2006 at 19:56 UTC
    Change
    my $file_cnt = $res->content;
    to
    my $file_cnt = $res->decoded_content;

    See HTTP::Message for an explanation of the difference.

    Many thanks to the search artist kwapping for finding it and to tye for explaining it :-)