I'll agree with zwon -- you haven't given enough information. It would help if you could show a snippet of input data and resulting output, along with a minimal (but runnable) perl script that shows how you converted that input to that output.

It'll also help if you present the data in the form of hex byte values. The thing that you say is the "weird character" that whitespace is converted to is not a "character" -- it's a string of up to 14 bytes, two of which are in the ASCII range ("3" and space, which happen to be adjacent in the string: ... 0x33 0x20 ...). I won't hazard a guess as to what the other bytes might represent, because I'm not even sure whether the string you posted is an accurate copy of the output you got. (That's why the hex byte-value dump is important.)

So, your question can't be answered yet, because you haven't shown us what the input looked like or what sort of code is creating the output. And it's not even clear what the output really is.

In case it helps, here's a perl one-liner for generating a simple hex byte dump:

perl -lpe '$_=unpack("H*",$_);s/(..)/$1 /g'

In reply to Re: Problem with UTF-8 email by graff
in thread Problem with UTF-8 email by Anonymous Monk

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.