Hi, I'm converting a database (that I thought it was utf8) to a txt flat database; here are the results I'm getting:
guozi|5|41.102188|122.502444|CH|19||Guo?zi| e|5|41.102188|122.502444|CH|19||锅?子| zaomugou|5|41.144313|122.489929|CH|19||Zaomugou| |5|41.144313|122.489929|CH|19||枣木沟| wangjia|5|41.417097|122.412761|CH|19||Wangjia| c|5|41.417097|122.412761|CH|19||王家| shengli|5|41.393546|122.456116|CH|19||Shengli| e|5|41.393546|122.456116|CH|19||胜利| minjitun|5|41.375362|122.471185|CH|19||Minjitun| e|5|41.375362|122.471185|CH|19||民集屯| zhangjiagou|5|41.413368|122.487095|CH|19||Zhangjiagou| |5|41.413368|122.487095|CH|19||张家沟| yahua|5|41.42516|122.473109|CH|19||Yahua|
As you can see, some of the lines have strange symbols (utf8?) - not many. I want to exclude them. What kind of match pattern can I use to exclude them? Kind regards, Kepler

In reply to Database Problem by kepler

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.