Thanks but this doesn't work for me. I know that qr returns a regex object, I was only using it because a straight substution doesn't do the trick. I don't have Path::Tiny so I amended your example:
#!/usr/bin/perl -- use strict; use warnings; my $infile = 'C:\Scripts\Working2\Users_0.xml'; my $outfile = 'C:\Scripts\Working2\Users2.xml'; my $find = VerifyHex( '\xe9' ); my $replace = VerifyHex( '\x65' ); open(IF, "<$infile") or die "Could not open $infile $!"; open(OF, ">$outfile") or die "Could not open $outfile $!"; binmode IF; binmode OF; while( my $str = <IF> ){ $str =~ s{$find}{$replace}g; print OF $str; } close IF; close OF; sub VerifyHex { my( $str ) = @_; if( $str =~ m/(\\[a-zA-Z0-9][a-zA-Z0-9])/ ){ return "$1"; } die "evil input $str"; }
The input file contains this (edited from an extract from an IBM tool), which says it's UTF-8 but isn't:
<?xml version="1.0" encoding="UTF-8" ?> <foundation Version="1.0.0"> <contributor> <userId>C12760</userId> <name>Shilpaé Durgale</name> </contributor> </foundation>
Sadly the output is unchanged from the input, the é is not replaced with e. As you can tell I'm no genius with Perl and I'm sure I'm missing something fundamental. Thoughts?

In reply to Re^2: Hex-matching Regex pattern in scalar ( substitution by CliffG
in thread Hex-matching Regex pattern in scalar by CliffG

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.