kcinmd has asked for the wisdom of the Perl Monks concerning the following question:
I am about to implement a preprocessing one liner to remove Invalid ascii character representations which fall out of the range of our decoding ability. Our data feed is a drop directory of XML that ultimately is loaded into data warehouse. Prior to load I will have the ETL issue the following pre-command:
perl -pi -e 's/^&#x.+;//g' ./*.xmlTesting has proven desired result. I figured I would throw this out there for opinion just in case I am missing something.
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re: Sanity Check
by choroba (Cardinal) on Jan 28, 2015 at 12:45 UTC | |
by kcinmd (Initiate) on Jan 28, 2015 at 13:38 UTC | |
by choroba (Cardinal) on Jan 28, 2015 at 13:42 UTC | |
by kcinmd (Initiate) on Jan 28, 2015 at 15:07 UTC | |
by Anonymous Monk on Jan 28, 2015 at 14:17 UTC |