I am about to implement a preprocessing one liner to remove Invalid ascii character representations which fall out of the range of our decoding ability. Our data feed is a drop directory of XML that ultimately is loaded into data warehouse. Prior to load I will have the ETL issue the following pre-command:
perl -pi -e 's/^&#x.+;//g' ./*.xmlTesting has proven desired result. I figured I would throw this out there for opinion just in case I am missing something.
In reply to Sanity Check by kcinmd
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |