I am parsing a file that was created by an unknown database and is served up by a Windows server. Only a couple of lines contain malformed UTF-8 characters, but feeding one of those lines into a regex triggers a warning message.
I could simply stop using the -w option, but I don't want that as a permanent solution, because it would also hide other problems that may come up in the script in the future. Does anyone know how I might detect a malformed UTF-8 character before the regex works on it?
Thanks,
Wonko El Sano