That was it!
Turns out the files are UTF-16 encoded. Hadn't thought of that. I saved one test file as Roman and one as Latin, and the scripts worked on both. I don't yet know whether the specific data that has to be matched loses information when I convert to Roman/Latin, but at least I'm on a better path.
Thanks.
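For anyone who lands here with the same problem: a minimal sketch of reading a file directly as UTF-16 instead of converting it first, which would sidestep the question of lost information. It assumes the file starts with a byte-order mark (BOM). The sub name `read_utf16_lines` and the sample text are made up for illustration; this is not the script from this thread.

```perl
use strict;
use warnings;
use Encode qw(encode);
use File::Temp qw(tempfile);

# Read a UTF-16 file and return its lines as decoded character strings.
# The :encoding(UTF-16) layer relies on a byte-order mark (BOM) at the
# start of the file; a file without one needs UTF-16LE or UTF-16BE instead.
sub read_utf16_lines {
    my ($path) = @_;
    open my $fh, '<:encoding(UTF-16)', $path
        or die "Cannot open $path: $!";
    my @lines = <$fh>;
    close $fh;
    s/\r?\n\z// for @lines;    # drop LF or CRLF line endings
    return @lines;
}

# Demo (hypothetical data): build a small little-endian UTF-16 file
# with a BOM, then read it back through the sub above.
my ($tmp_fh, $tmp_path) = tempfile(UNLINK => 1);
binmode $tmp_fh, ':raw';
print {$tmp_fh} "\xFF\xFE",
    encode('UTF-16LE', "caf\x{E9} & <tag>\r\nsecond line\r\n");
close $tmp_fh;

my @lines = read_utf16_lines($tmp_path);
binmode STDOUT, ':encoding(UTF-8)';
print "$_\n" for @lines;
```

Once the lines are decoded this way, regexes match against characters rather than raw bytes, so no intermediate Roman/Latin copy should be needed.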
In reply to Re: Peeling Data with Reserved Characters and Long Lines
by PerlReader
in thread Peeling Data with Reserved Characters and Long Lines
by PerlReader