in reply to Re^2: What's the 'M-' characters and how to filter/correct them?
in thread What's the 'M-' characters and how to filter/correct them?
There are no "M-" characters as such. That's just how cat is displaying the non-ASCII characters. The ones that you say are "trivial" are probably some kind of white space character that you don't notice in the spreadsheet.
Once you know what characters they are, for example as suggested in Re: What's the 'M-' characters and how to filter/correct them?, you can remove them with a regular expression. For example, here's a situation I dealt with recently involving invisible special characters that were causing problems with web browsers:
# U+2028 ('Line Separator') and U+2029 ('Paragraph Separator') + are valid JSON # but cause a parse error in the browser. So we remove them. $job_xml =~ s/\x{2028}|\x{2029}//sg;
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^4: What's the 'M-' characters and how to filter/correct them?
by sylph001 (Sexton) on Jan 20, 2016 at 09:54 UTC | |
by shmem (Chancellor) on Jan 20, 2016 at 10:17 UTC | |
by sylph001 (Sexton) on Jan 20, 2016 at 15:02 UTC | |
by shmem (Chancellor) on Jan 21, 2016 at 15:28 UTC | |
by Anonymous Monk on Jan 20, 2016 at 22:01 UTC | |
by sylph001 (Sexton) on Jan 21, 2016 at 09:25 UTC | |
by AnomalousMonk (Archbishop) on Jan 21, 2016 at 20:05 UTC |