G'day Cody Fendant,
I can comment on the "characters" part. I'm not an MSWin user, so I'm unable to help with the "Win32::OLE and Word" part.
'... all I get, ..., is (hex) C2A0, which I believe is just "non-breaking space".'
C2 is LATIN CAPITAL LETTER A WITH CIRCUMFLEX (Â). A0 is NO-BREAK SPACE ( ). You can see both in the PDF: Unicode Code Chart: C1 Controls and Latin-1 Supplement.
C2A0 (슠) is in the PDF: Unicode Code Chart: Hangul Syllables. There are no formal names shown for any characters in that block of Unicode characters (AC00–D7AF).
My gut feeling is that this is related to different encodings in the Word and HTML documents. Another monk may be able to help further with that. If you supplied some code showing the conversion from Word to HTML you might get a better answer.
— Ken
In reply to Re: Win32::OLE and Word checkbox characters
by kcott
in thread Win32::OLE and Word checkbox characters
by Cody Fendant
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |