I see now that "by default, the internal format is either ISO-8859-1 (latin-1), or utf8, depending on the history of the string." I hadn't seen that before. I was under the impression that if the utf8 flag is not set, the string consists of octets that should be decoded. It now appears that this impression was incorrect. That brings me back to the question: Exactly what does the Win32:OLE documentation mean when it talks about the CP option for "translations between Perl strings and Unicode strings?" Does the CP_UTF8 option actually mean "character strings in Perl's internal format?" Thanks.
-
Are you posting in the right place? Check out Where do I post X? to know for sure.
-
Posts may use any of the Perl Monks Approved HTML tags. Currently these include the following:
<code> <a> <b> <big>
<blockquote> <br /> <dd>
<dl> <dt> <em> <font>
<h1> <h2> <h3> <h4>
<h5> <h6> <hr /> <i>
<li> <nbsp> <ol> <p>
<small> <strike> <strong>
<sub> <sup> <table>
<td> <th> <tr> <tt>
<u> <ul>
-
Snippets of code should be wrapped in
<code> tags not
<pre> tags. In fact, <pre>
tags should generally be avoided. If they must
be used, extreme care should be
taken to ensure that their contents do not
have long lines (<70 chars), in order to prevent
horizontal scrolling (and possible janitor
intervention).
-
Want more info? How to link
or How to display code and escape characters
are good places to start.
|