Hi All,
POSTing UTF-8 data returns "Internal Server Error: Wide Character in syswrite". Example:
use WWW::Mechanize;
use encoding 'utf-8';
binmode STDOUT, ':utf8';
....
$mech->submit_form(form_name => 'form');
I get the "Wide Character" error if $mech->content contains UTF-8 data ... It works if the data is ASCII. What am I missing?
Thanks!
Update:
I tried installing libwww-perl-5.8.26 - got a new error "HTTP::Message content must be bytes"
Solution was to edit HTTP/Request/Common.pm, adding utf8 lines
$k =~ s/([\\\"])/\\$1/g; # escape quotes and backslashes
if (utf8::is_utf8($v)){
utf8::encode($v);
}
push(@parts,
qq(Content-Disposition: form-data; name="$k"$CRLF$CRLF$v));<<
+/
And removing the "use encoding 'utf-8';" line from the sample script.
Working code:
use WWW::Mechanize;
binmode STDOUT, ':utf8'; # Removes 'wide character' warnings
$url = "http://localhost/test.cgi";
$mech = WWW::Mechanize->new();
$mech->get($url);
if ($mech->success) {
if ($mech->form_name( 'frmLookup' )) {
$mech->submit_form(form_name => 'frmLookup');
print "Success";
}
}
Example form:
print qq{Status: 200
Content-Type: text/html
<html>
<head>
<meta http-equiv="content-type" content="text/html; charset=UTF-8">
</head>
<body>
<FORM NAME="frmLookup" ACTION="test.cgi" METHOD="Post" ENCTYPE="mult
+ipart/form-data">
<INPUT TYPE="text" NAME="test" VALUE="刘">
</FORM>
</body>
</html>};
Note: this works for multipart/form-data encoding. A similar edit is required for x-www-form-urlencoded encoding.
Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
Read Where should I post X? if you're not absolutely sure you're posting in the right place.
Please read these before you post! —
Posts may use any of the Perl Monks Approved HTML tags:
- a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
| |
For: |
|
Use: |
| & | | & |
| < | | < |
| > | | > |
| [ | | [ |
| ] | | ] |
Link using PerlMonks shortcuts! What shortcuts can I use for linking?
See Writeup Formatting Tips and other pages linked from there for more info.