http://qs1969.pair.com?node_id=1162858

ultranerds has asked for the wisdom of the Perl Monks concerning the following question:

Hi,

I'm trying to make a basic page, that will grab a site (with non-latin characters on - such as the below Russian page) ... and then extract and print out the title. Here is my code:
#!/usr/bin/perl use CGI::Carp qw(fatalsToBrowser); use strict; use lib './'; use warnings; use LWP::UserAgent; use HTTP::Request::Common qw(GET); my $ua = LWP::UserAgent->new; # Define user agent type $ua->agent('Mozilla/8.0'); # Request object my $req = GET 'http://www.gazeta.ru/culture/2016/04/22/a_8191769.s +html'; # Make the request my $res = $ua->request($req); binmode STDOUT, ":utf8"; print "Content-Type: text/html; charset=utf-8 \n\n"; if ($res->is_success) { my $title; $res->content =~ /<title>(.+?)<\/title>/ and $title = $1; print "GOT TITLE: $title \n"; } else { print $res->status_line . "\n"; }
For some reason, it just doesn't seem to want to play ball - I end up with:

Румыния не будет участвовать в «Евровидении-2016» из-за денег - Газета.Ru

Instead of:

Румыния не будет участвовать в «Евровидении-2016» из-за денег - Газета.Ru

Does anyone have any suggestions?

Thanks!

Andy