in reply to LWP::Simple // Special Character problems.
The target web-page contains the tex: "æ, ø, å"That indicates that the web-page in question did not properly encode unsafe characters using HTML::Entities or equivalent, right? But that probably not your fault, unless of course it's your own site. But you can decode it yourself:
use strict; use warnings; use LWP::Simple; use HTML::Entities; my $str = decode_entities(get(q{http://www.uio.no})); my @arr = split('\s+', $str); for (@arr) { print if (m/[æøå]/i); } __END__ største ønsker å søk--> <!--Søk søkeknapp alt="Søk" value="Søk" Walløe forskingsråd</a><span å nivået ...
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^2: LWP::Simple // Special Character problems.
by ikegami (Patriarch) on May 24, 2007 at 17:32 UTC | |
by RipHard (Initiate) on May 26, 2007 at 08:53 UTC |