I am trying the impossible: Making the module TermExtract (Module: zip; Info in Japanese with Google Translate here) work. My Problem is the encoding. I get only garbage out. My simple script:
#!/usr/local/bin/perl -w use TermExtract::JapanesePlainTextEUC; use strict; my $data = new TermExtract::JapanesePlainTextEUC; my $InputFile = "in.txt"; my @noun_list = $data->get_imp_word($InputFile); open(FH, '>:encoding(UTF-8)', "out.txt") or die $!; foreach (@noun_list) { print FH "$_->[0], $_->[1]\n"; } close FH;
Run against a simple textfile utf8
If somebody has an idea...
In reply to Encoding TermExtract by IB2017
For: | Use: | ||
& | & | ||
< | < | ||
> | > | ||
[ | [ | ||
] | ] |