I am trying the impossible: making the module TermExtract (Module: zip; Info in Japanese with Google Translate here) work. My problem is the encoding: I get only garbage in the output. My simple script:
#!/usr/local/bin/perl -w
use TermExtract::JapanesePlainTextEUC;
use strict;

my $data = new TermExtract::JapanesePlainTextEUC;
my $InputFile = "in.txt";
my @noun_list = $data->get_imp_word($InputFile);

open(FH, '>:encoding(UTF-8)', "out.txt") or die $!;
foreach (@noun_list) {
    print FH "$_->[0], $_->[1]\n";
}
close FH;
I run it against a simple UTF-8 text file.
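One possible cause, judging only from the module name: JapanesePlainTextEUC may expect EUC-JP bytes rather than UTF-8, and may hand back EUC-JP bytes that then pass through the UTF-8 output layer undecoded. That is an assumption, not something confirmed by the module's documentation. A minimal sketch of the transcoding with the core Encode module follows; the helper names utf8_to_eucjp and eucjp_to_chars are hypothetical, and the sketch deliberately does not call TermExtract itself.

```perl
#!/usr/local/bin/perl
use strict;
use warnings;
use Encode qw(decode encode);

# Hypothetical helper: transcode raw UTF-8 bytes into raw EUC-JP bytes.
sub utf8_to_eucjp {
    my ($bytes) = @_;
    return encode('euc-jp', decode('UTF-8', $bytes));
}

# Hypothetical helper: decode raw EUC-JP bytes into a Perl character string,
# which can then be printed safely through an :encoding(UTF-8) layer.
sub eucjp_to_chars {
    my ($bytes) = @_;
    return decode('euc-jp', $bytes);
}

# Sample input: the three characters of "Nihongo" written as raw UTF-8 bytes.
my $utf8_bytes = "\xE6\x97\xA5\xE6\x9C\xAC\xE8\xAA\x9E";
my $euc_bytes  = utf8_to_eucjp($utf8_bytes);
my $chars      = eucjp_to_chars($euc_bytes);

binmode(STDOUT, ':encoding(UTF-8)');
print length($euc_bytes), " EUC-JP bytes, ", length($chars), " characters: $chars\n";
```

If the assumption holds, the same two steps would apply to the script above: convert in.txt to EUC-JP before passing it to the module, and decode each returned string from EUC-JP before printing it to out.txt.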
If somebody has an idea...
In reply to Encoding TermExtract by IB2017