in reply to Re: Interpreting input file having special characters from multiple European languages
in thread Interpreting input file having special characters from multiple European languages

My Perl program for reading the multi-language Excel spreadsheet, and writing selected data to an ASCII *.ini file, uses the modules Spreadsheet::BasicRead and Spreadsheet::BasicReadNamedCol to read the Excel spreadsheet.

The unmodified Text::Unicode module is not correctly translating Polish special characters while writing them into the ASCII *.ini file, the Polish special characters are garbled.

I am using Strawberry Perl, which I assume is very out of date compared with standard Perl. Could this be causing my problem?

Is there an alternate solution for the problems I am having in my Perl program?
  • Comment on Re^2: Interpreting input file having special characters from multiple European languages

Replies are listed 'Best First'.
Re^3: Interpreting input file having special characters from multiple European languages
by RonW (Parson) on Sep 16, 2015 at 22:09 UTC
    I am using Strawberry Perl, which I assume is very out of date compared with standard Perl.

    The Strawberry Perl project is very good at keeping up with releases from the main Perl project. The incantation perl -v will tell you the specific version you are running.

Re^3: Interpreting input file having special characters from multiple European languages
by tangent (Parson) on Sep 16, 2015 at 16:57 UTC
    The unmodified Text::Unicode module...
    May be a typo, but my suggestion was to use Text::Unidecode not Text::Unicode - they do quite different things. Spreadsheet::BasicRead should be fine as it uses Spreadsheet::ParseExcel under the hood.
      I made a typo in my reply I was using Text::Unidecode in my Perl program, not Text::Unicode.

      The CPAN entry for Text::Unidecode suggests modifying Text::Unidecode with special code for any language (such as Polish) whose special characters cause problems for Text::Unidecode. I was hoping to avoid this.

        In our test environment, an acceptable workaround for Polish special characters is to use codepage cp1250 for the Polish language. The other European languages seem to work fine with cp1252. Our test devices contain special code that touches up any remaining errors in codepage cp1250 (when Polish special characters are being rendered). No further replies are needed, problem is solved.