in reply to Re: Spreadsheet::ParseExcel Problem
in thread Spreadsheet::ParseExcel Problem

Below is the original string when uploaded

(Looking at dumps, numerous transactions which abended with dump which were waiting on v1 call (via cornep)
John to log report with R&D )


Below is the string when it is parsed.

(Looking at dumps, numerous transactions which abended with dump which were waiting on v1 call (via cornep)  John to log report with R&D)

Note the character - (in red font) is converted to . And as far as "'" is concerned is taken care at the time of insert into database.
  • Comment on Re: Re: Spreadsheet::ParseExcel Problem

Replies are listed 'Best First'.
Re: Re: Re: Spreadsheet::ParseExcel Problem
by bart (Canon) on Dec 26, 2003 at 13:15 UTC
    That character in red looks like an en-dash ("–") to me. There is no en-dash character in ISO-Latin-1 — but only in Microsoft's extended character set, Code Page 1252, at position 0x96. In Unicode, its character code is 0x2013.

    I'm not sure exactly what is happening here, but I'm quite convinced this is the cause of your problem. Check what character set the Excel file is in, I would think it's in 16-bit Unicode. Perhaps Spreadsheet::ParseExcel simply drops the upper byte, leaving you with the control character 0x13, chr(19).

    Copying the block character out of the HTML page and pasting it into my text editor shows that it is indeed a chr(19). So that explains what is going on, but that still doesn't fix it. It looks like a lot of work; likely you should somehow try to convert the character set from Unicode to CP-1252/Windows.

Re^3: Spreadsheet::ParseExcel Problem
by Aristotle (Chancellor) on Dec 26, 2003 at 05:20 UTC
    So it seems to genuinely be a problem with Spreadsheet::ParseExcel, which wasn't quite clear from your initial post. In that case I can't help you. :)

    Makeshifts last the longest.