vallavan_sathish has asked for the wisdom of the Perl Monks concerning the following question:

Hi I'm new to this site and here is my doubt My problem is I need to code a HTML entity for each symbols every time, is there any possibility to convert all the symbols i found in my file to html entity for example: µ to μ Á to Á Like this for all the special symbols i found in a microsoft word please help me

Replies are listed 'Best First'.
Re: Non Keyboard Characters to HTML entity
by ikegami (Patriarch) on Aug 05, 2009 at 07:15 UTC
Re: Non Keyboard Characters to HTML entity
by Utilitarian (Vicar) on Aug 05, 2009 at 09:24 UTC
    Hi vallavan_sathish, I must admit I'm not familiar with Word's data format, are you looking at the characters after they have been extracted from the file?

    How are they represented @ the moment, Unicode ?

    Are you familiar with the Encode module, it allows you to define a mapping for characters.

    update: to check on the available encodings use the following one liner (adjust quotes for windows)

    perl -e 'use Encode;@list = Encode->encodings(":all"); for $schema (@l +ist){print "$schema\n";}'
Re: Non Keyboard Characters to HTML entity
by vinoth.ree (Monsignor) on Aug 05, 2009 at 07:17 UTC