Tobiwan has asked for the wisdom of the Perl Monks concerning the following question:

Has anyone used the module PDF::FromHTML with Unicode characters? My HTML contains German characters like 'ö', and it will soon contain Russian text. It doesn't matter whether I write them as UTF-8, as ö, or XML-encoded: all non-ASCII characters are deleted.

I tried to set a Unicode font, but the module uses the PDF::API2::Resource::Font::CoreFont::* fonts, which never have any Unicode definition. Any ideas? Or does anybody know another module for generating PDF from HTML with Unicode characters?

Tobiwan

Re: PDF::FromHTML and UTF8
by Joost (Canon) on Dec 07, 2006 at 21:57 UTC
      It's not that simple with a UTF-8 TTF :( Internally the module uses the character sets from PDF::API2::Resource::Font::CoreFont::*. In these tables every character is described. If I use a UTF-8 TTF, I have to write those descriptions myself, and I don't want to do that for thousands of characters. Can I use PDF::API2 to create PDFs from HTML directly?
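On the last question: as far as I know, PDF::API2 does not parse HTML itself, but it can embed a TrueType font and write Unicode text directly, which sidesteps the CoreFont tables. Below is a minimal sketch of that part only, not a replacement for PDF::FromHTML. The helper name write_unicode_pdf, the font path given on the command line, the output name unicode.pdf and the page coordinates are all assumptions for illustration; the font must actually contain the glyphs you need.

```perl
use strict;
use warnings;
use utf8;        # the string literals at the bottom are UTF-8 source text
use PDF::API2;

# Hypothetical helper: writes each (already decoded) line of text onto one
# A4 page using an embedded TrueType font instead of a core font.
sub write_unicode_pdf {
    my ($font_path, $out_path, @lines) = @_;

    my $pdf  = PDF::API2->new();
    my $page = $pdf->page();
    $page->mediabox('A4');

    # ttfont() loads and embeds the TrueType file given by $font_path
    my $font = $pdf->ttfont($font_path);
    my $text = $page->text();
    $text->font($font, 12);

    my $y = 780;                      # start near the top of the page
    for my $line (@lines) {
        $text->translate(50, $y);     # move to the start of the next line
        $text->text($line);
        $y -= 16;                     # simple fixed line spacing
    }

    $pdf->saveas($out_path);
    return $out_path;
}

# Only runs when a font file is given: perl script.pl /path/to/font.ttf
if (@ARGV) {
    write_unicode_pdf($ARGV[0], 'unicode.pdf', 'Grüße: ä ö ü ß', 'Привет, мир');
}
```

Text read from a file or from HTML would need to be decoded to Perl character strings first (for example with Encode::decode('UTF-8', ...)), and turning HTML markup into layout would still be up to you.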