in reply to Extracting MS Word text and encoding HTML entities

Just use the utf-8 encoded file and put it directly in the HTML file with utf-8 encoding specified in the HTTP headers, or META tag. No encode_entities() needed.

see the UTF-8 and unicode FAQ.

  • Comment on Re: Extracting MS Word text and encoding HTML entities