Hi monks!

I'm looking for a way to get some .doc files (MS Word 97+ documents) and convert automatically to HTML! The documents only cant have texts and tables (what is easier do convert).

This is for a management system, where the documents that they need to fill and send to the directors in the main office stay in the internet. So, the user open the html version of the doc in the internet and fill the inputs of the formulary, insted of open in the Word and type every thing. This is good, since when the user send the formulary (HTML), the name and values of the inputs are catched and saved in some DB. In the Word version, the insertion of data in the DB need to be made by hand, reading the printed version.

But why Word? The directors like to write their documents on it, and don't know HTML. We know that you can save a .doc to .html from the Word editor, but we saw that teach that for all the peoples that will write the docs don't work for all. So, we want to get the .doc directly, and convert it internally by the system.

I'm looking for a way to make this convertion automatically, and better if it can work on Linux. But if only an Win32 solution come is fine.

Graciliano M. P.
"The creativity is the expression of the liberty".


In reply to There is a way to convert .doc to .html automatically from Perl? by gmpassos

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.