Beefy Boxes and Bandwidth Generously Provided by pair Networks
Syntactic Confectionery Delight
 
PerlMonks  

Re: Re: Non-standard options to tags for CGI.pm

by dragonchild (Archbishop)
on Jan 11, 2002 at 00:25 UTC ( #137832=note: print w/replies, xml ) Need Help??


in reply to Re: Non-standard options to tags for CGI.pm
in thread Non-standard options to tags for CGI.pm

Cool script, but there's only one problem:
scorpion:-> html2cgi.pl tucson.htm Subroutine HTML::TreeBuilder::comment redefined at html2cgi.pl line 11 +9. ## unknown head tag: <meta name="Excel Workbook Frameset"> ## unknown head tag: <meta content="text/html; charset=us-ascii" http- +equiv="Content-Type"> ## unknown head tag: <meta content="Excel.Sheet" name="ProgId"> ## unknown head tag: <meta content="Microsoft Excel 9" name="Generator +"> ## unknown head tag: <link href="tucson_files/filelist.xml" rel="File- +List"> ## unknown head tag: <comment>[if gte mso 9]&gt; &lt;xml&gt; &lt;o:DocumentProperties&gt; &lt;o:LastAuthor&gt;kinyoro&lt;/o:LastAuthor&gt; &lt;o:Created&gt;2001-06-25T12:34:28Z&lt;/o:Created&gt; &lt;o:LastSaved&gt;2001-06-25T12:34:28Z&lt;/o:LastSaved&gt; &lt;o:Version&gt;9.2720&lt;/o:Version&gt; &lt;/o:DocumentProperties&gt; &lt;o:OfficeDocumentSettings&gt; &lt;o:DownloadComponents/&gt; &lt;/o:OfficeDocumentSettings&gt; &lt;/xml&gt; &lt;![endif]</comment> ## unknown head tag: <link href="tucson_files/sheet001.htm" id="shLink +"> ## unknown head tag: <link href="tucson_files/sheet002.htm" id="shLink +"> ## unknown head tag: <link href="tucson_files/sheet003.htm" id="shLink +"> ## unknown head tag: <link id="shLink"> ## unknown head tag: <script language="JavaScript">
Ummm... help?

------
We are the carpenters and bricklayers of the Information Age.

Don't go borrowing trouble. For programmers, this means Worry only about what you need to implement.

Replies are listed 'Best First'.
Re(3): Non-standard options to tags for CGI.pm
by dmmiller2k (Chaplain) on Jan 11, 2002 at 06:53 UTC

    Somewhere recently I ran across something about a tool that strips out junk from Microsoft Office generated HTML (all that 'mso...' stuff). Come to think of it, it was HTML Tidy, which comes as an integral part of HTML Kit, a freely downloadable tool.

    If you have no intention of ever converting these files (or the output of your CGI-based perl code) BACK to Excel, it might be a viable option to download the command-line version of HTML Tidy and run it from your script before html2cgi.pl.

    dmm

    You can give a man a fish and feed him for a day ...
    Or, you can
    teach him to fish and feed him for a lifetime
      HTML Tidy has gone open source and is now at Sourceforge! It is being refactored into a C library and there is a new project to make it into an XS module. I expect a fruitful future for this wonderful program.

      It should work perfectly the first time! - toma

      Yah, but the problem is that I need to convert back to Excel. *sighs*

      ------
      We are the carpenters and bricklayers of the Information Age.

      Don't go borrowing trouble. For programmers, this means Worry only about what you need to implement.

        Whoops! :)

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://137832]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others chilling in the Monastery: (6)
As of 2022-08-08 07:11 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found

    Notices?