The Background

MSFT Internet Explorer has a somewhat useful feature called MHT Web Archive. What it does is save an entire web page, including images, to a single file. An added bonus is the MHT format apparently uses straightforward, non-proprietary base64 encoding to handle the images.

The Question

This useful feature is apparently no longer supported. Now with a bunch of MHT files sitting around that are no longer readable, it is time for Perl to step in and convert all those MHT files into something else. The question is, what should that something else be, and do I have to code it myself or has someone else already gone down this road.

Absent any insight to the contrary, it looks like I will have to slap together a perl script to grab each 'region' from each MHT file, and export the regions as HTML, GIF, JPG (etc) files. This ruins the single-file-encapsulation feature, but at least makes the content readable again.

Any better ideas out there greatly appreciated.


In reply to MSFT Explorer MHT File Munging by dimar

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.