Thanks for all of the suggestions. I downloaded the Swish-e engine and am quite pleased with the end result. It's blazingly fast and, with the help of the SWISH::API module, I had a very nice CGI available for my users in no time.

As for all the questions about the OCR, I cannot answer them. Unfortunately for my firm, the documents in question have been OCRed by the opposing counsel and the quality does not seem to be all that great. Despite their daunting entry into the "evidence obfuscation contest", Swish-e is doing its job nicely. Of course, there's still not much I can do about bad OCR...

Now I just need to figure out how to implement the "print all results" button that I've been asked to add to my script ("all results" often seems to entail a few thousand TIFF files). I'm sure there's a node about that here somewhere...

Thanks again,

Ariel


In reply to Re: Searching many files by ariel2
in thread Searching many files by ariel2
