in reply to How to build a Search Engine.

For converting PDF to plain text, you could include CAM::PDF alongside pdftotext. I have only tried it on a basic PDF, but it seems to do the job, and it's Perl.
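
In case it helps, here is a minimal sketch of the CAM::PDF side, assuming the module is installed from CPAN. The helper name pdf_to_text and the form-feed page separator are my own choices for illustration, not part of the module. As with my quick trial, I would only expect this to work well on simple PDFs.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use CAM::PDF;

# Hypothetical helper: return the plain text of every page in a PDF,
# with pages separated by a form feed (an arbitrary choice).
sub pdf_to_text {
    my ($path) = @_;

    # new() returns undef on failure and sets $CAM::PDF::errstr
    my $pdf = CAM::PDF->new($path)
        or die "Cannot open $path: $CAM::PDF::errstr\n";

    my @pages;
    for my $n (1 .. $pdf->numPages()) {
        # Page numbers are 1-based; guard against pages with no text
        my $text = $pdf->getPageText($n);
        push @pages, defined $text ? $text : '';
    }
    return join "\f", @pages;
}

# Usage: perl pdf2txt.pl file.pdf > file.txt
print pdf_to_text($ARGV[0]) if @ARGV;
```

The resulting string could then be fed to the indexer the same way as the output of pdftotext.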

Thanks for this meditation/tutorial :)

Flavio
perl -ple'$_=reverse' <<<ti.xittelop@oivalf

I understood... but what did you say?

Re^2: How to build a Search Engine.
by Anonymous Monk on Jun 26, 2009 at 01:08 UTC
    Does anybody know how to start a Java application on the Apache web server? I am trying to install a search engine using LEXST-SEA, because I expect our search engine to become very large in the future. The system supplies a web server (Tomcat) pre-bound to the Java application. My problem is that we don't want to use its Tomcat; we want to use Apache. How can I start the Java application in Apache? (The search engine software is here: http://www.lexst.com) Thank you, windwasher