in reply to Can I convert a pdf to html with PDF::Extract??
    use PDF::Extract;
    $pdf = new PDF::Extract( PDFDoc=>'C:/my.pdf' );
    $pdf->getPDFExtract( PDFPages=>$PDFPages );
    print "Content-Type: text/plain\n\n<xmp>", $pdf->getVars("PDFExtract");
    print $pdf->getVars("PDFError");

    # or

    # Extract and save, in the current directory, all the pages in a pdf document
    use PDF::Extract;
    $pdf=new PDF::Extract( PDFDoc=>"test.pdf");
    $i=1;
    $i++ while ( $pdf->savePDFExtract( PDFPages=>$i ) );
Update to provide more thorough information:
The POD also discusses the following:
With PDF::Extract a new PDF document can be:-
So I guess the short answer is that yes, this is an appropriate tool for the job. The example under the heading servePDFExtract shows how to output to STDOUT with the correct header for a PDF document served on the web.
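For reference, a minimal sketch of what that servePDFExtract usage might look like. I am assuming it takes the same PDFPages argument as getPDFExtract and savePDFExtract above and prints the HTTP header itself; the file name and page number are placeholders, so check the POD before relying on it.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use PDF::Extract;

# Placeholder document name; point this at a real PDF on the server.
my $pdf = PDF::Extract->new( PDFDoc => 'test.pdf' );

# Assumption: servePDFExtract accepts PDFPages like the other methods
# and writes the header plus the extracted page to STDOUT.
$pdf->servePDFExtract( PDFPages => 1 );

# PDFError is read the same way as in the example above.
my $error = $pdf->getVars("PDFError");
warn "PDF::Extract error: $error\n" if $error;
```

Note that this serves the extracted page as a PDF, not as HTML.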
Dave