avskrm has asked for the wisdom of the Perl Monks concerning the following question:

Hello, I need to extract all the images from a PDF file on my server. I don't want the PDF pages, only the images at their original size and resolution. How could I do this with PHP or any other UNIX-based app (which I would invoke with the exec function from PHP)? Thanks!

Re: How to extract the images from a PDF?
by CountZero (Bishop) on Jan 10, 2009 at 08:19 UTC
    I have no idea how to do it in PHP, but Perl can do it in three lines:
    use PDF::GetImages 'pdfimages';
    $PDF::GetImages::FORCE_JPG = 1;
    pdfimages('/path/to/file.pdf');
    You will then find the images in the same directory as the PDF file.
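
For anyone who would rather call the underlying command-line tool directly (which is also what you could run through exec from PHP), here is a minimal sketch. It assumes the pdfimages utility from xpdf or poppler-utils is installed and on the PATH, which as far as I know is the tool PDF::GetImages itself relies on. The helper names pdfimages_command and extract_images, the output directory, and the 'img' file prefix are made up for illustration.

```perl
use strict;
use warnings;
use File::Path qw(make_path);
use File::Spec;

# Build the argument list for the pdfimages CLI.
# -j asks pdfimages to write JPEG-encoded images as .jpg files;
# other images are written in its default PBM/PPM formats.
sub pdfimages_command {
    my ($pdf, $out_dir, $prefix) = @_;
    return ('pdfimages', '-j', $pdf, File::Spec->catfile($out_dir, $prefix));
}

# Run pdfimages and return the names of the files it produced.
sub extract_images {
    my ($pdf, $out_dir) = @_;
    make_path($out_dir);    # create the output directory if it is missing

    # List-form system() bypasses the shell, so odd file names are safe.
    my @cmd = pdfimages_command($pdf, $out_dir, 'img');
    system(@cmd) == 0
        or die "pdfimages failed (wait status $?)\n";

    # pdfimages names its output <prefix>-NNN.<ext>
    opendir(my $dh, $out_dir) or die "Cannot open $out_dir: $!\n";
    my @files = sort grep { /^img-/ } readdir $dh;
    closedir $dh;
    return @files;
}

# Only runs when called as: perl extract.pl file.pdf output_dir
if (@ARGV == 2) {
    print "$_\n" for extract_images(@ARGV);
}
```

The images are copied out as they are stored in the PDF, so they keep their embedded pixel dimensions rather than being re-rendered at page resolution. From PHP, the equivalent would be passing the same pdfimages command line to exec.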

    CountZero

    "A program should be light and agile, its subroutines connected like a string of pearls. The spirit and intent of the program should be retained throughout. There should be neither too little nor too much, neither needless loops nor useless variables, neither lack of structure nor overwhelming rigidity." - The Tao of Programming, 4.1 - Geoffrey James

Re: How to extract the images from a PDF?
by Anonymous Monk on Jan 10, 2009 at 08:52 UTC