in reply to extract text from pdf

What do you mean "involved managing temporary files"?
open my $fh, "pdftotext whatever.pdf - |" or die "Cannot run pdftotext: $!"; ... read text from $fh ...

Re^2: extract text from pdf
by LANTI (Sexton) on Apr 24, 2012 at 08:46 UTC

    If I just want the PDF's text to use for whatever (save it in a database, ...), I find this line quite convenient:

    my $txt = `pdftotext whatever.pdf -`;
    die 'ERROR running pdftotext' if $? != 0;   # "or die" would also fire on empty output
    say $txt;                                   # needs "use feature 'say'"
    Or, if the file name is in a variable and the PDF file contains umlauts or other non-ASCII characters:

    my $command_line = qq{pdftotext -enc 'UTF-8' '$path' -};   # breaks if $path contains a single quote
    my $text = `$command_line`;
    die 'ERROR running pdftotext' if $? != 0;
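If the file name can contain quotes or spaces, the list form of a pipe open avoids the shell entirely. What follows is only a minimal sketch, not a tested drop-in: the helper names capture_utf8 and pdf_to_text are made up for illustration, and it assumes pdftotext is on the PATH and a Perl recent enough to support list-form pipe opens on your platform. It also decodes the output, so the umlauts arrive as Perl characters rather than raw UTF-8 bytes.

```perl
use strict;
use warnings;

# Run a command without going through the shell and return its
# standard output decoded as UTF-8. Dies if the command cannot be
# started or exits with a non-zero status.
# Note: pass at least two elements (program plus arguments); a single
# string containing shell metacharacters would still go through the shell.
sub capture_utf8 {
    my @cmd = @_;

    # List-form pipe open: each argument is passed to the program as-is,
    # so quotes or spaces in a file name need no escaping.
    open(my $fh, '-|', @cmd) or die "Cannot start $cmd[0]: $!";
    binmode($fh, ':encoding(UTF-8)');

    # Slurp everything the command writes to stdout.
    my $text = do { local $/; <$fh> };

    # close() on a pipe returns false when the child exits non-zero.
    close($fh) or die "$cmd[0] failed (exit status " . ($? >> 8) . ")";

    return $text // '';
}

# Hypothetical wrapper: -enc UTF-8 and the trailing '-' (write to
# stdout) are the same pdftotext arguments used in the one-liner above.
sub pdf_to_text {
    my ($path) = @_;
    return capture_utf8('pdftotext', '-enc', 'UTF-8', $path, '-');
}
```

Usage would then be something like my $text = pdf_to_text($path); and an empty PDF simply yields an empty string instead of tripping the die.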