LanX has asked for the wisdom of the Perl Monks concerning the following question:

Hi

I'm having a big amount of PDF documents in a directory from various sources (word, scan, latex) which I would like to organize.

More or less the same way I can find PDFs with embedded text by using grep.

My idea is to add keywords/tags and to use an application to filter them by those keywords.

Any suggestions how to do this from command line, preferably with Perl?

Cheers Rolf

Replies are listed 'Best First'.
Re: tagging PDFs with keywords and filtering them later
by snoopy (Curate) on Nov 19, 2012 at 21:38 UTC
    Image::ExifTool is a module for adding keywords and other searchable metadata to PDF files and various other image formats.

    You can use the OO interface, or its command line utility exiftool

    % # add keywords to a PDF % exiftool -keywords=perl -keywords=snoopy test.pdf % % # read them back % exiftool -keywords test.pdf Keywords : perl, snoopy
      Awesome! Thanks :)

      Cheers Rolf

      PS: CPAN rules! Really! =)

Re: tagging PDFs with keywords and filtering them later
by Anonymous Monk on Nov 19, 2012 at 18:18 UTC