IB2017 has asked for the wisdom of the Perl Monks concerning the following question:
Hello
I have a (huge) bunch of PDF files with highlighted textual parts in them. I need to programmatically extract these parts. I am already able to extract text from PDFs, but without any distinction on however I was wandering if somebody knows of any module/approach that could help me in doing this. There are software out there that are capable of doing this (for example Zotero), so it should be technically possible, however I haven't found any module that can help me in implementing this nor any information on the Web pointing at some solution. Any ideas?
|
---|
Replies are listed 'Best First'. | |
---|---|
Re: Read highlighted text from PDF
by vr (Curate) on Sep 28, 2018 at 11:16 UTC | |
Re: Read highlighted text from PDF
by LanX (Saint) on Sep 28, 2018 at 00:35 UTC | |
Re: Read highlighted text from PDF
by bliako (Abbot) on Sep 28, 2018 at 09:30 UTC | |
Re: Read highlighted text from PDF
by ablanke (Monsignor) on Sep 28, 2018 at 09:40 UTC |