in reply to PDF extract
Manipulating PDF files is quite complex and I found that I had to use both modules to achieve what you are trying to do as PDF::API2 doesn't seem to have any way of extracting text and CAM::PDF doesn't seem to have any way of adding an empty page.
Hopefully this will help you on your way:
use strict; use warnings; use CAM::PDF; use PDF::API2; my $pdfone = CAM::PDF->new('input.pdf'); my $pdftwo = PDF::API2->open('output.pdf'); my $font = $pdftwo->corefont('Helvetica-Bold'); for my $pagenum (1 .. $pdfone->numPages() ) { my $text = $pdfone->getPageText($pagenum) or next; my $page = $pdftwo->page(); # add a new page my $pdf_text = $page->text(); $pdf_text->font($font,12); my @lines = split("\n",$text); my ($x,$y) = (50,700); for my $line (@lines) { $pdf_text->translate($x,$y); $pdf_text->text($line); $y = $y - 20; } } $pdftwo->saveas('output.pdf');
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^2: PDF extract
by PerlSufi (Friar) on Apr 01, 2013 at 13:59 UTC | |
by tangent (Parson) on Apr 02, 2013 at 09:15 UTC | |
by PerlSufi (Friar) on Apr 03, 2013 at 17:17 UTC | |
by PerlSufi (Friar) on Apr 03, 2013 at 21:37 UTC | |
by soonix (Chancellor) on Apr 04, 2013 at 00:29 UTC |