in reply to Re^2: Converting PDF file to text
in thread Converting PDF file to text
Most likely the version of Java on your web server does not work with the version of Java the Tika JAR file requires. I can't help you there.
I'm sorry that the synopsis of Apache::Tika::Async is broken - it should look like the following, but it seems I never released that fix onto CPAN:
use Apache::Tika::Async; my $tika= Apache::Tika::Async->new; my $fn= shift; use Data::Dumper; my $info = $tika->get_all( $fn ); print Dumper $info->meta($fn); print $info->content($fn); # <html><body>... print $info->meta->{"meta:language"}; # en
But all of this is in vain if the Tika executable won't start.
Update: I've now published the Git repository of the module, which contains some fixes I should also release soonish.
|
|---|