Hai i am using SWISH::Filter module for converting pdf to html
while running a file the following error is comming so please help me to solve this problem
the error is:
3075 Warning - ela: Unknown filter config setting 'user_data' Error: May not be a PDF file (continuing anyway) Error: PDF file is damaged - attempting to reconstruct xref table... Error: Couldn't find trailer dictionary Error: Couldn't read xref table Error: May not be a PDF file (continuing anyway) Error: PDF file is damaged - attempting to reconstruct xref table... Error: Couldn't find trailer dictionary Error: Couldn't read xref table <html> <head> </head> <body> <pre> </pre> </body> </html>
the code is:
use SWISH::Filter; # load available filters into memory my $filter = SWISH::Filter->new; my $real_path="/vos/spain/test.pdf"; # convert a document my $doc = $filter->convert( document => \$real_path, # path or ref to a doc content_type => 'application/pdf', # content type if doc refer +ence name => 'ela', # optional name for this file (usefu +l for debugging) user_data => $whatever, # optional data to make availab +le to filters ); return unless $doc; # empty doc, zero size, or no filters installed # Was the document converted by a filter? my $was_filtered = $doc->was_filtered; # Skip if the file is not text return if $doc->is_binary; # Print out the doc my $doc_ref = $doc->fetch_doc; print $$doc_ref; # Fetch the final content type of the document my $content_type = $doc->content_type; # Fetch Swish-e parser type (TXT*, XML*, HTML*, or undefined) my $doc_type = $doc->swish_parser_type;
In reply to How to convert a pdf file in to HTML by Anonymous Monk
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |