in reply to Re: Indexing of Word documents
in thread Indexing of Word documents

Thanks for all suggestions. Here are some more details. This script is to create an index for a book. The word files will only reside on one computer, and so, the issues with changing computers and different printers goes away. Using paragraphs does not work since any paragraph could be on 2 pages at once and then a page number associated with a name would be wrong. I am not forced to do this with Word. So changing to pdf could be an option since an index for a book can be provided with a pdf file. I was asking about using pdf, but could not determine if page numbers are associated with the text. For instance, if I search for bob jones in the pdf file, is there meta-data that tells what page number that name appears?

Replies are listed 'Best First'.
Re^3: Indexing of Word documents
by axiomcrs (Initiate) on Jun 10, 2013 at 19:23 UTC
    hey flexvault, does the pdftohtml program give page numbers as a metadata for the text?