in reply to Re: A copyeditor needs help to get started with a Perl project
in thread A copyeditor needs help to get started with a Perl project

Working with the HTML version of the Word file is an interesting idea. By the way, I just need to read the file; writing will be to a new report file. The original Word file will remain untouched. I will enter the corrections manually. The "automation" in my original post only refers to the process of identifying inconsistencies in the document.

But can Perl recognize document elements such as headers, fonts, superscripts, tables, etc., in the HTML file? Let me give you a sample real-life scenario. Let us say a chapter has 100 numbered reference items at the end of the chapter, which are cited in text by superscripted integers. The problem: generate a report that will identify the reference items that have not been cited in the text.

To accomplish this, Perl would have to recognize superscripted elements in the HTML file. Pardon my ignorance, but can it?

  • Comment on Re^2: A copyeditor needs help to get started with a Perl project

Replies are listed 'Best First'.
Re^3: A copyeditor needs help to get started with a Perl project
by gaal (Parson) on Nov 05, 2004 at 09:44 UTC
    Yes, it can.

    I don't have sample data at hand right now, so I can't give you the exact example, but if you inserted the footnote (or endnote) with the standard Insert Footnote menu item, the integer is well tagged with something like "<span class="footnoteReference">....</span>" tags. If you're on a Windows machine, just save a demo document as HTML and look at the result with a text editor, it should be clear enough. Don't be intimidated by the several KBs of CSS in the beginning :)