The issue with this approach is where the HTML document may include example HTML, including <body></body> tags, within <pre></pre> tags. The regular expression which BrowserUK has provided appears to be somewhat more robust, although I suspect that I will follow his suggestion to try reading the file backwards for the first </body> tag.