in reply to reg exp question
Customarily you'd want to use one of the modules designed to parse HTML, but if your data is exactly as stated (are you sure?), split may be a workable approach.
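
For what it's worth, here is a rough sketch of the split idea. The data from the original question isn't reproduced here, so the sample line and the helper name fields_between_tags are made up for illustration. It assumes simple, un-nested tags with no '>' inside attribute values.

```perl
use strict;
use warnings;

# Hypothetical helper: split a line on anything that looks like a tag
# and keep the non-empty text found between tags.
sub fields_between_tags {
    my ($line) = @_;
    # Adjacent tags yield empty strings from split; grep drops them.
    return grep { length } split /<[^>]+>/, $line;
}

# Assumed sample input, not the original poster's data.
my $line = '<tr><td>alpha</td><td>beta</td><td>gamma</td></tr>';
print join('|', fields_between_tags($line)), "\n";
```

If the real input turns out to have nested markup, tags wrapped across lines, or a '>' inside an attribute, this will quietly give wrong fields, and that is the point where a parsing module such as HTML::Parser is the safer choice.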