in reply to Re: parsing table .doc
in thread parsing table .doc

"That said, Word docs are XML these days"

Docx (OOXML) files are compressed archives, containing XML among other things. Doc files are a proprietary binary format.