in reply to Re: parsing table .docin thread parsing table .doc
"That said, Word docs are XML these days"
Docx (OOXML) files are compressed archives, containing XML among other things. Doc files are a proprietary binary format.