Yes, I do not need any formatting, just plain text, and Antiword (which I had never heard of before) seems to produce exactly what I need. The result is surprisingly clean.
Thank you very much, aitap. I think we will most likely go for a solution based on it.