in reply to advice for a project

I'm not sure exactly what form your data will take, but I guess you may want to use it to fill in templates.

Perhaps this may be of interest: Re^3: Extracting structured data from unstructured text - just how difficult would this be?