I found a simple solution. I simply wrote a script to drive convert which is part of the ImageMagick package. It has the capability of pulling all of the images from a pdf and storing them in any format that I want. Now it's on to the OCR step. I think I'll do that one in the Java because of how easy it is to draw on the screen.