I have a database in which each node is a hyperlink. I want to identify which links are to .jpg files and extract as much info as possible from the header of the files.
Has anyone done something like this before? What libraries should I look at to get started?