in reply to Code for 'Save html page' that contains dynamic content?

Do you actually want the formatted pages or do you want to extract data from them? If you want to extract data from them, check out:

This is the JSON data which the Javascript on those pages is fetching and displaying.

You can use JSON::PP to parse the JSON and do what you want with the data.
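A minimal sketch of that approach, assuming the endpoint returns a JSON object with a list of records. The field names (`objects`, `pdbid`, `name`) and the sample data are hypothetical placeholders, so inspect the real response and adjust them. HTTP::Tiny and JSON::PP are both core modules; HTTP::Tiny needs IO::Socket::SSL installed for https URLs.

```perl
use strict;
use warnings;
use HTTP::Tiny;
use JSON::PP;

# Fetch the raw JSON text from a URL (the same URL the page's JavaScript requests).
sub fetch_json_text {
    my ($url) = @_;
    my $res = HTTP::Tiny->new( timeout => 30 )->get($url);
    die "GET $url failed: $res->{status} $res->{reason}\n" unless $res->{success};
    return $res->{content};
}

# Decode the JSON and pull out the fields of interest.
# 'objects', 'pdbid' and 'name' are assumed names, not taken from the real API.
sub extract_records {
    my ($text) = @_;
    my $data = JSON::PP->new->utf8->decode($text);
    return map { +{ id => $_->{pdbid}, name => $_->{name} } }
        @{ $data->{objects} // [] };
}

# Made-up sample so the script runs without network access.
my $json_text = '{"objects":[{"pdbid":"1a0t","name":"Example protein"}]}';

# Pass a URL on the command line to use live data instead of the sample.
$json_text = fetch_json_text( $ARGV[0] ) if @ARGV;

my @records = extract_records($json_text);
print "$_->{id}: $_->{name}\n" for @records;
```

To find the right URL for a given site, open the browser's developer tools, watch the network tab while the page loads, and copy the request that returns JSON.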


Re^2: Code for 'Save html page' that contains dynamic content?
by bliako (Abbot) on Jun 28, 2023 at 22:25 UTC

    Yes, the API is at https://opm.phar.umich.edu/download. If the OP is serious about this, they should start from there and build the crawler. That said, BioPerl was already mentioned, and AFAIR R has packages for downloading some types of bio-data (not sure which).

Re^2: Code for 'Save html page' that contains dynamic content?
by Anonymous Monk on Jun 28, 2023 at 07:27 UTC
    Thanks! I had not seen that. But this was just an example, because I have other ones as well, like:
    http://pdbtm.enzim.hu/?_=/pdbtm/1a0t

    where I want to check the coloured letters.
    In any case, if you have any suggestions (or code) that can be used for such tasks, it would be great :)