in reply to I want to save web pages as text rather than as HTML.
Modern Mozilla browsers no longer save the original page source; they serialize the current DOM tree instead. If the information you seek is missing from the page source but does appear in the saved file, then it is being added to the page by JavaScript. You will need to use the Web Developer Tools (Network tab) in Firefox to find the request that loads that data, then figure out how to replicate that request and parse the response (probably JSON) in your Perl code.
Finding the request you need to make is the hard part. Making the request with LWP::UserAgent and parsing the response with JSON should be easy.
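Once you have found the request, the Perl side might look roughly like the sketch below. The URL, the `items` array, and the `title` field are all made up for illustration; substitute whatever the Network tab actually shows you. The sub names `fetch_body` and `extract_titles` are mine, too.

```perl
use strict;
use warnings;
use LWP::UserAgent;
use JSON qw(decode_json);

# Hypothetical endpoint: replace with the URL seen in the Network tab.
my $DATA_URL = 'https://example.com/api/items';

# Fetch the raw response body, dying on any HTTP error.
sub fetch_body {
    my ($url) = @_;
    my $ua  = LWP::UserAgent->new( timeout => 10 );
    my $res = $ua->get( $url, 'Accept' => 'application/json' );
    die "Request failed: " . $res->status_line . "\n"
        unless $res->is_success;

    # Undo any Content-Encoding (e.g. gzip) but keep the raw bytes,
    # because decode_json expects UTF-8 encoded bytes, not characters.
    return $res->decoded_content( charset => 'none' );
}

# Decode the JSON and pull out the text fields of interest.
# Assumed (hypothetical) structure: { "items": [ { "title": "..." }, ... ] }
sub extract_titles {
    my ($json_text) = @_;
    my $data = decode_json($json_text);
    return map { $_->{title} } @{ $data->{items} };
}
```

You would then call it with something like `print "$_\n" for extract_titles( fetch_body($DATA_URL) );`. Some sites also check headers such as cookies or Referer before answering; if the plain request fails, compare it against the headers shown for that request in the Network tab.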
Re^2: I want to save web pages as text rather than as HTML.
by anautismobserver (Sexton) on Sep 12, 2019 at 02:33 UTC
by marto (Cardinal) on Sep 12, 2019 at 06:41 UTC