in reply to Combining Ffile:fetch with MySQL
Could you take a couple of example documents and show what the full URL is to these documents for say June 2022 and July 2022?.
Are you saving the .pdf doc as a "BLOB" inside the SQLite DB or are you just saving a directory path on your local machine?
I have a similar web application. My app runs once per hour, looks around on part of a particular website for any new links. If it finds one, it "clicks" on it, to see there is anything there is "interesting or not". If so, the interesting data is saved in an SQLite DB. In any event, I save the URL in the DB so that I don't go there again. If nothing changed on the website, it figures that out very efficiently. This thing has been running every hour for the past 6 years, so it is possible for apps like this to work out very well. I use WWW::Mechanize but you don't seem to need the sophistication of making sense of a webpage? Or do you?
Update:
Could you show your table schema?
I would be thinking along the lines of:
URL text (where the data came from)
Version Datetime (this is actually just text (not numeric) yyyy-mm-dd hh:mm:ss)
- you can omit time and I think also the dd if not available
- leading zeroes are mandatory because this column must be in
- ASCII sort order
Downloaded Datetime (optional but often handy to know)
title text (Name of the document)
pdf blob (actual pdf file)
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^2: Combining Ffile:fetch with MySQL
by justin423 (Scribe) on Jul 18, 2022 at 05:57 UTC | |
by Marshall (Canon) on Jul 18, 2022 at 18:28 UTC |