Well..well... I will be very curious what others will tell you.

With my limited knowledge, this how I would solve this problem: I would write a JScript in Windows that loads the website, enters the dates into the website, performs the search, presses CTRL+A to select all and then copy. Then save the data from clipboard. Fortunately, this comes in a format that can be easily pasted into Excel spreadsheet or a text file. So, all the data is in a format that can be processed easily. So, once the first page is saved, then I would load the next page, then save that one, load the next page, save, and so forth.

How much data do you need to download using this method? How many pages? Do you want to download the entire website?

This site contains millions of entries of data (73,902,123 records to be precise). Each one can be accessed individually from the website like this where you can modify the record number: https://desmace.com/tramite?id=71015976 That number at the end of the URL is the record number. You can change it to any number between 1 and 73,902,123, and it will show you one record. The problem is, of course, the individual records are not sorted by date. So, you can't just download all the records between X and Y. They seem to be just random nodes. And if you tried to download each piece of data, it might take about 100 years to download each individual page. But if you could do that, then you would have a copy of the entire database on your device, which you could format and search and filter in way imaginable using nothing but Perl.

I am not sure what your goal is, so I don't know what to tell you.

What I did find was this website is available for Android and Windows as an app. ...which means it probably has a connection point where you could just tell the site to give you a short list of raw data. There is also an English version of this website, which helped me understand what this whole thing is about. https://stolen.desmace.com/list-stolen-cars-spain/#search-form (I don't understand any Spanish.)


In reply to Re: Script to scrap data by harangzsolt33
in thread Script to scrap data by Anonymous Monk

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.