Special_K has asked for the wisdom of the Perl Monks concerning the following question:
I am trying to write a script to scrape a webpage but am not a web developer and am having trouble figuring out which module I need to install. Using Firefox's Web Developer Inspector tool, I see that an object I have highlighted has a hierarchical position in the webpage's html and sequence of
<div class="grouped-item product-purchase-wrapper-7117">
There are multiple such objects on the webpage, each with their own unique number in place of "7117", but with all other text in the div identical to the example above (and at the same level of hierarchy in the webpage). What I would like, if it exists, is a module that will read the webpage's hierarchy into a data structure, allow me to specify a base path to a specific point in the webpage's hierarchy, and then allow me to iterate over all objects/div sections at that level of hierarchy and below. Which module should I use?
|
|---|