in reply to Extracting span and meta content with HTML::TreeBuilder
poj

#!perl
use strict;
use HTML::TreeBuilder 5 -weak;

my $tree = HTML::TreeBuilder->new;
$tree->parse_file(\*DATA);

my @items = $tree->look_down( '_tag', 'meta' )
  or die("no items: $!\n");

for my $item (@items) {
  print $item->attr('itemprop');
  print ' = ';
  print $item->attr('content')."\n";
}

__DATA__
<div class="review-content">
  <div class="biz-rating biz-rating-very-large clearfix">
    <div itemtype="http://schema.org/Rating" itemscope="" itemprop="reviewRating">
      <div class="rating-very-large">
        <i title="4.0 star rating" class="star-img stars_4">
          <img width="84" height="303" src="http://blah/v2/stars_map.png" class="offscreen" alt="4.0 star rating">
        </i>
        <meta content="4.0" itemprop="ratingValue">
      </div>
    </div>
    <span class="rating-qualifier">
      <meta content="2011-01-13" itemprop="datePublished">
      1/13/2011
    </span>
  </div>
  <p lang="en" itemprop="description" class="review_comment ieSucks">
    blah!!
  </p>
</div>
Re^2: Extracting span and meta content with HTML::TreeBuilder
by wrinkles (Pilgrim) on Jul 16, 2014 at 21:32 UTC
by poj (Abbot) on Jul 16, 2014 at 21:35 UTC
by wrinkles (Pilgrim) on Jul 16, 2014 at 22:17 UTC
by tangent (Parson) on Jul 17, 2014 at 01:54 UTC
by poj (Abbot) on Jul 17, 2014 at 12:20 UTC
by wrinkles (Pilgrim) on Jul 18, 2014 at 01:41 UTC