A few things:
- The /s flag indicates that the dot is presumed to include newlines.
- The /g flag indicates that this pattern is going to return a list of all matches.
So, in effect, you will only have a single $1 at the end, but you wouldn't use those. Instead:
my @grabbed = $page =~ m#<b>(.*?)</b>#sig;
A few changes. First, dot-star will grab as much as it can, being
greedy. This isn't good since you'll get everything from the first
<B> tag to the last close, or in other words, one big match instead of smaller ones. The question mark causes the regular expression to find the shortest match instead.
The
/i modifier also catches tags that are capitalized. Another trick is to use the hash mark instead of slash, so that you don't have to escape your slashes.
If you're feeling more adventuresome, you might want to check out
HTML::Parser.
Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
Read Where should I post X? if you're not absolutely sure you're posting in the right place.
Please read these before you post! —
Posts may use any of the Perl Monks Approved HTML tags:
- a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
| |
For: |
|
Use: |
| & | | & |
| < | | < |
| > | | > |
| [ | | [ |
| ] | | ] |
Link using PerlMonks shortcuts! What shortcuts can I use for linking?
See Writeup Formatting Tips and other pages linked from there for more info.