Yes, Perl is an excellent choice for this problem. You will need to look into one of the many HTML parsers available on CPAN, such as HTML::Parser or my personal fav, HTML::TokeParser.

Once you find the right 'keys' to extract the target info, i assume you will want to store them somewhere, such as a database. You will need to look into the DBI modules on CPAN.

If you are planning on using Perl to _convert_ each entire page into .asp - well, that's going to be a bit tougher. You are going to need some man hours any way you slice it. If the pages have a lot of commonality then the task will be easier, but you still need to plan this one out. My experience in the past with porting scripts and such is usually to issue a ton of clever Perl one-liner substitutions only to find that i still have to make some changes by hand.

Good luck, and keep it legal ;)

jeffa


In reply to (jeffa) Re: Find and replace by jeffa
in thread Find and replace by dvdauthority

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.