Anonymous Monk has asked for the wisdom of the Perl Monks concerning the following question:

Hi guys, I am totally new to Perl. I wonder how I can make a search engine that will crawl through HTML files for hidden tags like "". I know this is easy for you guys, but I don't know where to start. Thanks in advance.

Replies are listed 'Best First'.
RE: searching for hidden tags into html files
by t0mas (Priest) on Aug 14, 2000 at 12:22 UTC
    Sure you can, this is Perl you're talking about :-)

    To give you a more specific answer we would need some more info, like:
    Are the files on a local server or out there on the web?
    Do the hidden tags always have a known format?
    Can they be multi-line?

    Some links to get you started:
    HTML Crawler
    How do I remove HTML from a string?
    matching comments


    /brother t0mas
Re: searching for hidden tags into html files
by ColtsFoot (Chaplain) on Aug 14, 2000 at 12:31 UTC
    For the crawling part, do read the link "HTML Crawler" above.

    Once you've identified the HTML you wish to parse, think about
    using HTML::Parser. You can override the method

    $self->comment($comment)

    to perform the required action when your hidden fields are encountered.
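
    A minimal sketch of that idea, assuming the hidden tags are ordinary
    HTML comments and that HTML::Parser 3.x is installed. The class name
    HiddenTagFinder, the "found" key and the sample HTML are made up for
    illustration, and the comment handler is wired up explicitly in new()
    rather than relying on the version 2 compatibility defaults.

```perl
use strict;
use warnings;

package HiddenTagFinder;    # hypothetical class name, not part of any module
use parent 'HTML::Parser';

sub new {
    my ($class) = @_;
    # Route comment events to our comment() method. "token0" passes the
    # text inside the comment, without the surrounding <!-- and -->.
    my $self = $class->SUPER::new(
        api_version => 3,
        comment_h   => [ 'comment', 'self,token0' ],
    );
    $self->{found} = [];    # assumption: collect every comment seen
    return $self;
}

# Called once per comment; replace the body with whatever action you need.
sub comment {
    my ($self, $comment) = @_;
    push @{ $self->{found} }, $comment;
}

package main;

# Sample input for illustration only; comments may span several lines.
my $html = <<'HTML';
<html><body>
<!-- hidden: keyword=perl -->
<p>Visible text</p>
<!--
  hidden: spans
  several lines
-->
</body></html>
HTML

my $finder = HiddenTagFinder->new;
$finder->parse($html);
$finder->eof;    # tell the parser there is no more input

for my $comment (@{ $finder->{found} }) {
    (my $flat = $comment) =~ s/\s+/ /g;    # squash newlines for display
    print "comment:$flat\n";
}
```

    For files on disk, $finder->parse_file($path) can be used in place of
    parse() and eof().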

    Hope this helps