vit has asked for the wisdom of the Perl Monks concerning the following question:

This node falls below the community's threshold of quality. You may see it by logging in.

Replies are listed 'Best First'.
Re: crawling one website
by marto (Cardinal) on May 27, 2011 at 16:47 UTC

    See the answers to your previous question. If you have further questions let us know.

Re: crawling one website
by planetscape (Chancellor) on May 27, 2011 at 19:11 UTC
      Thanks!
      I tried hgrepurl.pl with and without parameters but it does not print anything.
      Could you give me a usage example without proxy.
        Actually I got:
        perl -W hgrepurl.pl http://www.senopt.com Subroutine Cwd::fastcwd redefined at c:/ActivePerl/site/lib/Cwd.pm lin +e 812. Subroutine Cwd::getcwd redefined at c:/ActivePerl/site/lib/Cwd.pm line + 812. Subroutine Cwd::abs_path redefined at c:/ActivePerl/site/lib/Cwd.pm li +ne 812 main::get_html() called too early to check prototype at hgrepurl.pl li +ne 27.
        Is it serious? What should I do?
        I also checked:
        perl -c hgrepurl.pl http://www.senopt.com hgrepurl.pl syntax OK