r1n0 has asked for the wisdom of the Perl Monks concerning the following question:

Hello once again monks,

I am playing around with HTML::LinkExtor. It works beautifully, and I appreciate the good work by the author. I have one question though, and that is whether or not HTML::Linkextor has any problems interpreting a webpage such as the following:

<META HTTP-EQUIV = "Refresh" CONTENT="3; URL= http://www.some.org/some.html">


As you can see the page has added space and newlines thrown in to mess things up. Shouldn't HTML::LinkExtor handle this just fine?

Thank you in advance for your help.

Replies are listed 'Best First'.
Re: HTML::LinkExtor and meta http-equiv
by ikegami (Patriarch) on Mar 19, 2010 at 21:02 UTC

    LinkExtor doesn't appear to look at meta tags at all. Your issue has nothing to do with spaces or newlines.

    That said, I think newlines aren't allowed in HTTP headers, so that's not a valid value. I also question whether spaces are allowed around the "=" of HTML attributes.

      Thank you for the response. Is there a module that processes the meta tags? If not, when I come up with a working solution, I will post it for others.

      Thanks!
        You could add support to LinkExtor for this tag. It should be rather easy.