
I tried your code as well, bart, but it still combines some words, and now the output has & and &nbsp; between them. I don't know if this is because the non-breaking spaces are not tags, so they aren't removed, but how would I remove them and the &? Yahoo's clock code also shows up in the parser output; I'll see if it's within tags as well.
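For reference, here is a minimal sketch of one way these three symptoms might be handled with HTML::Parser's version 3 API. It assumes the page is already in a string; the helper name html_to_text and the sample markup are made up for illustration and are not from bart's code. The 'dtext' argspec asks the parser to decode entities such as &amp; and &nbsp; before handing text to the handler, ignore_elements skips script and style content, and joining the text chunks with a space keeps words from adjacent tags apart.

```perl
use strict;
use warnings;
use HTML::Parser ();

# Hypothetical helper (name is illustrative only): returns the visible
# text of an HTML string with entities decoded and whitespace collapsed.
sub html_to_text {
    my ($html) = @_;
    my @chunks;

    my $parser = HTML::Parser->new(
        api_version => 3,
        # 'dtext' passes the text with entities already decoded,
        # so &amp; arrives as "&" and &nbsp; as the character 0xA0.
        text_h => [ sub { push @chunks, shift }, 'dtext' ],
    );

    # Drop the tags AND the content of these elements, so JavaScript
    # such as a clock script never reaches the text handler.
    $parser->ignore_elements(qw(script style));

    # Deliver each run of text in one piece instead of arbitrary chunks,
    # so joining with spaces below cannot split a word.
    $parser->unbroken_text(1);

    $parser->parse($html);
    $parser->eof;

    # A space between chunks keeps text from neighbouring tags
    # (for example two table cells) from running together.
    my $text = join ' ', @chunks;

    $text =~ tr/\xA0/ /;      # decoded non-breaking space -> plain space
    $text =~ s/\s+/ /g;       # collapse runs of whitespace
    $text =~ s/^ //;          # trim leading space
    $text =~ s/ $//;          # trim trailing space
    return $text;
}

# Made-up sample input, not the Yahoo page from the thread.
print html_to_text(
    '<p>Tom&nbsp;&amp;&nbsp;Jerry</p>'
  . '<script>var clock = 1;</script>'
  . '<table><tr><td>one</td><td>two</td></tr></table>'
), "\n";
```

Joining with a space is a blunt choice: it will also put a space inside a word that is split by inline markup such as <b>, so a real script may want to add separators only around block-level tags.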

Thanks for your help.

Re: Re: HTML::Parser question
by Juerd (Abbot) on Mar 07, 2004 at 23:18 UTC

    You really need to learn Perl before using it. If you use HTML::Parser, first learn how HTML works and then how HTML::Parser works (all after learning Perl).

    Quoth Scott Walters:

    Perl programming requires three skills:

    • Knowledge of the syntax and features of the core language. The Beginning Perl thing is the path to that.
    • CPAN. Anything more complex than a few lines of Perl, go running to http://search.cpan.org/ and search. There is probably a module to do what you want to do.
    • Critical thinking. You're on your own there.

    No language can get rid of the need for critical thinking, though many languages downplay the importance of it, or even scoff at it.

    You have the CPAN thing figured out, but lack critical thinking and knowledge of the language.

    Juerd # { site => 'juerd.nl', plp_site => 'plp.juerd.nl', do_not_use => 'spamtrap' }

      I'm working on it, Juerd. I've got my learn-Perl-in-24-hours (not possible) book and have been trying my best since Feb 19th. Anyway, I thought perhaps you would have an answer to my parser question. I think I might go with HTML-strip instead, as it parses better, but it doesn't parse the whole site. Maybe you know why? As of now I haven't got any replies about that.

      Thanks.