NodeReaper has asked for the wisdom of the Perl Monks concerning the following question:

Replies are listed 'Best First'.
Re: HTML::Parser question
by matija (Priest) on Mar 07, 2004 at 19:00 UTC
    It's hard to know exactly what's happening without seeing your code. I see two possibilities:
    • First, check the unbroken_text setting.
    • Perhaps yahoo is putting in <br> instead of newlines (yuck!). If you're ignoring HTML codes, you wouldn't see them. Hmmm. I just checked: they're not. And there seems to be whitespace between their <p>, which I think you should be getting in your text handler routine.
    What does your text capture subroutine look like? Are you handling any HTML tags, or just the text?