It's hard to know exactly what's happening without seeing your code. I see two possibilities:
First, check the unbroken_text setting. If it is off, the parser may hand a single run of text to your handler in several pieces.
Second, perhaps Yahoo is putting in <br> tags instead of newlines (yuck!). If you're ignoring HTML tags, you wouldn't see them. Hmmm. I just checked: they're not. And there seems to be whitespace between their <p> elements, which I think you should be getting in your text handler routine.
What does your text capture subroutine look like? Are you handling any HTML tags, or just the text?
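For comparison, here is a minimal sketch of the kind of setup I have in mind. It assumes you are using HTML::Parser with the version 3 API; the sample HTML string and the @chunks array are made up for illustration, not taken from your code or from Yahoo's page. It turns on unbroken_text and also emits a newline whenever a <br> start tag shows up, so you can see both possibilities handled at once:

```perl
use strict;
use warnings;
use HTML::Parser;

# Collected pieces of text (hypothetical name, for illustration only).
my @chunks;

my $p = HTML::Parser->new(
    api_version   => 3,
    # Ask the parser to deliver each run of text in one piece.
    unbroken_text => 1,
    # Text handler: receives the decoded text of each run.
    text_h        => [ sub { push @chunks, shift }, 'dtext' ],
    # Start-tag handler: treat <br> as a line break.
    start_h       => [ sub { push @chunks, "\n" if $_[0] eq 'br' }, 'tagname' ],
);

# Made-up sample input; substitute the page you are actually fetching.
$p->parse("<p>first line<br>second line</p>\n<p>next paragraph</p>");
$p->eof;

print join('', @chunks), "\n";
```

If your text handler looks roughly like this and you still lose the line breaks, then the problem is probably somewhere else, and seeing your actual code would help.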
(This answer was moved from a comment on a node that is about to be deleted as a duplicate.)