PerlMonks |
Re: Extracting paragraphs from html
by merlyn (Sage) on Sep 11, 2005 at 16:50 UTC (#491072=note)
Use XML::LibXML in HTML-parsing mode, then use an XPath that looks for text() nodes that have a length greater than N.
-- Randal L. Schwartz, Perl hacker
update: See Locate large HTML paragraphs with XML::LibXML.
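
A minimal sketch of the suggestion above, not necessarily the code from the linked node. The sample HTML, the `$min_length` threshold of 40, and the use of `normalize-space()` to ignore runs of whitespace are all assumptions made for illustration; tune the threshold to your own pages.

```perl
use strict;
use warnings;
use XML::LibXML;

# Hypothetical sample page: a short navigation bar plus two paragraphs.
my $html = <<'HTML';
<html><body>
<div><a href="/">Home</a> | <a href="/about">About</a></div>
<p>This paragraph is long enough to count as real content, so the XPath below should select it.</p>
<p>Short.</p>
</body></html>
HTML

# Assumed threshold (the "N" in the reply); not a value from the original post.
my $min_length = 40;

my $parser = XML::LibXML->new;
$parser->recover_silently(1);    # tolerate sloppy real-world HTML without warnings
my $doc = $parser->parse_html_string($html);

# Select text nodes whose whitespace-normalized length exceeds the threshold.
my @long_text = map { $_->data }
    $doc->findnodes("//text()[string-length(normalize-space(.)) > $min_length]");

print "$_\n" for @long_text;
```

Filtering on text nodes rather than on `<p>` elements means the match does not depend on the page using paragraph tags consistently. The trade-off is that a paragraph broken up by inline markup such as `<b>` or `<a>` is split into several shorter text nodes, each of which is measured on its own.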