If your goal is text-only output, consider using lynx to dump the rendered page as text (its -dump option does this), or a command-line tool such as one of the many named 'html2txt'. The work has been done before, and unless you have to turn something in as your own work, let other people's work be your starting point.
A search for 'html2txt' in your favorite search engine will turn up many tools by that name, written by many authors for many platforms in a handful of languages. There are even a few in Perl, although I'd advise against plagiarism if you're doing homework.
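To give a rough idea of what such a converter does at its core, here is a minimal Python sketch that uses only the standard library's html.parser module. It is not the code of any particular 'html2txt' tool; the names TextExtractor and html_to_text are made up for illustration, and the whitespace handling is an assumption about what "text-only" should mean here.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect text nodes from HTML, skipping <script> and <style> bodies."""

    SKIP = {"script", "style"}

    def __init__(self):
        # convert_charrefs=True decodes entities such as &amp; into text.
        super().__init__(convert_charrefs=True)
        self.parts = []
        self.skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self.skip_depth > 0:
            self.skip_depth -= 1

    def handle_data(self, data):
        # Keep text only when we are not inside a skipped element.
        if self.skip_depth == 0:
            self.parts.append(data)


def html_to_text(html):
    """Return the visible text of an HTML string on a single line."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    # Collapse every run of whitespace into one space.
    return " ".join("".join(parser.parts).split())


if __name__ == "__main__":
    print(html_to_text("<p>Hello <b>world</b> &amp; all</p>"))
```

A sketch like this ignores layout entirely (no line breaks for paragraphs, no link or table handling), which is a good part of why starting from lynx or an existing html2txt is usually the better choice.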
In reply to Re: html to text file by mr_mischief, in thread subroutine modules by roc