Hi Anonymous Monk,
Since you used the module HTML::FormatText, I am assuming you want the page rendered as plain text, not the whole HTML page with all its tags dumped as text.
If that is what you want, you can do it like so:
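use warnings;
use strict;
use HTML::TreeBuilder 5 -weak;   # -weak: the tree is freed automatically, no explicit delete needed
use HTML::FormatText;
# Fetch the page and parse it into a tree (new_from_url uses LWP::UserAgent)
my $tree = HTML::TreeBuilder->new_from_url("http://www.google.com");
# Render the tree as plain text between columns 3 and 50
my $format = HTML::FormatText->new(leftmargin=>3, rightmargin=>50);
print $format->format($tree);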
Output:
Search Images Maps Play YouTube
News Gmail Drive More »Web
History | Settings | Sign in
Nigeria
Advanced searchLanguage tools
Google.com.ng offered in: Hausa
Igbo Yorùbá Pidgin
Advertising ProgramsBusiness
SolutionsAbout GoogleGoogle.com
© 2013 - Privacy & Terms
NOTE:
- You will need the module LWP::UserAgent installed if you fetch the HTML from the web rather than from a stored file, since new_from_url relies on it to download the page.
- Note the use of the module HTML::TreeBuilder. If you already have the HTML in a file, you can use a different constructor, such as new_from_file.
- If you are on Linux, you can also get a similar result from lynx, like so: lynx -dump http://www.google.com
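To make the notes above a bit more concrete, here is a minimal sketch of the same conversion for the cases where you already have the HTML in a string or a file, or want to do the download yourself. The helper names html_to_text and fetch_html, the sample markup, and the file name page.html are my own inventions for illustration, not something from your script, so adjust them to taste:

```perl
use strict;
use warnings;
use HTML::TreeBuilder 5 -weak;
use HTML::FormatText;

# Hypothetical helper: convert a string of HTML into plain text,
# using the same margins as the example above.
sub html_to_text {
    my ($html) = @_;
    my $tree      = HTML::TreeBuilder->new_from_content($html);
    my $formatter = HTML::FormatText->new(leftmargin => 3, rightmargin => 50);
    return $formatter->format($tree);
}

# Hypothetical helper: download a page yourself with LWP::UserAgent,
# so that you can check for errors before parsing.
sub fetch_html {
    my ($url) = @_;
    require LWP::UserAgent;    # loaded only when a download is requested
    my $response = LWP::UserAgent->new(timeout => 10)->get($url);
    die "Could not fetch $url: ", $response->status_line, "\n"
        unless $response->is_success;
    return $response->decoded_content;
}

# Case 1: the HTML is already in a string (made-up sample markup).
my $sample = '<html><body><h1>Hello</h1>'
           . '<p>Plain <b>text</b>, please.</p></body></html>';
print html_to_text($sample);

# Case 2: the HTML is in a local file (file name is an assumption):
#   my $tree = HTML::TreeBuilder->new_from_file('page.html');
#
# Case 3: fetch it yourself, then convert:
#   print html_to_text( fetch_html('http://www.google.com') );
```

Doing the download separately, as in case 3, gives you a chance to report a failed request instead of formatting an empty tree.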
I hope this helps.
graff was right to ask for clarification about what you wanted done.
If you tell me, I'll forget.
If you show me, I'll remember.
If you involve me, I'll understand.
--- Author unknown to me