How to pass output of "$token->as_is" to a variable in the following code to be able to strip out extra spaces before printing it on screen, also for doing other things with the output later as well.
I don't like to use extra modules to not make the code any bigger. so i prefer to use regex on the fly during the final stage.
the commented # are my retries so u can ignore it.
and I wonder why you guys output very small code font on this site, its very hard to read unless clicking on the download link which is not very comfortable to follow up on reading by keep clicking to display codes.
Thanks
use HTML::TokeParser::Simple; my $p = HTML::TokeParser::Simple->new(url => 'http://domain.com/?xxxxx +xx'); my $level; while (my $tag = $p->get_tag('div')) { my $class = $tag->get_attr('id'); next unless defined($class) and $class eq 'content'; $level += 1; while (my $token = $p->get_token) { $level += 1 if $token->is_start_tag('div'); $level -= 1 if $token->is_end_tag('div'); #$_ = s/<([\w-\:]+)>(.*?)<\/\1>/$2 /g; #print $_; next unless $token->is_text; #$cleaned = $token->as_is =~ s/\s{2,}/ /gs; # should remove ex +tra spaces #print $cleaned; print $token->as_is; unless ($level) { last; } } }
In reply to passing token output to a variable by SilverShadow
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |