
Do not use regular expressions to parse HTML/XML. Assuming your input is indeed HTML, here's a possible solution using Mojo::DOM, based on my code here.

use warnings;
use strict;

print html_filter(<<'END_HTML', qw/pre strong i/), "\n";
aaa<pre>bbb</pre>ccc<www><i>ddd</i><strong>eee</strong>fff</www>ggg
END_HTML

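use Mojo::DOM;
# Return $html with all tags removed except those named in the
# remaining arguments; the contents of removed tags are kept.
sub html_filter {
    my $html = shift;
    my %allowed = map {$_=>1} @_;
    # Recursively copy the child nodes of $in into $out.
    my $walk; $walk = sub {
        my ($in, $out) = @_;
        for my $n ( @{ $in->child_nodes } ) {
            # text and CDATA nodes are copied over
            if ( $n->type eq 'cdata' || $n->type eq 'text' )
                { $out->append_content($n->content) }
            elsif ( $n->type eq 'tag' ) {
                if ($allowed{$n->tag}) {
                    # allowed tag: recreate it with its attributes,
                    # fill it with the filtered children, then append it
                    my $t = $out->new_tag( $n->tag, %{$n->attr} )
                        ->child_nodes->first;
                    $walk->($n, $t);
                    $out->append_content($t);
                } else { $walk->($n, $out) } # other tags: keep only the contents
            } # ignore other node types for now
        }
        return $out;
    };
    return $walk->(Mojo::DOM->new($html), Mojo::DOM->new)->to_string;
}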

__END__

aaa<pre>bbb</pre>ccc<i>ddd</i><strong>eee</strong>fffggg
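
As a rough illustration of why I advise against regexes here, consider the sketch below. The input string and the regex are made up for this example (they are not from the original question); the point is only that a pattern like [^>]* has no idea that a ">" inside a quoted attribute value does not end the tag, while a parser does.

```perl
use warnings;
use strict;
use Mojo::DOM;

# Hypothetical input: an attribute value containing a literal ">".
my $html = '<a title="x > y"><i>ddd</i></a>';

# Naive attempt: delete every tag except <i> and </i>.
# [^>]* stops at the ">" inside the attribute value, so the
# remainder of the <a ...> tag is left behind as stray text.
(my $naive = $html) =~ s{<(?!/?i\b)[^>]*>}{}g;
print "regex:     $naive\n";

# The parser finds the real end of the tag and the full attribute.
my $title = Mojo::DOM->new($html)->at('a')->attr('title');
print "Mojo::DOM: $title\n";
```

Feeding the same string to html_filter with i as the only allowed tag should avoid this problem, since the tags are located by the parser rather than by pattern matching.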

In reply to Re: perlre inverse check for several patterns by haukex
in thread perlre inverse check for several patterns by averlon
