Tag-Stripper is Insecure

Replies are listed 'Best First'.

(jeffa) Re: Tag-Stripper is Insecure
by jeffa (Bishop) on Feb 01, 2002 at 15:40 UTC

$foo =~ s/</&lt;/g;
[download]

jeffa

L-LL-L--L-LL-L--L-LL-L--
-R--R-RR-R--R-RR-R--R-RR
B--B--B--B--B--B--B--B--
H---H---H---H---H---H---
(the triplet paradiddle with high-hat)

[reply]
[d/l]

Re: (jeffa) Re: Tag-Stripper is Insecure

by japhy (Canon) on Feb 01, 2002 at 16:47 UTC

s/>/>/g

_____________________________________________________
Jeff[japhy]Pinyan: Perl, regex, and perl hacker.
s++=END;++y(;-P)}y js++=;shajsj<++y(p-q)}?print:??;

[reply]

(tye)Re: Tag-Stripper is Insecure

by tye (Sage) on Feb 01, 2002 at 16:41 UTC

jeffa++, I think this is a good idea for lots of reasons beyond fixing this particular problem.

tye

[reply]

Re: (jeffa) Re: Tag-Stripper is Insecure

by Matts (Deacon) on Feb 01, 2002 at 23:38 UTC

CERT's XSS

Luckily it's hard to find a browser vulnerable to this any more. But it's still something to watch for when trying to catch XSS vulnerabilities (the important thing here is to send the character set (encoding) along with the content-type header).

Also it's naive at best to suggest just allowing text. Most systems want to accept some form of HTML. The thing to do is make sure you do allowed tags, not disallowed tags. And never allow attributes. That's just asking for trouble.

[reply]

Re: (jeffa) Re: Tag-Stripper is Insecure

by mpeppler (Vicar) on Feb 01, 2002 at 19:22 UTC

Michael

[reply]

Re (tilly) 1: Tag-Stripper is Insecure
by tilly (Archbishop) on Feb 01, 2002 at 18:16 UTC

Why I like functional programming

[reply]

Re: Tag-Stripper is Insecure
by gav^ (Curate) on Feb 01, 2002 at 18:27 UTC

A HTML::Parser based tag stripper (like the one I posted here) can handle this.

You just need to make sure you escape the '>' and '<' in the text handler (as that is what the nested tags will be treated as).

I haven't seen any non-HTML::Parser tag strippers that don't have one problem or another.

gav^

[reply]

Re: Re: Tag-Stripper is Insecure

by crazyinsomniac (Prior) on Feb 02, 2002 at 03:16 UTC

tilly

Why I like functional programming

______crazyinsomniac_____________________________
Of all the things I've lost, I miss my mind the most.
perl -e "$q=$_;map({chr unpack qq;H*;,$_}split(q;;,q*H*));print;$q/$q;"

[reply]

Re: Tag-Stripper is Insecure
by dws (Chancellor) on Feb 01, 2002 at 22:08 UTC

The chief problem is HTML written thus: <<ILLEGAL_TAG>ILLEGAL_TAG>...<</ILLEGAL_TAG>/ILLEGAL_TAG>

  emit($1), next if m/\G([^<>&]+/gc;
  emit($1), next if m/\G(&\w+;)/gc;
  emit("&lt;"), next if m/\G<(?!<)/gc;
  # handle potentially valid REs here
  emit("&lt;"), next if m/\G</gc;
  emit("&gt;"), next if m/\G>/gc;
[download]

<ILLEGAL_TAG>...</ILLEGAL_TAG>

[reply]
[d/l]
[select]

Re: Tag-Stripper is Insecure (boo)
by boo_radley (Parson) on Feb 01, 2002 at 17:14 UTC

1 while {code to strip tags here}

[reply]
[d/l]