in reply to Sanitizing HTML

You want to do the following:
  1. Parse a complex data structure
  2. Strip out the vast majority of it
  3. Do this with a minimum of resource utilization

In other words, you sound like my VP when we bring him options. "I want option A's features, option B's schedule, and option C's cost using the development staff needed for option D." This isn't an alacarte menu. If you want to tow a boat, you're not going to use a Pinto.

In other words, use the right tool for the job. Worry about optimization later, if ever.

Being right, does not endow the right to be rude; politeness costs nothing.
Being unknowing, is not the same as being stupid.
Expressing a contrary opinion, whether to the individual or the group, is more often a sign of deeper thought than of cantankerous belligerence.
Do not mistake your goals as the only goals; your opinion as the only opinion; your confidence as correctness. Saying you know better is not the same as explaining you know better.

I shouldn't have to say this, but any code, unless otherwise stated, is untested

Replies are listed 'Best First'.
Re^2: Sanitizing HTML
by JediWizard (Deacon) on Sep 29, 2004 at 13:47 UTC

    Don't you just love managment? You're description of your VP just reminds me of this.

    May the Force be with you
Re^2: Sanitizing HTML
by skx (Parson) on Sep 29, 2004 at 13:37 UTC

    It's not so much the efficiency I care about as the need to use two modules outside the Debian package repository as apposed to one that I might not have seen.

    If this is the route I have to go, fine, if there is something more self-contained and easy to use then I'd appreciate knowing about it.

    Steve
    ---
    steve.org.uk