in reply to file substitution
I would use the Unix command, 'comm'.
-- TTTATCGGTCGTTATATAGATGTTTGCA
How?
comm requires its inputs to be sorted. I doubt that the OP would want his HTML files sorted lexically.
comm also works on whole lines, not on substrings embedded within longer ones.
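
To make both objections concrete, here is a rough Python sketch of the kind of merge walk that a comm-style comparison relies on. It is not comm's actual source; the function name common_lines and the sample data are made up for illustration, and Python compares strings by code point where the real tool follows the locale's collation order.

```python
def common_lines(a_lines, b_lines):
    """Return lines present in both lists, assuming both are sorted.

    Hypothetical helper: a single forward pass over two sorted lists,
    comparing whole lines only, in the spirit of comm's third column.
    """
    i = j = 0
    out = []
    while i < len(a_lines) and j < len(b_lines):
        if a_lines[i] == b_lines[j]:
            out.append(a_lines[i])
            i += 1
            j += 1
        elif a_lines[i] < b_lines[j]:
            i += 1  # line only in the first list
        else:
            j += 1  # line only in the second list
    return out


# Made-up sample data: an HTML page and a token we hope to find in it.
page = [
    "<html>",
    "<body>",
    "<p>Hello PLACEHOLDER world</p>",
    "</body>",
    "</html>",
]
wanted = ["PLACEHOLDER"]

# Even after sorting, the token is never found, because it is only a
# substring of a longer line and the comparison is line against line.
print(common_lines(sorted(page), sorted(wanted)))

# Unsorted input breaks the walk: "a" is in both lists but gets skipped.
print(common_lines(["b", "a"], ["a"]))
print(common_lines(sorted(["b", "a"]), ["a"]))
```

Sorting fixes the last case, but sorting an HTML file destroys the document order, which is the point above.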