in reply to Re: perl regex or module that identifies bots/crawlers
in thread perl regex or module that identifies bots/crawlers
Since I installed the simplistic testing, I've never had my load go over 1.0, even with my site's usual traffic of over 25,000 unique visitors a day. I have anywhere between 20-50 users doing searches at any given moment, according to my runtime logs.
The remnant bots that I don't check are not hurting, per se, but they are polluting my stats on what people search for. (I really want better data on what people are coming to my site FOR. The crawlers seem to be doing searches on random words.)
I also want to provide more options to searches, but because those would spin even more cpu cycles, I'd rather wait till I can really block out the cruft of these remnant bots.
I'm not concerned about blocking bots that are trying to mask themselves as normal users yet--they haven't presented themselves to be too much of a problem. I can sense illicit activity by monitoring when searches are done in a short timeframe (like within a second of the last one). That's a sure sign of a non-human, but I'd rather nip the problem in the bud if I can.
|
---|
Replies are listed 'Best First'. | |
---|---|
Re^3: perl regex or module that identifies bots/crawlers
by Sartan (Pilgrim) on Mar 20, 2007 at 23:40 UTC | |
Re^3: perl regex or module that identifies bots/crawlers
by CountZero (Bishop) on Mar 20, 2007 at 22:58 UTC | |
Re^3: perl regex or module that identifies bots/crawlers
by UnderMine (Friar) on Mar 23, 2007 at 10:24 UTC |