dws has asked for the wisdom of the Perl Monks concerning the following question:
I have a web log analysis script that threads together "sessions" and reports on them in blocks, so that I can get some idea of what paths individuals take when they visit one of my sites.
For my purposes, a session is a set of visits from a unique hostname/IP address, ending after a certain time has passed since the last visit. This works pretty well, except for people coming in through AOL proxies, where an actual session can be spread across different proxy servers. This blows the idea of using the hostname/IP address as a hash key for tracking ongoing sessions. I've tried a couple of techniques for threading these sessions together, including using the user-agent string to disambiguate simulataneous AOL sessions, but I'm not happy with the results.
Have you run into this problem and come up with a satisfactory approach? Or have you run across an article that deals with this problem? (I've checked merlyn's columns.) Thanks in advance for any insights/pointers you can provide.
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re: Threading together "sessions" from browser logs
by Zaxo (Archbishop) on Sep 20, 2002 at 18:18 UTC | |
by perrin (Chancellor) on Sep 20, 2002 at 19:04 UTC | |
by dws (Chancellor) on Sep 20, 2002 at 19:14 UTC | |
|
•Re: Threading together "sessions" from browser logs
by merlyn (Sage) on Sep 20, 2002 at 18:46 UTC | |
|
Re: Threading together "sessions" from browser logs
by sauoq (Abbot) on Sep 21, 2002 at 01:14 UTC | |
|
Re: Threading together "sessions" from browser logs
by particle (Vicar) on Sep 20, 2002 at 18:21 UTC | |
by dws (Chancellor) on Sep 20, 2002 at 18:31 UTC | |
|
Re: Threading together "sessions" from browser logs
by blm (Hermit) on Sep 21, 2002 at 17:33 UTC | |
by dws (Chancellor) on Sep 21, 2002 at 17:58 UTC |