I don't understand exactly what you're looking for.

Domain access count ranking data, processed by the attached script, in exactly the format specified :-)

Is it just a list of the most common sites UK residents are visiting?

Effectively yes, but to a depth that is not available commercially, i.e. accurate down past rank 100,000, which requires a user base in the tens of thousands.

How would that help a filtration program?

In and of itself it does not. What it does do is provide an ordered list of important domains, where "important" means people really go there and their browsers actually request pages from those domains. We have a dozen guys/gals who do little else but poke these domains into categories; currently we have a million domains classified into the 100 cats the filter uses. From the filtration point of view, knowing which sites are frequently visited has a direct impact on the performance of the product. Having a classification for http://please.someone.visit.my.site.com/ where no one ever goes is a waste of review effort and makes no difference to the end user experience.

Why would an ISP turn over such data?

In the past it has helped to be good friends with the sysadmins ;-) The crisp folding stuff has also been known to tip the balance.

It is rather broad and I can't see any real privacy concerns with it, but that won't be everyone's impression.

The data is distilled to a form that is totally anonymous, in that no user information or even specific page info is retained. The lawyers say this does not breach the Data Protection Act. Funny how you have to pay a small fortune to get to the same conclusion you came to in a few seconds... but that's lawyers for you.
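
To make the distillation concrete, here is a minimal sketch of the idea in Python. It is not the attached script, and the input format (a plain list of requested URLs) and the name rank_domains are assumptions made purely for illustration. Each request is reduced to its hostname, so user details, paths and query strings are thrown away, and the hostnames are then ranked by access count.

```python
from collections import Counter
from urllib.parse import urlsplit


def rank_domains(urls):
    """Return (domain, count) pairs ordered from most to least requested.

    Hypothetical helper: only the hostname of each URL is kept, so any
    credentials, path or query string never reaches the output.
    """
    counts = Counter()
    for url in urls:
        host = urlsplit(url).hostname  # None when no host can be parsed
        if host:
            counts[host] += 1
    # Highest count first; ties are broken alphabetically for a stable order.
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


if __name__ == "__main__":
    sample = [
        "http://a.example/page1",
        "http://a.example/page2?user=fred",
        "http://b.example/",
    ]
    for rank, (domain, count) in enumerate(rank_domains(sample), start=1):
        print(rank, domain, count)
```

A real proxy log would need its own line parsing before this step; the point is only that what comes out the other end is a ranked list of domains and nothing else.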

cheers

tachyon


In reply to Re: Re: OT: Looking for friendly UK ISP Sysadmins by tachyon
in thread OT: Looking for friendly UK ISP Sysadmins by tachyon
