Beefy Boxes and Bandwidth Generously Provided by pair Networks
go ahead... be a heretic
 
PerlMonks  

Re^3: grepping CPAN?

by marto (Cardinal)
on Oct 01, 2021 at 13:30 UTC ( #11137164=note: print w/replies, xml ) Need Help??


in reply to Re^2: grepping CPAN?
in thread grepping CPAN?

You really have to filter this now, there's just so much bloat, it's a far cry from fitting it onto a CDROM :)

Replies are listed 'Best First'.
Re^4: grepping CPAN?
by LanX (Sage) on Oct 01, 2021 at 13:35 UTC
    wasn't there recently a call to authors to delete old versions of their modules, or do I misremember ...

    Cheers Rolf
    (addicted to the Perl Programming Language :)
    Wikisyntax for the Monastery

      You remember correctly. However, some authors are unreachable and/or apathetic and the number of dists constantly increases as well.


      🦛

      That's not the issue, minicpan, unless you're doing something weird, should only pull back the latest releases required to build distributions. Over the years people have uploaded many modules, and some very large in the App space (including vast bundles of other software). Unless you configure it to ignore bloat then you won't avoid this, and even then I've come across legitimate modules that have a dependency on ACME modules (for 'test' data).

        How do you define bloat in a filterable way?

        (Did I miss a bloat flag in the meta files? ;)

        FWIW: For the purpose of this thread downloading only pure text like Perl code should be fine. (or just excluding any binary)

        Cheers Rolf
        (addicted to the Perl Programming Language :)
        Wikisyntax for the Monastery

        > Unless you configure it to ignore bloat then you won't avoid this

        For the aim of parsing all Perl&POD source locally I'd need to pull all text and ignore binaries and other "bloat" (to be defined) to save disc space.

        But this won't be faster in net-load, since AFAIK does filtering happen after downloading the full dist's tgz.

        Cheers Rolf
        (addicted to the Perl Programming Language :)
        Wikisyntax for the Monastery

        ) well probably avoiding extracting certain files from the tgz might speed up things a little tho.

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://11137164]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others drinking their drinks and smoking their pipes about the Monastery: (6)
As of 2022-01-22 17:58 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?
    In 2022, my preferred method to securely store passwords is:












    Results (63 votes). Check out past polls.

    Notices?