Beefy Boxes and Bandwidth Generously Provided by pair Networks
Perl-Sensitive Sunglasses
 
PerlMonks  

comment on

( [id://3333]=superdoc: print w/replies, xml ) Need Help??

I was recently looking at the volunteer page at the Tor Project, and they have a piece of Perl, soat.pl, that needs a revamp. What it does is go to various web pages both via the open Internet and via a proxy to make sure that the proxy is not altering the pages on the way through.

There are things you could improve without changing the functionality. These include simple things like open without the requisite or die after it. It spawns a shell to md5sum instead of using Digest::MD5, wget instead of LWP::UserAgent, etc.

Functionally it could be improved by using HTML::Parser to look over the web pages and focus on parts that aren't expected to change (i.e., because of normal dynamic content or GeoIP differentiation). It could do with a config file or the judicious application of Getopt::Long (and the corresponding Pod::Usage).

This looks like low hanging fruit to me, but I might not be able to get the time to put into it. It would be a win for Perl evangelism because it might counter the idea that Perl "sucks at life", as the volunteer page suggests.


In reply to Re: Perl Project? by kyle
in thread Perl Project? by why_bird

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post; it's "PerlMonks-approved HTML":



  • Are you posting in the right place? Check out Where do I post X? to know for sure.
  • Posts may use any of the Perl Monks Approved HTML tags. Currently these include the following:
    <code> <a> <b> <big> <blockquote> <br /> <dd> <dl> <dt> <em> <font> <h1> <h2> <h3> <h4> <h5> <h6> <hr /> <i> <li> <nbsp> <ol> <p> <small> <strike> <strong> <sub> <sup> <table> <td> <th> <tr> <tt> <u> <ul>
  • Snippets of code should be wrapped in <code> tags not <pre> tags. In fact, <pre> tags should generally be avoided. If they must be used, extreme care should be taken to ensure that their contents do not have long lines (<70 chars), in order to prevent horizontal scrolling (and possible janitor intervention).
  • Want more info? How to link or How to display code and escape characters are good places to start.
Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others perusing the Monastery: (6)
As of 2024-04-19 10:17 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found