I was figuring on being able to report how many times a particular URL came up. I just changed my code to use a single hash, like this:
%all_links = ( URL_STRING => { occurances => undef, visit => undef, title => undef, content => undef, traversed => undef } );
Code-wise, I will check whether a URL needs to be traversed by looking for a positive value in the VISIT key (VISIT holds the value returned by the index function, which tests whether URL_STRING contains the constraining factors).
I would then need to traverse those pages that have a positive VISIT value and an undefined or (if I preset the value to -1) negative TRAVERSED value.
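A minimal sketch of how that selection could look, assuming the hash layout above. The URLs and $constraint are made up for illustration. One caveat: index returns 0 (not a positive number) when the match starts at the first character and -1 when there is no match, so this sketch tests for ">= 0" rather than "> 0".

```perl
use strict;
use warnings;

# Hypothetical constraint and link list, for illustration only.
my $constraint = 'www.example.com';
my @found = (
    'http://www.example.com/a',
    'http://other.example.org/b',
    'http://www.example.com/a',
);

my %all_links;
for my $url (@found) {
    $all_links{$url}{occurances}++;
    # index returns -1 when $constraint is absent, 0 or more otherwise.
    $all_links{$url}{visit} = index($url, $constraint);
    # Preset TRAVERSED to -1 the first time a URL is seen.
    $all_links{$url}{traversed} = -1
        unless defined $all_links{$url}{traversed};
}

# Pages that match the constraint and have not been traversed yet.
my @pending = grep {
    $all_links{$_}{visit} >= 0 && $all_links{$_}{traversed} < 0
} sort keys %all_links;

for my $url (@pending) {
    print "would traverse: $url\n";
    $all_links{$url}{traversed} = 1;    # mark as done
}
```

Because the selection keys off TRAVERSED rather than TITLE or CONTENT, a page is picked at most once even if it has no title or body.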
Note: Keying off the TITLE and/or CONTENT values might cause an infinite loop if a page has no title or body content (why a page would have no title or body is beyond me, but it's a possible issue).
I know I left some of these specifics out of the original post, so I humbly bend over and shout: "Thank you sir, may I have another!" ;-}
PS: Thinking ahead: can I increment the value of a key like this:
$all_links{URL_STRING}{occurances}++;
...or do I have to do this:
$occurances = $all_links{URL_STRING}{occurances} + 1;
$all_links{URL_STRING}{occurances} = $occurances;
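For what it's worth, a small sketch comparing the two forms on a nested hash. The URL and the $url variable are hypothetical. Note that a bareword such as {URL_STRING} is the literal string 'URL_STRING', so a real URL held in a variable would be written {$url}.

```perl
use strict;
use warnings;

my %all_links;
my $url = 'http://www.example.com/';    # hypothetical URL

# Short form: Perl creates the inner hash and the key on demand
# (autovivification), and ++ treats the undefined value as 0.
$all_links{$url}{occurances}++;

# Long form: same effect. The "|| 0" avoids an "uninitialized value"
# warning if the key has not been set yet.
my $occurances = ($all_links{$url}{occurances} || 0) + 1;
$all_links{$url}{occurances} = $occurances;

print "$all_links{$url}{occurances}\n";    # prints 2
```

Both forms leave the counter at the same value; the short form is simply less typing.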
TIA (again) ;)
======================
Sean Shrum
http://www.shrum.net