in reply to (jeffa) 3Re: Problems splitting HTML in to hash table
in thread Problems splitting HTML in to hash table
Firstly, the reason I am looping like that is I'm reading a file in to an array, indexing the count until I find my target text, then I know the index from which I need to count to find what I'm after. There may be a more efficient way to do it, but for now I want it to work. : )
As far as my problem I've at least found it. For some reason the author of this particular page had put all their news headlines, links and text, on one long line. Now that I know that I *think* I can take it from there.
And you didn't sound grumpy at all. For now, I think I'm going to steer clear of the modules to practice my regexes as I'm still a bit rusty on some of the finer points of that. However once this thing is running, I will definitely look at the module aspect to see if I can shorten the code.
At present I've got 79 newspaper websites that I want to look at, but I've managed to pare it down to 19 rules so that isn't too bad I don't think.
Oh, btw, is the ESP::PSI module in the ACME section of CPAN? I think I could really use that for some *serious* debugging... ; )
Some people fall from grace. I prefer a running start...
|
|---|