Beefy Boxes and Bandwidth Generously Provided by pair Networks
laziness, impatience, and hubris
 
PerlMonks  

Re: Searching through a document and reporting results.

by mvaline (Friar)
on Jan 30, 2011 at 05:40 UTC ( [id://885099]=note: print w/replies, xml ) Need Help??


in reply to Searching through a document and reporting results.

The approach that occurs to me would be to put your keyword tests into a subroutine and use the each function to test the sentences in a way that makes the index / sentence number easily accessible.
while (($key, $value) = each @sentence) { if (has_one_or_more_keywords($value)) print "$key: $value\n"; }
I second the suggestion to consider an HTML parsing module. You may also want to consider replacing your sentence and word splits with a more sophisticated grammar for parsing sentences and words using a module like Parse::RecDescent. For example, the period character is not a sentence terminator when used in an ellipsis, as a decimal point in a number, etc.

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://885099]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others exploiting the Monastery: (5)
As of 2024-04-19 04:57 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found