This isn't specifically about Perl but since I'm sure some of you have worked with this, maybe you could help clear some things up.
I don't understand the logic behind site searches on how they work. When you type in a few words to search for, somehow the script will pull apart some pages and give you some results, right? How does it do that in such a timely manner?
My impression is for the search to work, it has to open each file (or node) and rip apart it's context THEN display the results. But if this were the case, how could it search all the nodes in a matter of seconds? Could it really open and read that many pages at one time?
It's really confusing to think of how Yahoo! pulls this off. They have millions of pages to search for but the results are still brought up in a matter of seconds. Can someone explain how searches are run?
Thank you, Wise Monks!
"Age is nothing more than an inaccurate number bestowed upon us at birth as just another means for others to judge and classify us"
sulfericacid
Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
Read Where should I post X? if you're not absolutely sure you're posting in the right place.
Please read these before you post! —
Posts may use any of the Perl Monks Approved HTML tags:
- a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
| |
For: |
|
Use: |
| & | | & |
| < | | < |
| > | | > |
| [ | | [ |
| ] | | ] |
Link using PerlMonks shortcuts! What shortcuts can I use for linking?
See Writeup Formatting Tips and other pages linked from there for more info.