Beefy Boxes and Bandwidth Generously Provided by pair Networks
Perl Monk, Perl Meditation
 
PerlMonks  

comment on

( [id://3333]=superdoc: print w/replies, xml ) Need Help??
If you don't have $80,000 to buy Google's solution then the chances are very good you don't have the money to develop it either. So the question becomes, can you live with what you can build?

First off, you want to be able to "translate some pages". That's a pretty tall order. Are you at least planning to buy this piece or are you going to build this too?

Second, you need to get a clearer idea of how good the search results need to be. Google has the best algorithms in the business and they aren't publishing them! People have a pretty good idea how they do it but replicating it will take a lot of hard work and more than a few braniacs in the barn. I recommend you look at Glimpse to see what some really smart people have been able to do with quite a lot of time. Google it ain't, but it's not bad either. Maybe you can use it as a backend component the way I did.

Unfortunately the search engine I built went straight to /dev/null along with the company that paid for it. They never sold a single copy to my knowledge. Wasn't the 90s fun?

-sam


In reply to Re: Re: Re: Perl Search Applicance by samtregar
in thread Perl Search Applicance by PyroX

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post; it's "PerlMonks-approved HTML":



  • Are you posting in the right place? Check out Where do I post X? to know for sure.
  • Posts may use any of the Perl Monks Approved HTML tags. Currently these include the following:
    <code> <a> <b> <big> <blockquote> <br /> <dd> <dl> <dt> <em> <font> <h1> <h2> <h3> <h4> <h5> <h6> <hr /> <i> <li> <nbsp> <ol> <p> <small> <strike> <strong> <sub> <sup> <table> <td> <th> <tr> <tt> <u> <ul>
  • Snippets of code should be wrapped in <code> tags not <pre> tags. In fact, <pre> tags should generally be avoided. If they must be used, extreme care should be taken to ensure that their contents do not have long lines (<70 chars), in order to prevent horizontal scrolling (and possible janitor intervention).
  • Want more info? How to link or How to display code and escape characters are good places to start.
Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others romping around the Monastery: (4)
As of 2024-03-28 14:38 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found