in reply to Re^4: Super search use DuckDuckGo link broken
in thread Super search use DuckDuckGo link broken

I don't have this problem with org domains:

https://www.google.com/search?q=warnings+site%3Aperlmonks.org

This probably depends on the robots.txt settings and how the site was subsequently indexed.

Update

On a side note, my mobile Chrome can't use the site anymore when logged out.

So the choice of domain in the site-search has direct consequences.

Cheers Rolf
(addicted to the Perl Programming Language :)
see Wikisyntax for the Monastery

Re^6: Super search use DuckDuckGo link broken
by ikegami (Patriarch) on May 04, 2025 at 01:39 UTC

    IIRC, something was added to specify that perlmonks.org or www.perlmonks.org is the official domain, so it's no surprise you get better results from that one.

        According to the robots.txt specifications I found at Google, it's possible to exclude "orthogonal" pages like &displaytype=print or ;displaytype=edithistory with wildcards.

        Any reason not to add

        • Disallow: /*displaytype=

        to the list? (Untested)
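A rough way to sanity-check that rule before touching the live robots.txt is to emulate the wildcard matching in a few lines. This is only a sketch of how I read Google's documented semantics (`*` matches any run of characters, a trailing `$` anchors the end, everything else is a prefix match); the helper names and the sample paths are made up for illustration, and real crawlers may differ in details.

```python
import re


def robots_pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any sequence of characters,
    a trailing '$' anchors the end of the URL, and without '$'
    the pattern only has to match a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path, pattern):
    """Return True if the path (including query string) matches the pattern."""
    return robots_pattern_to_regex(pattern).match(path) is not None


RULE = "/*displaytype="

# Hypothetical sample paths, not taken from the real site.
SAMPLES = [
    "/?node_id=123&displaytype=print",
    "/?node_id=123;displaytype=edithistory",
    "/?node_id=123",
]

if __name__ == "__main__":
    for path in SAMPLES:
        print(path, "->", "blocked" if is_disallowed(path, RULE) else "allowed")
```

Under these assumptions both the `&displaytype=` and the `;displaytype=` variants are caught by the single rule, while a plain node URL stays crawlable.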

        Bing also suggests adding noindex meta tags to the pages.

        On a tangent

        Ideally robots would be presented with a page without nodelets, but I'm not aware of an efficient solution, except checking the user-agent before building the page.
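To make the user-agent idea concrete, here is a minimal sketch in Python (not the site's actual code, which I haven't looked at). The token list, `is_probable_robot` and `build_page` are all hypothetical names for illustration; the point is only that the nodelet builders are never called when the request looks like a crawler.

```python
# Substrings assumed to identify common crawlers; an illustrative,
# incomplete list, not an authoritative one.
ROBOT_TOKENS = ("googlebot", "bingbot", "duckduckbot")


def is_probable_robot(user_agent):
    """Guess from the User-Agent header whether the client is a crawler."""
    ua = (user_agent or "").lower()
    return any(token in ua for token in ROBOT_TOKENS)


def build_page(content, nodelet_builders, user_agent):
    """Assemble a page, skipping nodelets entirely for probable robots.

    nodelet_builders is a list of zero-argument callables, so the
    (possibly expensive) nodelet rendering only runs for human visitors.
    """
    parts = [content]
    if not is_probable_robot(user_agent):
        parts.extend(builder() for builder in nodelet_builders)
    return "\n".join(parts)
```

User-agent strings are trivially spoofed, so this only helps with well-behaved crawlers that identify themselves.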

        Cheers Rolf
        (addicted to the Perl Programming Language :)
        see Wikisyntax for the Monastery