Slashdot Mirror


The Google Search Server

An anonymous reader submitted a reasonably indepth review of the Google search appliance. The guys from anandtech put it through it's paces, and included a variety of pictures and comments on one of those Google products most of us will probably never play with.

8 of 178 comments (clear)

  1. AnandTech not very search optimization saavy by DeadSea · · Score: 5, Informative
    The Mini considers any unique URL string to be a unique document, which makes sense (but is a bit surprising the first time that you run an index). After four hours of indexing, the Mini had managed to reach its document limit and we had to improvise.
    Anybody who doesn't know that search engines consider each url to contain a unique document does't know much about getting their site to be properly represented in search engines.

    Their solution was to create a list of urls for the appliance to crawl. If they had to do that for the search appliance, there is no way that googlebot, msnbot, or yahoo slurp is going to be able to properly index their site.

    Your public accessable urls need to managed and canonicalized through judicious use of robots.txt, 302 redirects, site wide linking, and just plain thinking out the layout of your site.

  2. Re:Neat insides by b0r1s · · Score: 4, Informative

    These are neat little boxes - we've managed 2 (the yellow appliance, and the blue mini appliance), and the performance of both was pretty nice.

    The tools google provides (very easy binary updates, strong web control panel, for example) turn the relatively common task into a dead-simple, point-and-click configuration.

    They even provide a decent interface for skinning the search pages, and while it's not perfect, it's certainly adequate for even the best looking sites on the internet.

    --
    Mooniacs for iOS and Android
  3. It's "its"! by dtmos · · Score: 5, Informative
    The guys from anandtech put it through it's paces

    It's really easy: It's "his", hers", and "its". Even a flower knows!

    --cycling through grammar Nazi mode. Please wait.

    1. Re:It's "its"! by radishes · · Score: 4, Informative

      and use its' when it's possesive

      john's coming to get johns' hat

      Don't listen to this guy. He has lied to you twice. 1) Its' is never valid. 2) The example with John is just so wrong it hurts. "John is coming to get John's hat." You use 's for possessive; s' is for possessive plural, like this: "Slashdotters tend to live in their parents' basement."

      --
      [ Reply to This | Parent ]
  4. Re:where's the raid? by slim · · Score: 5, Informative

    I guess if you want RAID, you pay more than $3,000.

    What you're really buying here is closed-source software, wrapped in the hardware that turns it into an "appliance". Assume $2,000 of that $3,000 pays for the software.

    By specifying the hardware in this way, and by keeping the BIOS and root passwords to themselves, Google greatly simplify their support role.

    This is common practice: an IBM HMC (Hardware Management Console) is a 1U PC with a custom Linux distribution and the management software preinstalled. You don't get the root password; you just use the software as delivered.

  5. For those who're interested... by Homicide · · Score: 5, Informative

    I admin a full blown Google Search Appliance, the mimi's big brother.

    If you want the specs:
    Dual Xeon 2.6GHz
    12GB RAM
    4 250GB HD's in RAID(something) with a hot-swap spare.

    Never tried taking off the cover though, since we want to keep the warranty.

    All of the money you pay is a license for the software on the box, the system itself is effectively free, so once the 2 year warranty expires, you've effectively got a nice powerful linux box for free. You can keep running the software, but without any support.

    As for performance, this thing works great, we have about 250,000 pages that it can index, both public and private (and it can do searches cleverly checknig username/pasword to see if you should have access to certain results), and we've had nothing but positive responses from our users. The results come up quickly, they're the results people want, and the results that management think should be at the top, are at the top.

    1. Re:For those who're interested... by Homicide · · Score: 4, Informative

      It submits a HTTP HEAD request for the URL to the server the page is on, with the username and password supplied, so the server at the other end decides if you should be able to see the search results, thus saving you from having to faff around telling the google box who can get to what pages.

  6. Don't use GET to modify application state! by Augusto · · Score: 4, Informative

    The problem is not google, is the way your app is designed!

    Universal Resource Identifiers -- Axioms of Web Architecture : Identity, State and GET

    In HTTP, GET must not have side effects.

    In HTTP, anything which does not have side-effects should use GET

    If somebody visited your site with a pre-fetching tool like the google web accelerator, you will also find the "delete" button being checked automatically like this. Change those deletes to use POST instead.

    --

    - sigs are for wimps.