Slashdot Mirror


Google Juice

mpawlo writes: "I guess it is time to start using them bookmarks again, since favourite search engine Google seems to be on the verge of Altavista doom and search engine chaos. BBC News reports of Google bombing (often referred to as 'Google juice' by the infamous Crackmonkey subscribers). 'The users have found a way to "bomb" Google to improve the rankings of particular webpages, and ensure a site is near the top of the results for particular search phrases.' There is also the sport of Google Whacking affecting your search results."

26 of 357 comments (clear)

  1. How to Google Whack... by kenthorvath · · Score: 4, Funny
    Step 1: www.google.com

    Step 2: "autistic paraplegic donkey porn"

    Step 3: I'm feeling lucky

    Step 4: Google Whack

    1. Re:How to Google Whack... by jamie · · Score: 5, Interesting
      The idea of the sport is to type two words into Google and come back with exactly one page on the web that contains both of them. Not zero pages, not two, but one.

      To make the game challenging, you can't quote the words (making them into a phrase). And they both have to be actual English words which Google itself recognizes (I think Google uses dictionary.com's dictionary, so you can doublecheck yourself there).

      The more common the words, the cleverer the Googlewhack is considered to be -- few Googlewhacks use words you would consider "common."

      Try it yourself, just think up two obscure words and type them into Google. If you get zero hits, you're too obscure, try again. Much more likely that you'll get 5 or 10 or 20 or 150 hits, in which case your goal is simple: get more obscure, until that number gets down to 1!

    2. Re:How to Google Whack... by Peyna · · Score: 5, Interesting

      Assumming about 3,000,000 words in the English Language (http://www.wordorigins.org/number.htm), then we get about 8,999,997,000,000 possible two word combinations (correct that number if needed.) So, I would say it is some time before that is exhausted.

      There is also a somewhat easy to prevent google from picking up the posted whack: post it as an image.

      --
      What?
    3. Re:How to Google Whack... by friscolr · · Score: 5, Informative
      Try it yourself, just think up two obscure words and type them into Google.

      i've written a how-to on this; it's at http://www.blackant.net/other/random/how-to-google whack.php and repeated below for your convenience.

      HOW-TO GoogleWhack

      1) think of complex word, mispell it, and search dictionary.com for the misspelling.

      example:
      word: insullatory
      http://www.dictionary.com/search?q=insullatory

      2) look through dictionary.com suggestions for a very odd-sounding word, look at definition of word.

      example:
      word: inculcation
      http://www.dictionary.com/search?q=inculcation&r=3

      3) do google search for the word and for derivatives of the word.

      example:
      words tried: inculcation, inculcator

      4) choose word that has the least google returns

      example:
      inculcation: 14100 returns
      inculcator: 238 returns
      we choose inculcator.

      5) if the returns number more than 1000 for any word or derivative, go back to step 1.

      6) in the google returns for the word selected, look for an odd word in the returns, preferably one that is unrelated to the definition of the first word.

      example:
      words: inculcator, adepts

      7) do a google search for both words. if it has more than one return, go to step 5.

      8) submit your googlewhack

      example:
      words: inculcator, tablet.

      9) once you find one googlewhack, look at the page returned for more odd/awkward words. use these as potential new googlewhacks.

      using this method, i found a googlewhack in less than ten minutes (took me longer to write this up) and have repeatedly done so.

  2. Proves strength of Google by Stephen · · Score: 4, Insightful
    Really this proves how good the Google search algorithm is, because Googlebombing needs a coordinated effort from several popular sites. (The Google algorithm ensures that simply setting up several sites yourself doesn't help, unless other people point to them).

    Of course, as I'm all of the top three Stephen Turners already, I don't need to do this. :-)

    --
    11.00100100001111110110101010001000100001011010001 1000010001101001100010011
  3. Not as bad as all that by mblase · · Score: 5, Informative

    The users have found a way to "bomb" Google to improve the rankings of particular webpages, and ensure a site is near the top of the results for particular search phrases.

    Well, yes, but it's not easy. The article describes several dozen to several hundred bloggers working together to drive a certain word or phrase toward a certain URL. In other words, it takes a large, concerted effort to deceive Google's engine, and this fact alone provides reassurance that Google is working according to plan.

    Somewhere else, on this site, Scientology has been accused of using their large network of sites and members to do the same thing, driving searches for "Scientology" and related words to their own sites rather than those of debunkers. Again, this takes a large and concerted effort, which is a virtue of Google rather than a vice.

    Is Google on the verge of breaking because such a thing is possible? Of course not. But there are people powering the search engine on the back end, making improvements constantly in response to issues like this. And their cross-linking approach to ranking pages, while not perfect, remains the most reliable way yet found to judge a match's relevance.

    If it works correctly 99% of the time, and Google is constantly working on the last 1%, that still makes it better than anything else out there.

  4. Google has a long way to fall... by bmooney28 · · Score: 5, Insightful

    before I start using bookmarks as religiously as I had done before... Besides, the Google team seems to respond to new ideas (good or bad) like white blood cells responding to an infection... Companies have been attempting to boost their rankings on Google for years... yet, for the most part, they have been unsuccessful. I doubt seriously that this is by chance...

  5. Another article by Sludge · · Score: 5, Informative
    Here is a link that lets you actually know what the exploit is. Note that you need a lot of people and a lot of time to this.

    You can't simply go to www.google.com/bomb and drag a slider to move a URL up the listings. You have to actually have a concentrated effort. They talk about getting a webpage such as Geocities and getting your friends to do the same. It seems to me mass posting to bulletin boards would do the trick, unfortunately. There is even marketing software out there which posts your 'press releases' to hundreds of bulletin boards automatically.

    1. Re:Another article by MadAhab · · Score: 5, Interesting
      Nevertheless, this is the way google is supposed to work; it finds content on the web. If the content is BBSpam'd press releases, then the folks running the boards will figure out how improve the quality of their sites by bitchslapping spam out of their house. You can't blame Google for accurately finding crap. If a bunch of people with popular sites want to goof on their friend, they can. If they do it to an annoying degree, their sites will become less popular, fewer people will link to them, and they will lose their ability to influence rankings. Otherwise, google is just correctly reflecting the fact that a lot of people want to say that so-and-so is a talentless hack.

      It should be noted that direct links as advertisements could get a rebound under Google. Why pay for a link that bounds through another domain when you could have, say, Slashdot provide a direct link to your site and therefore give you a Google boost? Does anyone know if the link from a site gets you any Google boost if it clicks through, say, a redirect through doubleclick?

      --
      Expanding a vast wasteland since 1996.
  6. Corante article by ZigMonty · · Score: 5, Informative
    The BBC article is very short on details. I read the Corante article a while ago and it has some good info on Google Bombing (first use, effect, etc). Guess what? I used google to find it again. And it was the first link on the page. Seems to be working OK to me.

    If this really does start to get out of control, Google will adjust their techniques to work around the problem. I hope.

  7. Technical explanation on how this works .... by geirt · · Score: 4, Interesting

    ... on slashdot !

    --

    RFC1925
  8. Bad perhaps by baptiste · · Score: 5, Informative
    But the end of Google? I sincerely doubt it. Altavista and the others have been driven by greed since day one (ever look at license prices for Altavista for an Intranet in the late 90's?)

    Google has always seemed to be driven by a happy medium of civic duty and profit. Take their text ads - I love them - unobstrusive, get the point across, and NOT in teh main search results - they are clearly marked. So I expect that the geniuses @ Google will attack this problem and come up with a solution. SO yelling about Google's demise seems VERY premature.

  9. How to spam the web with links by brunes69 · · Score: 4, Insightful

    1. Write a perl script using an automatic comment generator to post comments to all your favoirte weblogs and blogs (Not as hard to generate seemingly relavant comments as you think!)
    2. Add Script to crontab
    3. Wait.

    As you can see, it's not that hard to spam the web with links to your site. Don't even count automated newsgroup posting, whch all gets indexed because of google groups.

    1. Re:How to spam the web with links by Codifex+Maximus · · Score: 5, Interesting

      Sounds to me like the fix for weblog bombing of Google *may* be a fairly easy thing.

      Google could use the same method of rating that they do now to raise the importance of pages to also demote weblogs in importance. A way may be found to determine if a page is a Weblog and take it out of the equation.

      Slashdot could be considered a weblog. Any page that allows a user to post a message with links embedded in it is a Weblog is it not?

      Let Google's Deja worry about the Weblogs and then the user can opt to include the extra results or not.

      --
      Codifex Maximus ~ In search of... a shorter sig.
  10. Re:Try using unique words by gazbo · · Score: 5, Funny
    From the article:
    He first used a bomb to ensure that whenever anyone typed the phrase "talentless hack" into Google they got the site of his friend Andy Pressman.
    ...replacing Jon Katz who had held the position previously.

  11. Re:User input could solve problems by Carmody · · Score: 5, Insightful

    That is to say, give users a free user account which could be used to give input on whats crap and whats not

    For the sake of the discussion, let us call the users who are giving input "moderators."

    As another poster mentioned, this system opens up a NEW can of worms, as spammers, idiots, and conservatives will use the system to call certain sites "crap", not because they are not relevant, but because they want the sites' listing to go down.

    So then people would demand that the "moderators" were overseen, perhaps by a system of "meta-moderators", and you see where I am going with this.

    --
    God is real unless declared integer
  12. vector based filtering by toothless+joe · · Score: 5, Informative

    In addition to other spam prevention methods, google uses complex matrix/vector filtering to ignore link circles. Basically, if (say) the same 100 different sites link to the same set of 20 other sites, and no one else links to them, Google will map them out and realize that they are all working in a concerted effort. That way if a spammer sets up 100 ostensibly independent sites and then links them all to his e-commerce sites, google will realize what he is doing and penalize his rankings for it. The only way that a spammer can 'bomb' google is if he gets a large array of other sites (for instance weblogs) that have significant traffic and link to other, different sites, as well as the ones that the spammer is trying to promote. The long-and-short of it is that a group of bloggers could bomb google with a large effort, but the average spammer would have to set up an incredibly complex web of interwoven pages that garner significant traffic to fool google. Even if large groups of spammers formed a cabal to promote their varied interests, it would likely be discovered by humans working at google. So, I'd put away that violin.

  13. The link that should have been in the story by lowy · · Score: 5, Informative
    Net analysis site Corante has explored the workings of Google bombs in depth.

    Here is the Corante article.

  14. Not quite as bad as it seems by alexjohns · · Score: 5, Insightful
    We've been talking about this on weblogs for a couple of months now. It's not as bad as it appears.

    Imagine you're the patriarch of a clan, and everyone in your clan has a homepage. All of your descendants' home pages have links to your home page, since you're the head dude. Your home page only has one word on it - say it's 'thrombosis'. Since Google bases the relevance of its search results on how many links there are to any page, any search for 'thrombosis' will likely show your home page as the number one search result, because you've got the word on your web page and dozens of links to your home page on other sites.

    Once you think about how Google's rankings work, you can easily figure out how to game the system. That's why Dave Winer (token head of all webloggers) is usually the first result of a search on 'Dave'.

    As far as googlewhacking is concerned, it's not as easy as it looks. Try 'parrhesia verboten'. I stopped once I found that one, proving to myself that it can be done. :)

  15. Heisenwhack by Plutor · · Score: 4, Funny

    This phenomenon is known as a "heisenwhack", after famed theorist Werner Heisenberg. A heisenwhack compensator has been developed, however. Adding the term "-googlewhack" to your search will fairly reliably eliminate these kind of hits.

  16. Some of you sound like Gollum by medcalf · · Score: 4, Funny

    Mmmm...Google my precious...musn't let the nasty bloggers get it, no, not my Google precious, no...

    --
    -- Two men say they're Jesus. One of them must be wrong. - Dire Straits
  17. This is an example of Google working! by sigmond · · Score: 5, Interesting

    This does _nothing_ to undermind the relevance of Google's rankings. When you perform a search on Google and the first "hit" is one that has been juiced in this way you are getting a hit that a larger number of individual sites, all of which are respected by other sites, agree is important to the subject. That is the beauty of Google.

    Yes, this effect can be choreographed, but the result is the same. All of the sites choreographed to achieve this result are voting that site A is relevant to subject B. If the sites involved consistently show bad judgment their ranking in Google are likely to decline and therefore their contribution to the Google ranking for subject B will lessen.

    The fact that a large number of highly ranked blogs can drive a URL up the Google pop-chart is evidence of both the respect blogs are given and the power of Google's algorithms to find such non-corporate backed content.

  18. This is a non-problem. by anser · · Score: 5, Insightful

    The value of a search engine lies in its ability to return usable results when you are actually looking for something. Most of the "exploits" people are discussing don't affect Google's usefulness as a search engine. (When is the last time you searched for "talentless hack" or, for that matter, "david gallagher"? Only someone already participating in the prank, or curious about it, would even know it existed.) And "Googlewhacking" is the most harmless of all - the only search results it can "affect" are its own, as listed winning word pairs lose their uniqueness at the next crawl. So what?

    Google folks are not stupid. If the integrity of searches that people really make is affected, they will change the code.

    In the meantime, is it really necessary to squelch every last bit of fun on the Net?

  19. If you type... by Anonymous Coward · · Score: 4, Funny

    If you type "Free Porn", then you can whack your google all you want!

  20. Anti-Semitic Advertisements on Google by sam_handelman · · Score: 4, Interesting

    A warning to those considering using Google's page ranking service (which tracks your surfing habits, which isn't a problem since it is very upfront about it.) Overall, it works pretty well and it has found several pages of genuine interest to me that I would not have found otherwise. Also, I have no reason to think that they're doing anthing sinister with the information (and I don't care.)

    However, since I like slashdot so much (I assume that is why) it's been serving up advertisements for other projects that link to SourceForge whenever I run google searches; for example, the white supremacist publication the Free Occident, which is powered by SourceForge.

    Now, I'm not one of those people who thinks Google should try and filter hate speech from search results. Likewise, I don't think that the Free Occident should somehow be prevented from using SourceForge's software - open source means open, Voltaire was right, etc. However, I think google should draw the line at serving advertisements for articles about how "If you hear about a 100-million-dollar swindle, then you know that it has to be a Jew."

    I've dumped a copy of the html for the search result in my journal - paste the Extrans into an html file to see it in close-to original format. It appears from the first version in my journal that the ad appears ABOVE the search results - this is not the case.

    Free Occident is a web log, but I find it far more worrisome that they've purchased an ad on google than if they were trying to blog some search term, like "White Power," or even "Occident."

    Yes, I'm Jewish.

    --
    The good and new comes from no quarter where it is looked for, and is always something different from what is expected.
  21. Another method by detritus. · · Score: 4, Insightful
    I discovered this yesterday in my searching. It appears that if you have a class C of IP addresses, lots of domains or subdomains, you can flood search results like this. Notice as you browse the results, the domains in many of the listings:


    www.office-supplies-st0res.com/ (66.33.85.157)
    office-storage.1nf0-office-equip.c om/ (66.33.85.74)
    pens-pencil.search-office-supplies. com/ (66.33.85.74)
    buy-furniture.furniture-sh0p-search .com/ (66.33.85.93)
    printer-toner.supplies-1nfo-office. com/ (66.33.85.99)
    office-product.office-supplies-sh0p .com/ (66.33.34.105)
    office-computers.supplies-1nfo-off ice.com/ (66.33.85.117)
    calculators.supplies-1nfo-office.c om/ (66.33.85.155)
    discount-office.supplies-1nfo-offi ce.com/ (66.33.85.95)


    If you look at the HTML source code (after clicking on one of these results from google.com), you can see it is obviously a deliberate measure to track it's referring URL and search keyword, and logs the results to bizrate.com. Stuff like this makes me furious, especially if you take into account the potential long-term costs. Google's spider has to waste traffic by going through these sites, searchers like me have to skip through a bunch of garbage results, resulting in more traffic. Sure, maybe a few kilobytes of data, but IMO, it contributes to the expenditures of search engines, eventually resulting in more ads, etc... Maybe i'm exaggerating a tad, but it's wasteful to say the least.