Slashdot Mirror


Google's Search Results Degraded?

scrm writes "According to this Wired article, recent tweaks to Google's PageRank search algorithm have degraded rather than improved the accuracy of the results." I noticed this firsthand the other day, but only when I was searching for pictures of famous people, but all my technical queries came back fine.

204 comments

  1. Famous People? by inc0gnito · · Score: 4, Funny

    Heh, and just who are these famous people you had to have pictures of?

    1. Re:Famous People? by ceejayoz · · Score: 2
    2. Re:Famous People? by Lxy · · Score: 5, Funny

      More importantly, was the mature content filter off? :-)

      --

      There is no reasonable defense against an idiot with an agenda
      :wq
    3. Re:Famous People? by Rushuru · · Score: 2

      Probably pics of that guy so famous he appears on most slashdot polls.

      What's his name again?

      --
      !
      ^_^
    4. Re:Famous People? by Anonymous Coward · · Score: 0

      Heh, and just who are these famous people you had to have pictures of?P... and are any of them wearing clothes?

    5. Re:Famous People? by Anonymous Coward · · Score: 0

      This guy is pretty (in)famous

  2. Oh no... by joyoflinux · · Score: 0, Funny

    Those pidgeons must be slacking off...

  3. Yeah sure! by unixmaster · · Score: 5, Funny

    Old algorithm was better . See http://fantomaster.com/graphics/googhell.gif

    --
    Never learn by your mistakes, if you do you may never dare to try again
    1. Re:Yeah sure! by dahamsta · · Score: 0, Redundant

      I love the way you "copyright" a screenshot of Google. Muppet.

  4. actually ... by js995 · · Score: 5, Interesting

    The article suggests that many people are saying pagerank is working badly because they have lost their previous power to affect search results. Overall, the pagerank seems to have improved in this latest incarnation (IMHO)

    1. Re:actually ... by Lshmael · · Score: 2

      When I search heavily on Google (many different search terms) for somewhat common stuff, I find a lot of the pages are the *exact same* free hosting pages with random words on them, that now redirect to this one company's webpage, which had absolutely no useful information, but wants me to pay them anyway.

    2. Re:actually ... by NineNine · · Score: 1

      I agree. I just finally got the page rank that I deserve. I'm very happy with the change. They got rid of a LOT of intentional search engine spammers. Good riddens. Google is much better now, not worse.

    3. Re:actually ... by roddymclachlan · · Score: 5, Interesting

      Mmmm ... no one tries harder to influence google search than scientologists, they have countless different web sites and front groups all linking to each other to boost Google ratings. However shortly after the cult tried to censor Xenu (Operation Clambake) a Google search on scientology ranked Xenu top. Now it comes second rated, although you'd be hard pushed ot find it at all among the 2-million Scientology sponsored links on the results page ... (so be sure to add Xenu to your links page if you have one, you could save the life and wallet of some naive soul) ...

    4. Re:actually ... by zyklone · · Score: 5, Informative

      There is a company called Search44 which seems to have made this kind of stuff their living.

      They index lots of other sites pages and when google comes around spidering they return random content from them. If you follow one of these links from google you will be redirected to their portal.

      No doubt it gives them quite a bit of traffic.

    5. Re:actually ... by Blkdeath · · Score: 1
      I just finally got the page rank that I deserve. I'm very happy with the change.
      My site has been relegated to the bottom of the second page of results. Ho-hum.. It could be due to the switchover from 'free' dynamic hosting providers to my own domain name, or it could be because of a PageRank switchover; who knows? Such is the dynamic nature of the Internet, I suppose.

      Incidentally, does anybody have any sort of listing of search terms that turn up broken pages, porn/spam sites, etc..?

      --
      BD Phone Home!

      Shameless plug. Like you weren't expecting it.

    6. Re:actually ... by kirisu · · Score: 1

      My ranking got better, too. My site went from the bottom of page 4, to the top two on page 1 when you do a search by the name of the site.

    7. Re:actually ... by g_attrill · · Score: 1

      scientology.com was #1 long before the current Google update - I checked it at least two weeks before the current update.

      Gareth

    8. Re:actually ... by ces · · Score: 2

      If you ignore the sponsored links that's not so bad. 5 of the first 10 results are anti-Co$ sites and 3 of the first 5 are anti-Co$.

      --
      Happy Fun Ball is for external use only.
    9. Re:actually ... by Spy+Hunter · · Score: 2

      Try searching for cell phone ringtones or popular song lyrics sometime. Those guys have tricking google down to a science. Javascript redirects, fake URLs with keywords in them, you'll find it all. The sad thing is it seems to be working. It is obvious that it is very hard to get a high ranking in these kind of searches without using some sort of google-tricking mechanism.

      --
      main(c,r){for(r=32;r;) printf(++c>31?c=!r--,"\n":c<r?" ":~c&r?" `":" #");}
    10. Re:actually ... by richie2000 · · Score: 2
      No doubt it gives them quite a bit of traffic.

      "Come on, Slashdot me! Gimme your best shot! What, are you yellow? Chicken! I've got bandwidth coming out of my ears! Slashdot me! Just you try!"

      --
      Money for nothing, pix for free
  5. Quote ... by uq1 · · Score: 4, Funny

    "I noticed this firsthand the other day, but only when I was searching for pictures of famous people, but all my technical queries came back fine."

    Famous people Google Search: "Courtney Cox getting a facial"
    Results: "Courtney Cox advertises facial creme from Loreal"

    Technical Google Search: "How do best use lube and type one handed"
    Results: "www.pinkbits.com"

    1. Re:Quote ... by Anonymous Coward · · Score: 0

      I came up with this - not pinkbits. Perhaps you were typing one handed?

  6. Good Overview and Links to Discussion... by Carnage4Life · · Score: 5, Interesting

    ...available in Mark Pilgrim's blog

  7. My pagerank... by turnstyle · · Score: 1

    Well speaking for the only site whose PageRank I regularly keep track of (my site) the new algorithm hasn't changed all that much. I, for one, am now coming up a little higher.

    --
    Here's what I do: Bitty Browser & Andromeda
    1. Re:My pagerank... by Directrix1 · · Score: 1

      Oh, its definetely flawed then. Seriously, though isn't the efficiency of this system just a matter of oppinion? Also, thats what happens when you get rid of those pesky META tags. Comment on my sig, and I'll shoot you.

      --
      Occam's razor is the blind faith in the natural selection of least resistance and in universal oversimplification. -- EF
    2. Re:My pagerank... by turnstyle · · Score: 1
      "Seriously, though isn't the efficiency of this system just a matter of oppinion?"

      I absolutely agree. The people who go down will say that the new algorithm is worse, and the people who go up will say it's an improvement.

      In my case, I've been steadily going up for a while, and so the fact that I went up doesn't seem obviously related to a change in algorithm.

      For a while I've held the #1 spot for 'Andromeda' and I just went to #5 from #7 for 'MP3 server'.

      Personally, I can't tell the difference in searches for other sites.

      From what I understand, the change is supposed to help new sites, which seems like a good thing (unless it's easy to exploit).

      --
      Here's what I do: Bitty Browser & Andromeda
  8. Still the best! by iai · · Score: 0, Redundant

    Even with a degraded algorythm (if it is in fact degraded) it is still better than all other search engines IMO.

    -i@i-

  9. Pideon by joyoflinux · · Score: 5, Funny

    Those pideons must be slacking off...:)

    1. Re:Pideon by handsomepete · · Score: 0, Offtopic

      What a bizarre bit of slashdottery... you posted this exact same thing a couple posts down here and it was modded -1 offtopic, you misspell pidgeons and add a link and a smiley face and then it ends up +2 funny. Weird.

      Oh no... (Score:-1, Offtopic)
      by joyoflinux (thejoyoflinux AT yahoo DOT com) on Saturday October 05, @04:43PM (#4393071)
      (User #522023 Info)
      Those pidgeons must be slacking off...

    2. Re:Pideon by Anonymous Coward · · Score: 0

      Actually, he misspelled pigeon both times...

    3. Re:Pideon by Blkdeath · · Score: 4, Funny
      you misspell pidgeons
      ... and in a clever twist of anti-spelling irony (gee, this so rarely ever happens) you misspelt pigeons!
      and add a link
      Slashdot mods like links, is this news?
      and a smiley face and then it ends up +2 funny. Weird
      This is a happy place. A HAPPY, HAPPY HAPPY PLACE, DAMNIT!

      So for GOD'S SAKE MAN, SMILE!

      --
      BD Phone Home!

      Shameless plug. Like you weren't expecting it.

    4. Re:Pideon by handsomepete · · Score: 1

      And so did I. I thought that seemed funny. Whoops.

    5. Re:Pideon by joyoflinux · · Score: 1

      I guess some people didn't get it the first time, so I posted it again with a link...:)

    6. Re:Pideon by Anonymous Coward · · Score: 0

      It's a typo; get over it..

  10. Ssshhh... by bsDaemon · · Score: 0, Troll

    don't let anyone know, otherwise people will get wise to the fact that people buy their rankings on search engines, stop using them, they will lose "business", and will have to go back to being fair and objective and useful 100% of the time!! that's just plain un-American!!

    1. Re:Ssshhh... by targo · · Score: 2

      Gimme a break. See this link, it has nothing to do with buying rankings.

    2. Re:Ssshhh... by Reziac · · Score: 2

      I read it. I think the *concept* is great (ie. ensuring that how a site is indexed and where it ranks actually have some valid relationship to its content) but from symptoms both reported and that I've personally experienced, it does appear that the *implementation* went awry. Tho history suggests that Google will soon find and fix the problem.

      Any Google folk reading here today who'd care to comment on the situation??

      --
      ~REZ~ #43301. Who'd fake being me anyway?
  11. Working just fine by Anonymous Coward · · Score: 0

    JLH

  12. other search engines/ They all need to get better by acomj · · Score: 5, Interesting

    google still seems the best. Sometimes I use teoma or lycos because they give different top results. Being tied to one search engine seems bad as you miss alot.

    I had an instructor point us to a page on networking that was amazing good but not found on any of those 3 search engines, at least in the top 30. Most of those top 30 hits wern't very good either.

    Maybe yahoo has it right, The web should be indexed by people.

  13. Hype hurts by jetlag11235 · · Score: 4, Insightful

    Regardless of whether or not the changes have degraded the service Google provides, unless Google (quickly) addresses this problem to the (at least superficial) satisfaction of people, it will hurt Google.

    AlltheWeb.com must be soaking this up with glee.

    -- jetlag --

  14. Re:Vanity, thy name is me by handsomepete · · Score: 1

    Strange, I would think "Coward" would come up with quite a few results before coming up with your personal page. That is, unless you're the owner of the Noel Coward society.

  15. Worked OK for me... by ackthpt · · Score: 0, Offtopic
    I was just looking up "Droit de Seigneur" and it popped urban legends and a few other helpful sites.

    Seems it was rarely if ever imposed, and I was sooo looking forward to being a King someday.

    BTW, this term used in Terry Pratchett's works, "Wyrd Sisters" and "Lords and Ladies" He's go another work due in Novemeber, but the US cover art looks like crap, again, so I'll be ordering Night Watch from the UK.

    --

    A feeling of having made the same mistake before: Deja Foobar
  16. Re:other search engines/ They all need to get bett by jafiwam · · Score: 5, Informative

    Yes, the web should be indexed by people.

    But how about not one that requires sites to pay to get in? (Yahoo)

    At Google, go to the "Directory" tab, or go to DMOZ.org (Open Directory) itself. DMOZ is bigger, better organized, has fewer broken links, no ads, and is built by hand by people who know their categories and are interested in keeping them linking only to sites with meaningful content.

    Semi-mindles search spiders are not all there is in finding stuff on the Internet.

  17. Goatse.cx no longer in googles search results by Anonymous Coward · · Score: 1, Funny
    1. Re:Goatse.cx no longer in googles search results by blibbleblobble · · Score: 3, Insightful

      F.F.S.: I wouldn't be surprised if Slashdot showed up top result for goatse.cx, the number of people linking to it from here...

    2. Re:Goatse.cx no longer in googles search results by Billy_D_Goat · · Score: 0, Troll

      Oh no! I'll never be able to find that picture again! My world is collapsing!

    3. Re:Goatse.cx no longer in googles search results by LegendLength · · Score: 1

      Slashdot is in 31st place for a google search of 'goatse.cx'.

      Maybe this shows that google really doesn't give a link more weight if it is repeated often on a popular site.

    4. Re:Goatse.cx no longer in googles search results by autocracy · · Score: 3, Informative
      Anybody read the URL before clicking on it? http://www.google.com/search?q=goatse& sourceid=cmdrtaco_has_10_gigs_of_kidde_pr0n

      I wonder what the folks at Google are thinking when that shows up as a high-ranking source of traffic... I also wonder why somebody felt that was neccesary to even put. Ah well.

      --
      SIG: HUP
    5. Re:Goatse.cx no longer in googles search results by __aaklbk2114 · · Score: 1

      I wonder what the folks at Google are thinking when that shows up as a high-ranking source of traffic... I also wonder why somebody felt that was neccesary to even put. Ah well.

      To see if ./ will censor it...

    6. Re:Goatse.cx no longer in googles search results by almightyjustin · · Score: 1

      This has always been true...check out http://goatse.cx/robots.txt

      --

      Omnes arx vestrum sunt adiuncta nobis.

    7. Re:Goatse.cx no longer in googles search results by Anonymous Coward · · Score: 0

      Did anyone besides me notice category link on the google page was to scientology?

      What a cowinkydink!

      http://directory.google.com/Top/Society/Religion _a nd_Spirituality/Opposing_Views/Scientology/?tc=1

    8. Re:Goatse.cx no longer in googles search results by kyletinsley · · Score: 1

      No. Slashdot tells google not to spider discussion pages.

      Bullshit. They just don't get indexed until the discussions are archived (older than 2 weeks I think). See here (search for "Microsoft" matches within slashdot.org).

  18. Who cares by LegendLength · · Score: 1
    Pilgrim, who earns his living as a Web accessibility consultant, said in a phone call [...] "And 404's in the top 10? Hello? Pages that are completely blank -- how did those get in there?"

    This seems to be the most evidence presented in the article. Looks like baseless whining to me.
  19. Google Classic? by Anonymous Coward · · Score: 2, Funny

    We need the original-flavored Google back. Maybe a cache of Google linked from Google?

  20. Don't Panic by targo · · Score: 4, Insightful

    This is most probably not intentional.
    There are glitches in every complicated software. Maybe they were trying out some new algorithm that wasn't completely refined yet. Maybe it was a random off-by-one bug that has been already fixed. Shit happens all the time, Google is no different.

    There will probably be many people who try to see a conspiracy theory behind this and say that Google has sold out.
    This is very unlikely. The nature of the described flaw suggests that all queries are affected. Now why should they skew the results of everything to appease a single entity who might have given them some money? That just doesn't make sense.

    1. Re:Don't Panic by Anonymous Coward · · Score: 0
      This is most probably not intentional.


      Whew! I was thinking they degraded their results on purpose.

    2. Re:Don't Panic by thogard · · Score: 1

      This is most probably not intentional.

      Maybe it is. There are several advantages of this. The 1st is it convinces people to look past page 1. This is a good thing from Googles point of view. It also convinces people to create better search terms when they are looking for things. I don't see that as a bad thing either or else Google could just come up with a 10 link static page for each the 20,000 most common english words and everyone could go home. Many of the google searches to my pages are single word searches. People need to learn type two words in the box and maybe even use thouse qote things that are hiding on their keyboard. The third advantage is it forces the spamsites to change. Remember they have last month data and they will have next months data. A quick compare might show a large number of sites that should never be crawled again. I'm willing to look at page 2 if it means better searches in a few months.

      This new page rank could only be running on part of their server farm. I've noticed recently that the same complex search will offten give different results just 10 minutes apart.

      Then again it could just be a bug.

  21. Dmoz is King by DeadSea · · Score: 5, Interesting
    Folks in the forums at webmasterworld speculate that google is putting the most weight on words that are found in the title of the site and in the listing of the site on the open directory project.

    We who are editors at dmoz hold a lot of power right now. Its time for you to share in some of that power. Head over to dmoz and apply to edit your favorite category.

    Can't decide where to apply?

    1. Re:Dmoz is King by great+throwdini · · Score: 2, Interesting

      We who are editors at dmoz hold a lot of power right now.

      I fear that is all too true. I have encountered Google entries ranked "top five" in result sets of greater than 10,000 members where: (a) the search term(s) are not present in the document; (b) no other Google-indexed document links to the one in question; (c) sole use of the search term(s) is in a DMOZ listing of the document.

      It would seem that getting listed on DMOZ can really affect Google "placement" -- I cannot determine whether this is the result of simply weighting DMOZ or due to a "multiplier effect" as Google indexes all those many, many sites who utilize the RDF generated by the Open Directory Project.

      Then again, Google top-ranks a Web site that hasn't been on the Web in over a year for one query I run with frequency. I don't think I can point the finger at the ODP for that one.

    2. Re:Dmoz is King by danpbrowning · · Score: 2

      The only thing I don't understand about Dmoz is why "Gay, Lesbian, and Bi-Sexual" is in every single top category. Do they have it in for the heterosexuals or something? Do they think that sexual preference is so much more important than race, creed, gender, etc. that it should enjoy its own category, while the others don't?

      That's why I prefer yahoo. :-q

      --
      Daniel
    3. Re:Dmoz is King by Anonymous Coward · · Score: 0
      Folks in the forums at webmasterworld [webmasterworld.com] speculate that google is putting the most weight on words that are found in the title of the site

      I believe this is true because are recently as last week, microsoft.com appears when you type "go to hell".
    4. Re:Dmoz is King by Anonymous Coward · · Score: 0

      what the hell are you talking about? what is wrong with that type of person and what the hell is wrong if there are distinct sites for them?

    5. Re:Dmoz is King by Anonymous Coward · · Score: 0
      what is wrong with that type of person

      gay = wrong
      Hello?? Haven't had your morning coffee I assume.

  22. "Dead pages" complaint is real by Reziac · · Score: 5, Interesting

    Whether the rest of the article and of Google's changes are simply causing a rash of sour-grape whining or not, one thing I did notice when I used it yesterday: for a current topic of major interest at least to its part of the world, I got a helluva lot of dead links and blank pages ("Document contains no data" and when I checked, sure enough, it was just the HTML and /HTML tags, with no content). This did strike me as unusual not to mention annoying. More to the strange, none of these "dead pages" were in Google's own cache.

    (I still haven't found what I was looking for :( and no, it's not pr0n :)

    --
    ~REZ~ #43301. Who'd fake being me anyway?
    1. Re:"Dead pages" complaint is real by alienw · · Score: 1

      and tags are what mozilla shows when it doesn't get anything from a server. It looks more like connection problems on your end than anything else. However, it seems to me like these pages are cgi scripts designed to spit out the search terms that people find them with to increase their search ranks. A lot of them simply redirect to other sites when you click.

    2. Re:"Dead pages" complaint is real by Reziac · · Score: 3, Informative

      This isn't Mozilla, it's Netscape 3.04 which is less prone to make things up -- when it gets nothing from a server, it usually whines "NS is unable to find the file named blah-blah".

      Tho I did try in Mozilla as well, same result. THEN I checked docsource in both browsers, and saw what I noted in my previous post.

      The live pages on the SAME server were behaving normally in Netscape, so it's clearly not a connexion problem (in NS, that usually produces the "connexion reset by peer" complaint). When I went to their root site and tried looking thru their links and site search, it appeared that the entire branch I'd hoped to read had been deleted (anyway, no reference to it anywhere in sight) -- assuming it ever existed!! There was no scripting involved, and no reason for anyone at this site to give a damn where they're ranked.

      If you're wondering, I was looking for historical maps incrementally showing the shoreline for Devil's Lake, North Dakota, to supplement what I've found in ancient atlases.

      --
      ~REZ~ #43301. Who'd fake being me anyway?
    3. Re:"Dead pages" complaint is real by Gekko · · Score: 1

      Also happens when you get a gzipped compressed stream from a server. Mozilla can't handel it and goes booooom. I hate the fact that IE can handel it. O well I traded off a slight performance gain in my PHP to make it accesiable to all.

      --
      I mod down any one who says "I'm sure I will get modded down for this"
    4. Re:"Dead pages" complaint is real by great+throwdini · · Score: 4, Interesting

      I got a helluva lot of dead links and blank pages.

      Definitely so - check the first result. This is nothing new. For the referenced query, the faulty top-ranking has existed for a long time, though the site in question hasn't existed for over a year. I've even written support a number of times (blatant errors such as these are shameful). Google is far from perfect.

    5. Re:"Dead pages" complaint is real by Anonymous Coward · · Score: 0

      But the next few pages are full of good hits. So is your point that people never check anything but the first result, or that google is not allowed to have 1 page out of several dozen be a bad hit?

    6. Re:"Dead pages" complaint is real by great+throwdini · · Score: 2

      But the next few pages are full of good hits. So is your point that people never check anything but the first result, or that google is not allowed to have 1 page out of several dozen be a bad hit?

      My point is that Google decidedly indexes and top ranks "dead" sites. Check that. *Long* dead sites. I can accept a dead entry for a recently pulled site. I might even be tempted to accept a dead entry for a site pulled several months ago should it be ranked deep within a given set of results. But to top rank a site gone for twelve months or more points to exceedingly flaky behind-the-scenes behavior on the part of Google. Whether results [2 .. n] are valid has no bearing on my position.

      As to whether people only check the first result: Judging from traffic to my sites through Google and the corresponding ranking of them over time, it would seem that people do frequent top-most entries more than bottom-most. That's common sense. Regardless, I would hazard the guess that leaving a "first" entry broken for so long collectively wastes many people's time. It certainly doesn't put a best foot forward in quelling concerns about the relevance of Google search results, does it?

    7. Re:"Dead pages" complaint is real by Anonymous Coward · · Score: 0

      I just wrote them about it as well. Hopefully this will help you. :)

    8. Re:"Dead pages" complaint is real by Anonymous Coward · · Score: 0

      Resubmit the bad page on their link to "add this page"... if its not there, it disappears quickly from the index.

    9. Re:"Dead pages" complaint is real by Reziac · · Score: 2

      I hadn't really thought about it til I read your replies, but I hardly ever click the top result (unless I was simply looking for a site root, not some specific page), since usually I'm looking for something slightly different from what it coughed up, and what I really wanted is more typically found about halfway down the first page of results. I've become so used to this that I tend to start reading results in the middle of the page!

      As to indexing dead links, since the alternative of finding a dead page in Google's cache is useful, I can understand doing this for that purpose. However -- perhaps Google should INDICATE known dead links, so we don't waste our time chasing them, and can immediately go to the cached page instead.

      --
      ~REZ~ #43301. Who'd fake being me anyway?
    10. Re:"Dead pages" complaint is real by great+throwdini · · Score: 2

      I hadn't really thought about it til I read your replies, but I hardly ever click the top result.

      Sounds like a variant of "banner blindness" ... Google Goggles?

      As to indexing dead links, since the alternative of finding a dead page in Google's cache is useful [...]

      Ah, but the cited example doesn't offer a cached version. So much for that.

      Google cache is only beneficial in cases where the page no longer exists or cannot be accessed. Neat and all, but of use to only a small portion of users, I would think. For sites which update frequently (given the general lag in Google indexing), a cached version is detrimental, since the content would not accurately reflect the "true" site. The benefit in other cases is broadly questionable, as "dead" sites are (usually) pruned from Google's index, along with cached contents.

      Apart from Slashdotted servers where Google already indexed the content (not likely for new documents), the window of utility is limited -- assuming that people can keep their servers up. I would suggest that archive.org serves the same purpose nicely, but that collection has its own troubles.

      Perhaps Google should INDICATE known dead links.

      Perhaps Google shouldn't index dead links at all. There's geek chic in placing document copies in cache, but that's about it. Google only caches the document, not the images or other embedded content, or referenced stylesheets or script elements. For many, many sites (unfortunately) much is lost in retrieving only the base document. This again reduces utility.

      Favoring inclusion of "dead" documents will quickly lead to an immense repository of this and that, some live and some not. Sounds an awful lot like archive.org and I don't really understand why Google would even bother. Indexing dead pages (cache or no cache) only clutters up results and confuses what one is actually searching.

      Flagging the dead from the living isn't a whole solution, but it's a start. Although, were Google to team up with archive.org to permit cross-linking and separation of duties (live Web / dead Web) that might make a bit more sense (to me). I doubt that will happen, given the ties of archive.org to another search technology.

    11. Re:"Dead pages" complaint is real by Zagadka · · Score: 2

      The first result to that query when I tried was http://www.win.tue.nl/~kroisos/roguelike.html, which does appear to be up...

    12. Re:"Dead pages" complaint is real by great+throwdini · · Score: 2

      The first result to that query [...] does appear to be up...

      You're posting two plus days, one more email to support (see thread), and a direct removal request (see thread) by me later ... they did finally kill the errant entry. One year after it was removed from the 'Net, but still it's gone.

      One small step for...

  23. I noticed this myself by saginaw · · Score: 2, Informative

    I had been searching for information on US bills that are still in circulation, and I found that google did a poor job in finding what I was looking for. The catch is that I did the same search about two months ago, and I got all kinds of decent sites from google.

    1. Re:I noticed this myself by /dev/trash · · Score: 1
  24. They Kill PageRank to protect Microsoft by registro · · Score: 2, Informative

    Well, so much for PageRank. Im afraid this is whats going on: Link base PageRank, like 1 person per vote democracy, was turning dangerous for the few with enought resources to put some pressure at google. Lest see how they solved it. Problem: http://www.google.com/search?q=%22go+to+hell%22 Microsoft was #1 at "go to hell", thanks to popular vote like linking. AOL was #3, Disney #4. Solution: http://www2.google.com/search?q=%22go+to+hell%22 They are no counting any more Anchor links from "no-authoritative" web pages. Joe Doe pages dont count any more. Only Anchor links coming from pages oficialy "recogniced" at big sites whit a superior page rank to start with, or directories like Dmoz or yahoo count now. If this is the case, we can say PageRank is DEAD. From now on, big corporation marketing rules over popular choice. take this as a example: http://www.google.com/search?q=correo+gratis "correo gratis" is spanish for "free mail". Hotmail was #1. Now, at www2, is nowhere to be found. Hotmail is pagerank 9, and hundreds of spanish web pages where pointing at it as "correo gratis". Now is not, but is still #1 if you look for "free mail". Why? joe doe pages dont count, hundres of spanish users linking at it dont count any more. Only the "official", msn network pages count now, plus the few Dmoz pages pointing at it using that text as links. Most of those pages happen to be English Only, so only the english version of the query survives. Oh well. .

    1. Re:They Kill PageRank to protect Microsoft by Peyna · · Score: 1

      Must be how I moved up in some search results. My page is linked from about 20 slashdot pages, so I must be pretty popular. (since it is linked on every slashdot post I make)

      --
      What?
    2. Re:They Kill PageRank to protect Microsoft by Anonymous Coward · · Score: 0

      Just another example of a couple people ruining something for everyone.

    3. Re:They Kill PageRank to protect Microsoft by TKinias · · Score: 2, Insightful

      registro wrote:

      From now on, big corporation marketing rules over popular choice. take this as a example: http://www.google.com/search?q=correo+gratis "correo gratis" is spanish for "free mail". Hotmail was #1. Now, at www2, is nowhere to be found. Hotmail is pagerank 9, and hundreds of spanish web pages where pointing at it as "correo gratis". Now is not, but is still #1 if you look for "free mail". Why? joe doe pages dont count, hundres of spanish users linking at it dont count any more. Only the "official", msn network pages count now, plus the few Dmoz pages pointing at it using that text as links. Most of those pages happen to be English Only, so only the english version of the query survives.

      This doesn't make sense. The current top result on a search for `correo gratis' is LatinMail - Tu Correo Gratuito En Español. It seems to me that the listings have gotten more accurate, not less, if my search in Spanish for free mail returns a Spanish company instead of a Yankee one (Hotmail is Microsoft, in case you weren't aware). LatinMail belongs to eresMas, who are headquartered in Madrid.

      --
      In principio creauit Linus Linucem.
    4. Re:They Kill PageRank to protect Microsoft by registro · · Score: 1

      Hi, >This doesn't make sense. The current top result on a search for `correo gratis' [google.com] is LatinMail - Tu Correo Gratuito En Español >[latinmail.com]. It seems to me that the listings have gotten more accurate, Not at all. Latinmail have arround spanish 500.000 accounts. Direccion.com (#3) has 300.000 users. Hotmail has more than 2.000.000 spanish accounts.

    5. Re:They Kill PageRank to protect Microsoft by Anonymous Coward · · Score: 0


      So exactly how does pushing Hotmail way down the list protect Microsoft?

    6. Re:They Kill PageRank to protect Microsoft by registro · · Score: 1
      > So exactly how does pushing Hotmail way down the list protect Microsoft?

      That is a collateral damage. That is what happens when you try to fix something that is no broken (Popular linking), to protect a big corporation from same public embarrassment (Microsoft, AOL, Disney, and yes, Google). The system becomes unstable, you get weird results, and the quick hack ends up causing more damage that good.

    7. Re:They Kill PageRank to protect Microsoft by Alsee · · Score: 2

      if my search in Spanish for free mail returns a Spanish company instead

      You have a good point there, but that should reasonably weight the results. I checked correo gratis. Hotmail.com/ES was ranked five hundred and eighty seven. In my oppinion that result is broken.

      -

      --
      - - You can't take something off the Internet! That's like trying to take pee out of a swimming pool.
  25. How are the rankings worse? by greenfield · · Score: 0, Redundant

    I wish the article had given some more concrete examples rather than just quoting a few disconentented webloggers. I use Google many times on a daily basis and have found no change in the quality of their results. I guess some people are just fixated on their own ranking rather than the quality of the first results. At least the article touched on that: "I was beating out Mark Twain before -- that's probably not fair."

    --

    --Sam

  26. I noticed the same thing by zoomshorts · · Score: 1

    It seems like whenever someone makes a decent search engine, someone has to tinker with it until it breaks. I wish they would stop that.

  27. The Date on Google. by rapidweather · · Score: 2, Informative

    I notice that google gives a date with the results, and that this is when the googlebot, apparently, visited the link. As a test, I put my keywords in over several days, and google returned the page I was looking for with a date that was, in most cases, only a few days old, together with a short sentence from the beginning of the page. I had thought that googlebot took a month to go around the internet and list all the pages, but from this date thing, I would think that googlebot only takes a few days at the most. Try this: put "gary gray tropical discussion" in google. I just did, and the date returned is 10-1. I didn't put it there, googlebot must have, IMHO. Very nice, anyway, and google and search engines that work with a bot remain my favorite.

  28. xenu.net by nyri · · Score: 1

    Well there is a problem solved. Searching 'scientology' comes with www.xenu.net at #2. This is a Good Thing(TM).

    I encourage people not to judge Google's update before a few patches. All x.0 releases are non-functional. All you software engineer should know.

    -- nyri

  29. They did it to protect Microsoft and Adword$ by registro · · Score: 5, Interesting
    Well, this is what we found until now. In order to fight "Googlebombing" and "Pagerank for sale", they may have downgraded results when the Keywords is not in some important part of the on-page text, (to stop googlebombing), and the anchortext in ranking may have been tuned down, (to stop Pagerank monetization), specially if the linking pages do not have a good PageRank to begging with. Internal links, and links from interlinked pages may have been tuned down also. Still, we have sees as many as 200 regional competitive cats easily dominated by unscrupulous Dmoz editors. We have done some testing on that.

    To test if we really are in front of a Dmoz dominated Update, we have set up a Aspseek based small search engine, a GNU search engine with a crude PageRank-like ranking system. We have indexed around 1.500.000 pages, using as the starting point 700 Dmoz very competitive dmoz cats, including up to 250 pages per site, following up to 10 outside link, with up to 100 pages per outside link. What we found was that 59% of our top 20 results on the 100 cat-related competitive Keywords where also top 20 in Googles new index, and 26% of our Top 10 where also Googles Top 10.

    But we must also said that we have not been able to find a so-compelling relationship using no-competitive categories. A 2.000.000 pages index with non-competitive regional cats, using non-competitive Keywords, showed a very small correlation between top 10, top 20, and even top 50 results.

    So, our working theory right now is, yes, small changes, probably committed in order to fight both googlebombing and Pagerank commoditization, have affected the index accuracy in many different ways.

    We think the index is unbalance, or unless much more unbalanced than the last one, and, as a result, the weight of some previously no-so important characteristics are souped-up, opening the door for abuse.

    The main effect of all of this is small well managed sites at competitive categories can not relay anymore in good content + good linking to get a good listing. They will have to pay Google using Addwords program. That is the main change here. Forcing small business to pay to get at the top, using advertising space.

    But we do think this update and the changes committed are, to say the least, unbalanced, and the new algo is rampantly open to easy abuse. Lets hope Google good old Phd common sense returns soon, and a new, improved update takes place as soon as possible. Lets hope they are not tring to make a easy killing by forcin small popular sites, all of the sudden deprived from traffic, to pay google using Adwords.

    1. Re:They did it to protect Microsoft and Adword$ by DNS-and-BIND · · Score: 1, Offtopic

      Your post is so full of insider lingo I found it incomprehensible. Certainly not "Interesting" at all.

      --
      Shutting down free speech with violence isn't fighting fascism. It IS fascism!
    2. Re:They did it to protect Microsoft and Adword$ by registro · · Score: 1
      Sorry about that, DNS-and-BIND, my English is not as good as it should.

      I tried to make it clear, but some times it is difficult for no-native English speakers to tell out plain English from technical English, specially for slashdot reader like myself :)

    3. Re:They did it to protect Microsoft and Adword$ by ces · · Score: 1, Redundant

      This sentence no verb.

      --
      Happy Fun Ball is for external use only.
    4. Re:They did it to protect Microsoft and Adword$ by packeteer · · Score: 2

      Hes just being a punk dont worry about it. I found your article interesting yet difficult to read only because you wasted few words and packed in as much content as possible. If i could have i would mod you up but i have no points. Anyone willing to take the tmie to read your post will understand it fine.

      --
      unzip; strip; touch; finger; mount; fsck; more; yes; unmount; sleep
    5. Re:They did it to protect Microsoft and Adword$ by registro · · Score: 1
      And all your base belong to us.

      :)

    6. Re:They did it to protect Microsoft and Adword$ by Anonymous Coward · · Score: 0

      I thought your article was clear and informative; thank you.

    7. Re:They did it to protect Microsoft and Adword$ by Omnifarious · · Score: 1

      Thank you for your post. Better English would've made it much easier to parse and understand, but it was informative regardless.

  30. End paragraph says it all by Styx · · Score: 4, Insightful
    Pilgrim, whose blog dropped from first to sixth place in a search for "mark," admitted that weblogs may have been overrated prior to the latest index. "I was beating out Mark Twain before -- that's probably not fair."
    --
    /Styx
  31. They want us to pay by registro · · Score: 1
    They degraded the "competitive" categories, aka categories where people is more likely to pay Google using Adwords (Google Ad program) if they don't get a good listing.

    Most of dose categories are now dominated by garbage sites and spamers. Good sites with a lot of content and a lot of popular links who used to be #1 or #2 are now #60 or #100, precede with spammers and affiliation programs and 1 page redirection sites.

    Those sites are being forced to pay Googles "affordable" Adwords program to stay alive.

  32. Something weird was happening by Anonymous Coward · · Score: 0

    Thursday and Friday when I was doing searches I was getting back these really weird hits. When
    I would click throught it would be a page with a bunch of links to unrelated sites.

    The hits would all point to different domain names and have a different "look", but I would click through and get the same result.

    I checked this morning after reading this article, and the bogus hits were gone. Looks like someone had found a way to fool pagerank and google quickly closed the hole.

    Did anyone else notice this?

  33. Wow, can you people even read? by autopr0n · · Score: 4, Interesting

    According to this Wired article, recent tweaks to Google's PageRank search algorithm have degraded rather than improved the accuracy of the results

    Actualy, according to the artical, a few people who run blogs seem to think that google has been degraded, while google itself has not seen a higher number of actual complaints.

    Basicaly what happened is that google took some mesures to reduce the effects of "googlebombing" by bloggers.

    --
    autopr0n is like, down and stuff.
    1. Re:Wow, can you people even read? by Anonymous Coward · · Score: 0

      > Basicaly what happened is that google took some mesures to reduce the effects of "googlebombing" by bloggers.

      In the process, Google made the biggest changes to their weightings for a long time. Some kinds of searches are returning lower quality matches now.

    2. Re:Wow, can you people even read? by Chester+K · · Score: 2

      Actualy, according to the artical, a few people who run blogs seem to think that google has been degraded, while google itself has not seen a higher number of actual complaints.

      I'm not a blogger and I've noticed more and more crap results on the first pages of my google searches in the past month or so. Unfortunately for Google, I haven't really stopped to think about why my search results have been crappy (and thus, complained to Google about it), but I have noticed I've been getting better results from alltheweb.com.

      The silent majority aren't going to complain about bad search results, they'll just move to another site that does better. How do you think Google managed to steal the crown from Altavista, et al. in the first place?

      --

      NO CARRIER
    3. Re:Wow, can you people even read? by Anonymous Coward · · Score: 0

      "Bloggers?" Let's call them by their true names: egomaniacal whiners.

    4. Re:Wow, can you people even read? by superyooser · · Score: 1
      Google is bombed alright (in a worse sense of the word), and they did it to themselves. I'm not a blogger nor associated with any bloggers, but I agree that Google has been degraded. Last night I was searching for something, and the very first result was a 404. And it wasn't available in the Google cache either. Google turned up other 404s in my searches besides that. That is simply un-Googlian. :)

      Their "I'm Feeling Lucky" button needs to be changed to "I'm Feeling Really Really Really Lucky."

    5. Re:Wow, can you people even read? by bad-badtz-maru · · Score: 2


      The results are hosed, the timing of the article is interesting for me, because I just sent a complaint to google about 30 days ago. Any search that contains the term "wholesale" in it contains pages and pages of results (some times as many as ten consective pages) that all redirect to ebay. Now obviously "wholesale" is going to be a spam magnet, but to have in excess of 75% of the first 10 pages of results all link to some sort of redirection script indicates some sort of page ranking issue.

      maru

    6. Re:Wow, can you people even read? by Anonymous Coward · · Score: 0

      this appears to not be true any more. perhaps some of the changes have finally filtered through, and dealt with this?

  34. Not quite by registro · · Score: 1

    Not quite. Now, at the competive categories. Is pretty easy to detect who is the one at the top at conpetitive searches:
    The spammer with the greater number of Dmoz Editor accounts.

  35. Was it just me ? by Kuts · · Score: 0, Offtopic

    "According to this Wierd article, recently...."
    :)
    [cleaning spects...]

    1. Re:Was it just me ? by Kuts · · Score: 1

      Oops... it should be Weird
      :)
      [need some more sleep...]

  36. Re:other search engines/ They all need to get bett by great+throwdini · · Score: 5, Informative

    At Google, go to the "Directory" tab, or go to DMOZ.org (Open Directory) itself. DMOZ is bigger, better organized, has fewer broken links, no ads, and is built by hand by people who know their categories and are interested in keeping them linking only to sites with meaningful content.

    First, I would suggest going directly to the categories at dmoz.org rather than the Google relistings. Google picks up revised RDF dumps from DMOZ whenever they please, but the lag in the cycle is pretty long. If you are looking for the "fresher" data, go directly to the source.

    Second, DMOZ can become what you say it is only with proper editing. The project itself may list 50000+ editors, but they're volunteers and there is a lot of ground to cover. A large number of edits are made by those "high up" in the directory structure to "lower"/"deeper" categories less well understood. Certain branches of the project are neglected; others eat editors for breakfast with the amount of work that needs to be done. Volunteer and help out.

    You may also want to investigate ChefMoz and MusicMoz, too.

  37. Re:other search engines/ They all need to get bett by marmoset · · Score: 1

    I've had occasion to play around with AllTheWeb, which also often gives different (but still relevant) results on lots of queries. Worth a look.

  38. That damn algorithm by the_pooh_experience · · Score: 1

    What do you expect when Google's technology is based on the spatial orientation of pigeons as outlined in Google's own white pages!

  39. People are cheating by chrysalis · · Score: 5, Insightful

    Google's results are now less accurate because people are cheating. The way pagerank works is widely documented, and people abuse from it to get better scores.

    I work for a company that hosts pr0n sites. Maybe 95% of our partneers are cheating that way. Fake sites, fake auto-generated HTML pages (with pseudo real sentences), cloaking (what Google sees is not what visitors see), javascript tricks, etc. are a must. They spend most of their time on trapping google, it brings more money than working on the site itself.

    The company I'm working for has even a team working full-time on this (spamming search engines, and creating thousands of fake sites just to promote one real site) .

    --
    {{.sig}}
    1. Re:People are cheating by Tablizer · · Score: 1

      If I were google, I think I would *give up* indexing porn and simply sell the ranking to the highest bidder (with a disclaimer of course).

      Then again, didn't Overtour copyright sorting by rank (of all the ridiculous things)?

  40. darn by Raiford · · Score: 4, Funny

    never should have stopped using gopher...

    --
    "player 4 hit player 1 with 0 stroms"
    1. Re:darn by Anonymous Coward · · Score: 0

      Maybe you don't need to...
      gopher://google.com

  41. Slashdot has influence? by Anonymous Coward · · Score: 0

    I couldn't help but thinking of this story dealing with PageRank abuse, and wondering whether it inspired some contemplations at Google about the value of PageRank today.

    BTW: The story should point out that a few people feel that Google "degraded" the results, but that is not a conclusive statement: They don't like that their BLOG isn't coming up at the top because of their recursive link arrangements, but that doesn't mean that the results have degraded for you and me.

  42. Search Engine Optimisation - Don't waste your time by Boss,+Pointy+Haired · · Score: 5, Insightful

    Here's why.

    Your job, as a webmaster, is to produce a user friendly, useful, maybe informative website.

    Google's job, as a search engine, is to find the sites most likely to be of interest to a user, based on their search terms.

    Therefore,

    To get good rankings, all you have to do as a webmaster is produce a user friendly, useful, maybe informative website.

    It is Google's job to optimise to the web, not the web's job to optimise to Google.

    So,

    Search Engine Optimisation is big massive NET LOSS to you, because all it results in is getting visitors who aren't the slightest bit interested in your website or product.

    It also results in a soon to be pretty useless Google, so please don't do it.

  43. You gotta try this one.... by Anonymous Coward · · Score: 0

    In Google, type in "God" and click the I'm-feeling-lucky button. Very interesting results!

  44. observations... by Lazy+Jones · · Score: 5, Interesting

    * you can't beat the best google-spammers in the end ... they're always smarter, quicker, difficult to identify

    * worse rankings with a particular keyword mean that a company will seriously consider using AdWords to maximize the traffic gain from google - so hey, it's good for google ... let's hope this isn't a reason

    * the big mistake is to use a "static" relationship between websites as a measure for a site's traffic or importance - better offer a "google counter" (google has the resources, I suppose)

    All things considered, Google is still doing pretty well.

    -lj

    --
    "I love my job, but I hate talking to people like you" (Freddie Mercury)
  45. Google to Meet the Webmasters? by Anonymous Coward · · Score: 0

    Webmasterworld, the forum cited in the Wired article have a conference next week.

    Looks like Google will be there .

    :)

    1. Re:Google to Meet the Webmasters? by Calum+I+Mac+Leod · · Score: 1

      Apparently, Google haven't been getting much feedback about the changes yet.

      With quite a few of the world's top Internet marketeers assembled in one place, I suspect they'll get plenty of feedback next Saturday...

  46. Been slipping for awhile by Anonymous Coward · · Score: 2, Interesting

    Google used to be pretty damn good. Have a hard time finding stuff these days though. I think one part of it is time. Google made its debut during an explosive time of growth for the 'net. That means the majority of the searchable items were roughly the same age, and so timewise context wasn't really an issue. But now that we've all got a few more years around our waists, some of those documents aren't going to be much use to us folks who are living here in the next millenium. An aged document should get a heavy negative score unless the user is explicitly looking for old stuff.

    Was having problems with a program last night and kept finding links from 1996. I kept saying to myself, "My good man, surely those bugs have been fixed since then!"

    Looking for things on the net with any search engine is a distinctly unpleasant experience. Kind of an impotent feeling to type in an explicit search with all the trappings ("bla bla" -foo -gar..etc), and to get a bunch of nothing in the results.

    And don't ask me why, but I've been getting an uneasy feeling about google. Where do they get all the money such a company needs to stay afloat? They're in a pretty good position to learn all kinds of neat details about everybody. Pretty powerful position to hold.

  47. Alltheweb by autopr0n · · Score: 2

    Alltheweb works pretty well as a google alternative, and it has a better design too :)

    Don't get the wrong idea, I like google, but having just one search engine that's commonly used is a bad idea. No other company can really compete due to the fact that google has so much traffic to sell ads on.

    So anyway, that's why I pimp alltheweb whenever this topic comes up.

    --
    autopr0n is like, down and stuff.
    1. Re:Alltheweb by alphaseven · · Score: 2
      I like alltheweb as an alternative too. Their indexing isn't as good as google's, but the do seem to index a lot of pages that google has missed.

      I use it if google gives me less than a dozen hits for a query.

  48. Mod This Up... by InfraredEyes · · Score: 1

    ...it's the most intelligent comment I've seen on Slashdot since my last lot of mod points expired.

    1. Re:Mod This Up... by Anonymous Coward · · Score: 0

      Actually don't mod it up. it's competely false.

      "Search Engine Optimisation is big massive NET LOSS to you, because all it results in is getting visitors who aren't the slightest bit interested in your website or product. "

      so what if I optimize for words that are likely to bring visitors that are interested in my products. OH MY LORD, thats exactly what search engine optimization is.

      do you think we're just idiots who run porn websites and try to get traffic from geneatrics?

      I write pages for products, then SEO for keyterms which will make me money. Highly related words. Duh.

  49. My rank has gone down! by Tim+Ward · · Score: 2

    Panic! Disaster!

    "tim ward cambridge" always used to get me in seven of the top ten places - now it's down to six!! Better get my mates to put up a few more links ...

    But as I'm still on the first page with "accommodation cambridge" I can agree - famous people are worse off, but technical queries are unaffected.

  50. They obviously DO tinker... by dpbsmith · · Score: 5, Interesting

    ...because I've noticed some odd correlations.

    For a long time, a search on "Samuel Johnson" returned Frank Lynch's "Samuel Johnson Sound Bite Page" as the first hit. And, flatteringly, but mysteriously, a search on "Eyeglass Prescription" returned a web page of mine as the first hit. (I say "mysteriously" because the only page that Google reports as linking to my page is... my own home page! So it is not PageRank that accounted for its ranking).

    About a month ago, Frank's page dropped to #3 and mine dropped to about #20. In Frank's case, the #1 spot went to a fine Samuel Johnson web site at Rutgers; in mine, I was edged out by a bunch of commercial sites selling eyeglasses.

    The interesting thing is that two or weeks ago both sites popped back to number 1.

    And then a few weeks later, Frank's is again at #3 and mine is down around #10 or so.

    I don't think there's any reason why eyeglass prescriptions and Samuel Johnson would be connected. (And, no, Frank's page and mine do NOT link to each other!) So the changes must reflect tinkering by Google.

    Neither Frank nor I use any kind of "cheating" to boost our ratings. And I don't think the sites that climbed above our did, either. Nor do I think many of the sites involved changed ANYTHING significant that would have altered their rankings.

    (BTW I'm NOT giving URL's because the contents of these pages are irrelevant to my observations, I don't want them slashdotted, and this is NOT an attempt to boost the rankings of either page).

  51. I doubt it by autopr0n · · Score: 2

    Come on, Microsoft and bill gates have had bad terms associated with them at google for ages. Assuming that google changed this just for M$'s sake is ludicris.

    If that was all they were worried about, they could simply have manually changed those searches to exclude MS (as they have done for people as small as Bernie Shiftman), that guy who spammed his resume around everywhere. Searches on his name would turn up pages bitching about him.)

    Btw, you definitely deserve a +5 for plagiarizing this post verbatim. Well, except for the paragraph breaks, I guess.

    --
    autopr0n is like, down and stuff.
    1. Re:I doubt it by registro · · Score: 1

      Well, is not just Microsoft. At "Go to hell", M$ was #1, AOL #2, and Disney #1. Waht a colection of power players!
      And the most embarrased company of that buggy result was... Google!
      Yes, I thing they were compeled to change it.

  52. Re:Search Engine Optimisation - Don't waste your t by Calum+I+Mac+Leod · · Score: 1
    To get good rankings, all you have to do as a webmaster is produce a user friendly, useful, maybe informative website.

    If someone can produce an informative website, they might benefit from some usability advice. In the same way, that person might benefit from some Search Engine Optimisation advice.

    For example, choosing good HTML page titles that reflect the content of each page helps Google to understand that page. This will drive relevant traffic to the site.

    Someone who gets you "visitors who aren't the slightest bit interested in your website or product" is not optimising your site.

    If everyone optimised their site for the subject matter on that site then search engines become more relevant, and the site owners make more money.
  53. Re:People are cheating -- so you condone it? by Anonymous Coward · · Score: 0
  54. Stanford's not #1 any more by K-Man · · Score: 3, Interesting
    When I wanted to look up Jeff Ullman's home page, I used to go to google and type "ullman" and he would come up first. I assumed that being a Stanford professor guaranteed a number one ranking, but now he's slipped a notch, and some company that makes sails is in his slot.

    He still beats Tracey Ullman, though.

    --
    ---- "If we have to go on with these damned quantum jumps, then I'm sorry that I ever got involved" - Erwin Schrodinger
  55. How exactly is this a bad thing? by Have+Blue · · Score: 2

    Most of those "funny" search results (like go to hell) were the result of googlebombing, one of the very few loopholes in Google's spider. That loophole has now been closed, and I personally don't give a crap. Remember that Scientology was using it to inflate their Google rank too.

  56. Re:Search Engine Optimisation - Don't waste your t by The+Good+Reverend · · Score: 5, Insightful

    I agree with you 100%, but I don't think it'll ever happen. If everyone did as you suggested, and no one cheated, we'd live in a perfect index world - search results would accurately reflect what's on the web. Searches would be easier, and more relevant sites would come up with more specific searches. The number of hits you got coming from Google (or anyone else) would likely depend on the "real" content of your site.

    Then, one day, someone would rediscover that by putting "Anna Kournikova Blowjob" on their site, they'd get some hits they wouldn't have otherwise. No harm, they think, it's a few bytes and it's not hurting anyone. Webmaster tells his friends, they tell their coworkers, and this continues until the search engines have to begin working around the issue.

    Sound familiar?

    Unfortunately, people aren't always honest, especially when it's something they perceive as benefiting them without hurting anyone else, especially if the benefit is financial. They don't care about the integrity of some other website's engine, they care about profits. Realize also that even personal sites do this; people like other people to see what they've made. While your idea is a great fantasy, near-perfect search results will only come from human-edited sites, or a better algorithm than we have now.

  57. Plural oddity by Tablizer · · Score: 4, Informative

    I noticed that google is plural-sensative. For example, "SQL alternative" will give a bit different answer than "SQL alternatives".

    It does not seem like a very good idea to me in most cases.

    1. Re:Plural oddity by Anonymous Coward · · Score: 2, Informative

      the reason they're doing this is that google's indexing a *lot* of non-english pages... stemming algorithms (which would index 'alternatives' as 'alternative') generally work decently well on english, but applying even the most basic rules to other languages has really strange, unpredictable, and generally bad results. so, instead of trying to figure out what language each page/query is in and stemming appropriately, which would be very, very painful to do, they just ignore stemming. also, IIRC, another reason that stemming isn't used much is that its been found in academic studies that it doesn't really improve the results you end up with that much...

    2. Re:Plural oddity by Tablizer · · Score: 2

      (* the reason they're doing this is that google's indexing a *lot* of non-english pages *)

      If it is an English word, then they should ingore plurality. Generally, it should be pretty easy to determine the language of a page anyhow using statistical probabilities, etc. If the metric suggests that it is mixed, then maybe be more literal.

    3. Re:Plural oddity by Anonymous Coward · · Score: 1, Informative

      (* Generally, it should be pretty easy to determine the language of a page anyhow using statistical probabilities, etc. *)

      yeah, that shouldn't be too hard... there's loads of optimizations they could make, that would probably return *very slightly* better results *sometimes*... the problem is that they need to be able to return your results to you very, very quickly, or people get frustrated... they chose speed over stemming.

      a habit i've gotten into is using wildcards for this sort of thing -- 'alternative*' . yeah, it'd be nice if they did it for you, but i think that they just don't want to deal with checking for languages, stemming everything, etc, as it'd be too much of a slowdown. just a choice they made, and i'm sure that it was consciously done, and with a reason.

    4. Re:Plural oddity by Tablizer · · Score: 1

      the problem is that they need to be able to return your results to you very, very quickly, or people get frustrated

      What I suggested would be something they do when they index or parse your page, and NOT when somebody searches. IOW, it would be a back-end process.

    5. Re:Plural oddity by Anonymous Coward · · Score: 0

      only problems there - 1) they'd have to reindex everything from scratch before implementing it, and 2) they still have to figure out what language your query is in.

    6. Re:Plural oddity by Anonymous Coward · · Score: 0

      you forget that virtualy every english word is made of roots that are from other languages. 10,000 or so of the most common enlish language words appear in other languages.

    7. Re:Plural oddity by Tablizer · · Score: 1

      (* they'd have to reindex everything from scratch before implementing it *)

      They periodically reindex everything anyhow.

      (* they still have to figure out what language your query is in. *)

      Well, maybe. There is no language selection option? If a query only has words from one language (based on lookup), then it would be easier to know. The IP address can be used to guess in many cases.

      How many languages do plurals significantly change the meanings of anyhow? How large is that audience?

    8. Re:Plural oddity by Anonymous Coward · · Score: 0

      you forget that virtualy every english word is made of roots that are from other languages. 10,000 or so of the most common enlish language words appear in other languages.

      Yeah, damn thief English! A better statement would be "most languages derive from common roots, and liberally steal from each other". Claiming that English takes from French is foolish when both have a common ancestry in Latin, for instance.

    9. Re:Plural oddity by Tablizer · · Score: 2, Funny

      Claiming that English takes from French is foolish when both have a common ancestry in Latin, for instance.

      And Latin stole its words from medical texts.

    10. Re:Plural oddity by pjp6259 · · Score: 1

      Removing the -s ending from words is an example of stemming. Another example is removing -ed from past tense verbs. In general removing any inflections from words.

      Traditionally this has not always worked as well as might be expected. The example I always see used, is for stocks & stockings. Both end up with the same stem "stock". When search engines first started using stemming, someone who was searching for stocks, would get results to pages that had porn ("stockings"). This is just the most common example, and I imagine other common examples exist. In general, stemming has not been found to improve the quality of searches because of words such as these that share the same stem, but have dissimiliar meanings.

      --
      Computers don't make mistakes. What they do, they do on purpose.
  58. Will we will pay for our relevance? by QuantumWeasel · · Score: 2, Interesting
    This suggests a related thread, not so far off topic as it may seem.

    The service search engines provide to all of us, in the face of what can only be charitably called a glut of data (some of it actually useful to some of us) is remarkable. (And let's be clear: We're talking about Google, possibly the only thing keeping the net from collapsing under it's own weight in ether.)

    It's great that Google is effectively "free" to us, but should that be so? I'm personally willing to pay a nominal charge -- perhaps in the form of a subscription -- for access to the most effective search algorithms. It has that much value to me. If someone wants to create a free system -- as Google has done -- that's great. But as the web gets bigger and hairer, we might need to think about ways to support (read, fund) continued development of effective search/navigation tools.

    There's some serious linear algebra in publicly disclosed portions of the PageRank(TM) algorithm, and the really good stuff is probably even funkier. This sort of thing isn't going to be hacked together on the weekend by your average penguinista. Could the brains behind Google go diving for herring in richer seas? I suspect so.

    1. Re:Will we will pay for our relevance? by markprus · · Score: 1

      The Brin/Page paper The PageRank citation ranking: bringing order to the web., which describes the PageRank algorithm, does't have any "serious" linear algebra that I could see. The PageRank implementation does become non-trivial because you have to use out-of-core sorting (disk based merge sort) and you have to optimize it for speed because of the huge size of the dataset (the web graph). Other than that there is nothing magical about Pagerank.

  59. PUTO SUDACA by Anonymous Coward · · Score: 0

    PUTO SUDACA

    1. Re:PUTO SUDACA by registro · · Score: 1

      Madrileño, Actually.

    2. Re:PUTO SUDACA by Anonymous Coward · · Score: 0

      I shit on all you spaniards.

  60. Perfect search results? by teetam · · Score: 4, Informative
    Perfect search results are only present in the minds of the searchers. Google is, without doubt, the best search engine around.

    The pagerank algorithm is one of the most important reasons why it is so good in bringing up relevant and popular results. But, this is just one of the ways of searching for good results and will not always work to your satisfaction.

    Google gives preference to the number and quality of links to a particular site rather than the content of the site itself. One can easily come up with cases when this is probably not the best approach.

    For example, consider a portion of the web containing lyrics of songs. If you search by artist name or song name, google will return excellent results, because the pages are probably linked using the names. However, if you only know the soneg from radio, you might want to search for songs containing a few particular lines. The pagerank algorithm might not be the best fit here.

    --
    All your favorite sites in one place!
    1. Re:Perfect search results? by theBOPfromH*LL · · Score: 1

      Your example concerning lyrics is not accurate. I must conclude you did not actually try searching for song lyrics. I do so frequently. Google works great for finding song lyrics, even if you don't know the name of the artist or song. Include the keyword 'lyrics' and several quoted phrases that you know. Be sure to add a + to words like 'the' that google automatically ignores. Chances are, you'll find what you're looking for.

    2. Re:Perfect search results? by RedWizzard · · Score: 5, Informative

      Unfortunately for your theory Google is very good at finding song lyrics given only a small quote. I've done this several times. Try it.

    3. Re:Perfect search results? by r7 · · Score: 1

      >Perfect search results are only present in the minds of the searchers. Google is, without doubt, the best search engine around.

      So far it is, assuming your queries are straightforward, involve clear refs, and you don't search newsgroups.

      Otherwise be prepared to spend some time submitting multiple queries. It's especially bad in the newsgroups search where they've made "relevance" the default instead of "date" (See also www.pbs.org/cringely/pulpit/pulpit20021003.html").
      I also have a hard time with foreign language results and have to use the advanced search too often for comfort (I guess they assume the translate engine covers all the bases. It doesn't)

      Is this a result of information overload? bottom line focus (i.e, bean counter influence)? or just the inevitable drifting off course? We may never know but if anything it may be a good opportunity for competitors.

  61. Google should get spiteful by Julian+Morrison · · Score: 5, Insightful

    If a site is "convicted" of google-spamming (use the ranking engine to prescreen, and human checking to verify), or of helping to spam, it should be permanently blocked from the results by name and IP.

    Result: pr0n sites will be too terrified of deletion to munge their ratings.

    1. Re:Google should get spiteful by Anonymous Coward · · Score: 1, Insightful

      Nope, it's still too lucrative. You would just do it until you got caught and then change your company name and start up a new website.

    2. Re:Google should get spiteful by Julian+Morrison · · Score: 2, Interesting

      First: you'd get caught instantaneously. The ranking engine that scores you for display == the spammer detection engine.

      Second: for persistent offenders, don't just delete the spamming site, delete all the sites they own or operate. Or threaten the ISP with black-holing.

    3. Re:Google should get spiteful by vegetablespork · · Score: 1

      Easier said than done. Spammers, and those who operate questionable web sites, often operate through a multitude of shell companies and false entities. Check out NANAE for some of the sluething that has taken place to find the most notorious offenders in the email realm. I imagine those cloaking, etc., are just as "clever."

      --

      Call (206) 338-5780 COLLECT for information about a genuine BA, BS, MA, MS, MBA, or Ph.D.

  62. What? by Lukey+Boy · · Score: 1

    My site (submail.net) uses mod_gzip and sends compressed content to Mozilla all the time with not a single problem so far.

    1. Re:What? by Gekko · · Score: 2

      Well I was using PHPBB with gzip enabled and mozilla/netscape7

      --
      I mod down any one who says "I'm sure I will get modded down for this"
  63. For Me: Google First, Alltheweb Second by Snork+Asaurus · · Score: 5, Interesting
    The following is highly subjective, but I do a great deal of searching.

    Background: Among other things, I am always trying to discover music from independent (especially blues) artists that post mp3's of their stuff on the web. I have been boycotting the major record labels 100% for about 15 years (hooray for independents!) for several reasons: 1: CD prices have always been a rip-off, 2: most major label artists suck, 3: I have worked in music business (artist/bands/production) and detest the industry for the way that it exploits artists, 4: I have always loved discovering talented "unknowns" and turning other people on to them. I went through a new music dry spell until the web started to become a vehicle for independent artists to promote themselves.

    It's amazing what's out there now - I've found great artists from all over the Americas, Europe, Australia, Russia and even a few from Asia. I have found a lot of crap, too. The mp3 search engines are essentially useless for this purpose (I don't want major label music) and I have never used Napster or any of the off-spring. Links pages are more often out of date than not and webrings have similar problems. I have contrived several search techniques that try to exploit the strengths of search engines and the likely information on an artist's site. One very simple one is to look for "mp3 +(insert name of a well-known blues standard) -(a lot of keywords to exclude the many sites that put "mp3" on every page that simply lists a song title just to pull in traffic) -(specific sites that pollute the searches)", to find artists that cover the song and also have their own tunes.

    I have been a proponent of Google for many years. It came along just as I really started to dislike Altavista and I was an almost instant convert. But I am always on the lookout for a backup or something better. I have tried Teoma several times in the last year (as recently as last night), but I'm not terribly impressed. I find its interface and the way it presents results simplistic and dumbed down and it appears to have indexed far less of the web than Google. I got turned off Lycos years ago, when it seemed to want to become another portal/Yahoo (as if we need another one).

    The one search engine that I do use as a regular alternative to Google is Alltheweb. For one thing, IMO, its advanced search is currently better than Google's (I swear that I have brought Google to its knees by entering too many keywords - it stops responding and is inaccessible for several minutes thereafter - this has happened several times). When I've done back to back comparisons with Google, Alltheweb seems to fare pretty well and seems to find more international pages than Google. The difference in top rankings can also be useful. Google has some nice features that Alltheweb does not, such as the elimination of duplicate pages.

    For one-stop searching, I find Google best for me, but Alltheweb is a good alternative.

    --
    Sigs are bad for your health.
  64. Open source software for PageRank by ramjam1729 · · Score: 2, Interesting

    I know Pagerank algorithm is patented.(6285999) Will I get into any trouble if I write any open source software using a similar algorithm? Thanks.

  65. Re:Search Engine Optimisation - Don't waste your t by great+throwdini · · Score: 2

    Your job, as a webmaster, is to produce a user friendly, useful, maybe informative website.

    Google's job, as a search engine, is to find the sites most likely to be of interest to a user, based on their search terms.

    And somewhere in the middle, the two concerns meet. There are a *lot* of unintentionally odd entries in Google that should be cleared out as misleading. Three examples: (1) Redirection URLs from one site but content from the targeted site; (2) Past versions of multi-version documents such as Wiki revision URLs; and (3) Static redirection documents from relocated sites.

    All should probably be combatted by site maintainers through application of robot exclusion rules and a little redirection know-how. However, given the frequency with which I've encounted these awkward entries, I'm guessing that attention to such detail has gone by the wayside.

  66. How will I trick kids into looking at my Butt?? by HamNRye · · Score: 2

    The web is not optimizing Google.

    Your job as a webmaster: Generate massive quantities of traffic... While figuring out how to be profitable.

    Google's job, as a search engine: Generate massive quantities of traffic... While figuring out how to be profitable.

    Now, the webmaster could accomplish that by offering tons of great content at a better price, and Google can accomplish this by having the most relevant results. That takes care of the generating traffic.

    Now, profitability... Tons of great quantity at a reasonable price will not be supported by 100 users... You need hundreds of thousands for the economies of scale to work in your favor. End result, you do everything in your power to drive traffic at your website. Top of mind awareness, etc... What you postulate is that they should stop sending me Credit Card offers because if I wanted one, I'd call. True, but would I call your company or another one??

    SEO actually shows that we are achieving a Search engine monoculture. Back in the day when we had varied competing engines, bombing one site hardly had any impact. (Actually, most results were bombed so you got used to checking several engines.) And these techniques were around about two minutes after the first Search Engine went online.

    Google was an evolution beyond the early Excite, Lycos, et. al. If an unusable Google is the result, new ideas will surface that are evolutionary. This shows that there are areas where you could improve on Google search technology and either compete or license it to Google. Then we will have an improvement over Google like Google was over Infoseek, like Infoseek was an improvement over Yahoo.

    Webmasters are NEVER going to play nice with the search engines. The job of the search engine designer will be to limit the ways in which his system can be abused. XHTML should be a big help, as most search engines get easily fooled by having their parsers parse verrrrrrry loose HTML. When they start actual content checking and checking for invisible divs and other HTML tricks, the engines will evolve again. I'm suprised with the increase in processor speed it hasn't happened more recently, but the extra horsepower has been dedicated to indexing PDF etc... Next generation maybe.

    Hammy

  67. Re:Search Engine Optimisation - Don't waste your t by asackett · · Score: 2
    That's not how the real world works. Consider micro$oft -- they do not create the best, most useful, most stable software; they created the biggest marketing machine the world has ever seen. I do not recommend emulating micro$oft; I am merely using that famous example to illustrate my point.

    On the real web, the one where I make my living building and promoting commerce sites, the truth is that no matter how aesthetically pleasing, no matter how well engineered for human use, no matter how informative your site is or how low your prices are, if your site is not easily located by the methods people actually use, it is worthless.

    I don't waste my time doing search engine optimization -- I make very good use of my time doing search engine optimization. I don't use doorway pages, spamdexing, googlebaiting, or any such cheats, but I do build every site from the ground up with the primary focus being search engine spidering and indexing. That's why I'm still in business, and my clients are not only still in business but are spending right now on expansions and upgrades to their sites.

    Your utopian vision would be fine if the real world would just consent to be ruled by it, but the real world is a fickle and contrary organism that defies such simplistic wishfulness.

    --

    Warning: This signature may offend some viewers.

  68. Observations on Google cache by prostoalex · · Score: 3, Interesting

    I e-mailed the problem to Google about a week ago, but so far they didn't seem to get around it. Anyway, a Google search on my last name reveals my personal homepage as the result number one, which is no surprise, considering the last name. However, the cached version of what supposedly is my site is an entirely different site that I have never heard of. Furthermore, since the results of Google search use the title and description from the cached version, the title for my homepage as well as description come up pointing to RhytmicPalmz.com or something of that nature. It seems to be a cache glitch, at least so far I haven't been able to come up with valid explanation for that.

  69. So? by Anonymous Coward · · Score: 0

    So what? It's their site... go use altavista if you don't like it.

  70. Re:other search engines/ They all need to get bett by SacredNaCl · · Score: 1

    One nice thing with Opera (6.0) is just type "A" then your query in the box where you would enter urls and it auto feeds your query to All The Web. Supersearch feature in Opera is also handy. I hope when they release 7.0 they add more engines to that very handy feature. When I'm on the Windows machine I sometimes break out FirstStop they have a free version which is decent for meta searching a few engines from the desktop. Searchlores is another interesting place to look for way too much information about search engines, and very strangely organized at that. Google is still my default search engine, but sometimes you can't find what you want there easily. Files can be particularly hard to come across. I use Atomic File Finder when that becomes bothersome. FTP-AFFD

    --
    Freedom is merely privilege extended unless enjoyed by one and all.
  71. mod parent up by Anonymous Coward · · Score: 0

    here here

  72. google results by Espectr0 · · Score: 1

    yes, the old code was better! "go to hell" doesn't show microsoft's web site anymore!

    1. Re:google results by luap2000 · · Score: 1

      maybe they need to switch to crows or something...

      http://www.google.com/technology/pigeonrank.html

      they had to get rid of the googleBomb. i'm sure results will be more in tune as the month progresses...

  73. Re:other search engines/ They all need to get bett by Khalid · · Score: 2

    MusiMoz is a really a good idea, but still not a lot of material, I really miss Audiogalaxy for discovering new music and artists. It will be intersting later to make some connection betwen p2p programs and MusicMoz when the later will really mature.

  74. +1 INSIGHTFUL by Anonymous Coward · · Score: 0

    I wish you didn't post as an AC because I think you hit the nail on the head: Google was merely responding to public sentiment about the manipulations of PageRank.

  75. kartoo.com instead of google? by Anonymous Coward · · Score: 0

    if it wasn't for the slow search time, kartoo would beat google. Well, atleast in my dreams :P

  76. Re:People are cheating -- so you condone it? by Swaffs · · Score: 1

    That was one of the funniest things I've read in awhile. Thank you.

    --

    --
    "Karma can only be portioned out by the cosmos." - Homer Simpson [1F10]

  77. Its fantastic! by quakeroatz · · Score: 1

    Try searching for these:
    "will you stand above me"
    "was once just a baby"

    Can you guess?

  78. It's Courteney... by Anonymous Coward · · Score: 0

    ...with an additional 'e'.

    And while we're at it, could she be any thinner? Put on some weight, girl!

  79. Scientology by Anonymous Coward · · Score: 0

    It seems that Xenu.net is now number two again, and scientology's official page is number one. I haven't check this recently, but Xenu.net was number one for a long time, and although I'm not sure if Xenu.net would have changed this much in such a short time, I think it's important that Xenu.net stays on top. Once again, Xenu.net, who Xenu.net Xenu.net, would Xenu.net till Xenu.net Xenu.net's.

  80. By selling to Law Enforcement et al by Anonymous Coward · · Score: 0
    And don't ask me why, but I've been getting an uneasy feeling about google. Where do they get all the money such a company needs to stay afloat?
    Don't think for a moment that AdWords is Google's only source of revenue. Google likely has a large clientele of "private" customers, including law enforcement.

    This is especially true when it comes to groups.google.com. The posts you make with "X-No-Archive: Yes" don't show up to the public, but they're archived alright. And believe me, there are organizations who're more than willing and have more than enough money to pay to get at those posts. Google doesn't archive binary Usenet groups, or binaries in non-binary groups, eh? Right... Look at their technology. Extrapolate. Imagine if you were Mr. BSA trying to figure out who's posted a pirated copy of Photoshop this year. Who would you call?

    Think also along the lines of targeted searches, periodic results of top matches for particular keywords, etc. When the US Customs Service wants a list of the most frequently-located sites for the keywords "child porn" or "preteen pics" or "little boys," who do you think they call?

    Google has an enormous amount of data, much of which none of us will ever see. Because we don't pay.
  81. google Theft bandwith of internet users by jamejamdaily · · Score: 1

    while i search on images.google.com,my bandwith theft, speed slow and slow and slow, before google cashe servers were highspeed and independent,but now is not and that depending on bandwith of online users.

  82. Irish Porn Star by meehawl · · Score: 2

    I think Google has only improved in recent weeks. For example, my blog now gets the top hit for Irish Porn Star. That's what I call progress.

    --

    Da Blog
  83. Google Search Quality by GeekSoup · · Score: 1

    There is no mention of Peter Norvig. I would think as the guy in charge of 'Search Quality' he would have the answer.

  84. They do tend to oscilate: cache effect? by Per+Abrahamsen · · Score: 2

    I have some pages ranking various search terms according to google hits (see my home page for links). I noticed that new terms tend to switch between two numbers for some time, but finally they stabilize. I suspect this is a cache effect, Google is a big system, and information may take some time to propagate within it.

    1. Re:They do tend to oscilate: cache effect? by luap2000 · · Score: 1

      it's called the GoogleDance - the shifting effect during the monthly update.

      also, though, there is a new 'minty fresh' bot crawling some sites frequently for fresh relevant content. so, some pages are added (daily?) to the index, causing it to change a little (everflux, they're calling it...)

      http://www.mallasch.com/journalism/search.php?qu er y=google

  85. Re:other search engines/ They all need to get bett by Zagadka · · Score: 2

    DMOZ also has the problem that many of the editors are biased. This is especially true in commercial categories, where editors will often make it difficult for competitors to get their sites listed. Yes, there are supposed to be rules against that, but it still happens. For commercial categories it's hard to find expects who are also impartial.

  86. Re:Search Engine Optimisation - Don't waste your t by luap2000 · · Score: 1

    exactly! this needs to be said more.

    of course, sadly, there will always be the hucksters with the get rich quick schemes...

    in the end, as google matures, i'm sure it will continue to be about the content...

  87. Last Post! by alpg · · Score: 1

    "I am convinced that the manufacturers of carpet odor removing powder
    have included encapsulated time released cat urine in their products.
    This technology must be what prevented its distribution during my mom's
    reign. My carpet smells like piss, and I don't have a cat. Better go
    buy some more."
    -- timw@zeb.USWest.COM

    - this post brought to you by the Automated Last Post Generator...

  88. Degraded? Sometimes by vegetablespork · · Score: 1

    degrading, but not degraded that I've seen yet. BTW, last post.

    --

    Call (206) 338-5780 COLLECT for information about a genuine BA, BS, MA, MS, MBA, or Ph.D.