Slashdot Mirror


El Reg Says Google Choking on Spam Sites

Grubby Games writes "The Register is reporting that Google is full, and in trouble." From the article: "Recently, we featured a software tool that can create 100 Blogger weblogs in 24 minutes, called Blog Mass Installer. A subterranean industry of sites providing 'private label articles,' or PLAs exists to flesh out 'content' for these freshly minted sites. And as a result, legitimate sites are often caught in the cross fire. But the new algorithms may not be solely to blame. Google's chief executive Eric Schmidt has hinted at another reason for the recent chaos. In Google's earnings conference call last month, Schmidt was frank about the extent of the problem. 'Those machines are full,' he said. 'We have a huge machine crisis.'" James Robertson points out that's a fairly selective bit of quoting.

234 comments

  1. Everyone - Attention by NotQuiteReal · · Score: 5, Funny
    Please start deleting items from the Internet. It is getting full.

    Thanks!

    --
    This issue is a bit more complicated than you think.
    1. Re:Everyone - Attention by endrue · · Score: 5, Funny

      eh... just defrag

      - Andrew

      --
      I meta-moderate because I care.
    2. Re:Everyone - Attention by Anonymous Coward · · Score: 1, Insightful

      Please start deleting items from the Internet. It is getting full.

      Oh my god, does that mean that offering everyone on the planet 1GB mail accounts for free and allowing everyone to host his shitty blog for free is not a profitable business anymore? Who would have thought!

      http://fuckedgoogle.com/

    3. Re:Everyone - Attention by cashman73 · · Score: 3, Insightful

      Can we start by getting rid of MySpace? That seems to be huge waste of space and bandwidth, and with 60 million subscribers, would definitely cut down on the internet's bloat.

    4. Re:Everyone - Attention by ZachPruckowski · · Score: 5, Funny

      SFD (Sites for Deletion):
      myspace.com

      Problem solved.

    5. Re:Everyone - Attention by Anonymous Coward · · Score: 0

      Who actually has the desire to spend time and money maintaining a site like that? Only a disgruntled ex-employee or a competitor, surely?

    6. Re:Everyone - Attention by pianomahnn · · Score: 1

      I think you're being sarcastic here...right? Besides, it's well over 70,000,000 users now.

    7. Re:Everyone - Attention by Anonymous Coward · · Score: 2, Funny

      no, just reduce the font. That'll do it.

    8. Re:Everyone - Attention by Anonymous Coward · · Score: 0

      I'm pretty sure it would be easier to just drag "The Internet" icon to the recycle bin.

      http://www.baltimoremd.com/humor/deletenet.html

    9. Re:Everyone - Attention by SydBarrett · · Score: 4, Funny

      ATTN: HELPDESK

      Could someone do a quick backup first? There might be something on the internet that I might need later. I think you can just use Ghost or whatever you IT guys do. Also, please burn it to CD and have it on my desk by COB today.

      -Executive Chief Officer SydBarrett

    10. Re:Everyone - Attention by jb.hl.com · · Score: 2, Informative

      No. Not solved. MySpace is nowhere near a problem. I've yet to see MySpace used as a link farm, anywhere, hell I've never even seen a MySpace page in a Google result (except results for "MySpace" obviously). I'd probably count Blogger as the thing that should be deleted, as well as poorly configured WordPress installations which allow anonymous commenting.

      Maybe you should let your own little personal prejudices slide a bit. MySpace isn't the great Internet evil, you know.

      --
      By summer it was all gone...now shesmovedon. --
    11. Re:Everyone - Attention by feepness · · Score: 4, Insightful

      Hmmm, it appears requiring a sense of humor for access to the internets might cut down on 'indignant post' volume as well.

    12. Re:Everyone - Attention by Xichekolas · · Score: 3, Funny

      I hate it when I have to buy a new Internet because mine is full. Hopefully I can get one of the new perpendicular Internets that holds more... wonder if I will need to turn my monitor sideways to use it...

      --

      Self-referential Sigs are cool on /. these days...

      54

    13. Re:Everyone - Attention by FLEB · · Score: 1

      MySpace isn't the great Internet evil, you know.

      "The"... "a"... Still, you've got to admit it'd be a step in the right direction.

      --
      Information wants to be free.
      Entertainment wants to be paid.
      You just want to be cheap.
    14. Re:Everyone - Attention by jb.hl.com · · Score: 1

      Not really. I know this is an unpopular view, but I think MySpace isn't all that bad. You can meet some great people through it, and the social networking aspect of it is very good. OK, it's without a doubt a poorly coded buggy piece of shit, but it works, and for many people it's one of the few reason's they're online. In no way is it "evil".

      --
      By summer it was all gone...now shesmovedon. --
    15. Re:Everyone - Attention by Anonymous Coward · · Score: 3, Funny

      Myspace is emo.
      Emo sucks.
      If you use Myspace you are emo.
      If you use Myspace you suck.

    16. Re:Everyone - Attention by Anonymous Coward · · Score: 0

      Maybe we *do* need an Internet Cleaning Day!

      http://www.snopes.com/holidays/aprilfools/cleaning .asp

    17. Re:Everyone - Attention by rholliday · · Score: 1

      Maybe you should let your own little personal prejudices slide a bit. MySpace isn't the great Internet evil, you know.

      joe-baldwin.net's MySpace profile complete with auto-playing song.

      We all have bias, you know. :)

      --
      Xbox reviews.. We think they're funny.
    18. Re:Everyone - Attention by sponga · · Score: 1

      Might not be on the first links, but myspace is starting to show up on the first page. I was looking for a 'hulk hogan chokes tv host' in search and it brought up myspace as first result.

    19. Re:Everyone - Attention by TrekCycling · · Score: 1

      Blogger.com. Future problems solved.

    20. Re:Everyone - Attention by sp0rk173 · · Score: 1

      Now, I have an auto playing song on my myspace page, but it's expressly there to annoy the hell out of people. It's embedded, not using their little convenient flash thing, and i've hidden the controls so you can't stop it. And, what's better, it's Johnny Cash singing cocaine blues! Which...basically makes most people like it, but it used to be a song about grotesque things to do with fetuses! And Jesus dances to it!

    21. Re:Everyone - Attention by stagl · · Score: 1

      that's funny. jb.hl.com _does_ take me to a link farm.

      --

      R.I.P.
    22. Re:Everyone - Attention by IamTheRealMike · · Score: 1

      MySpace is in fact littered with fake profiles that appear to be real people but are actually advertising products or link spamming.

    23. Re:Everyone - Attention by Cylix · · Score: 1

      I did see a yahoo blog as a link farm.

      Reported the issue and blammo... content erased...

      Which means someone out there does listen.

      --
      "You should always go to other people's funerals; otherwise, they won't come to yours." -- Yogi Berra
    24. Re:Everyone - Attention by pegr · · Score: 1

      Now, I have an auto playing song on my myspace page, but it's expressly there to annoy the hell out of people. It's embedded, not using their little convenient flash thing, and i've hidden the controls so you can't stop it. And, what's better, it's Johnny Cash singing cocaine blues! Which...basically makes most people like it, but it used to be a song about grotesque things to do with fetuses! And Jesus dances to it!
       
      Link?

    25. Re:Everyone - Attention by orthodoxRebel · · Score: 1

      aw, man. i suck. i am sad now

    26. Re:Everyone - Attention by rholliday · · Score: 1

      Link?

      Surprisingly enough it's http://www.myspace.com/sp0rk173. :)
      --
      Xbox reviews.. We think they're funny.
    27. Re:Everyone - Attention by mike2R · · Score: 1

      n jst stp wstng ltrs. y dnt nd vwls nywy!!!

      --
      This sig all sigs devours
    28. Re:Everyone - Attention by fm6 · · Score: 1

      You need free space to do a proper defrag. I suppose you could do what we used to do before defrag utilities were available: back the whole thing up, reinitialize, and restore.

    29. Re:Everyone - Attention by EtherealStrife · · Score: 1
      Remove keyword "blog" and all associated pages. Problem solved.

      MySpace is a useful way of tracking all 9738 of your online friends. Really, would you be able to do that on your own?

    30. Re:Everyone - Attention by The+Good+Reverend · · Score: 0, Offtopic

      Nice thing about the internet is that you don't have to go to any of the websites other people go to! And all the bitching in the world doesn't make you cool, it just makes me think you have nothing to do except whine about something on the internet that only affects your life as much as you choose it to.

    31. Re:Everyone - Attention by AKAImBatman · · Score: 1

      I'd probably count Blogger as the thing that should be deleted, as well as poorly configured WordPress installations which allow anonymous commenting.

      And what exactly is so "poorly configured" about allowing anonymous commenting? Some of thebest comments come from anonymous users. (Not that anyone in Wordpress is really validated.) Wordpress not only makes anon commenting easy, but it automatically catches many types of spam posts (especially link-spam) and requires that they be approved before going live.

    32. Re:Everyone - Attention by Anonymous Coward · · Score: 0

      That was beautiful.

    33. Re:Everyone - Attention by dindi · · Score: 1

      ohh, i beleive seeing a filter on yahoo, that allows ou to forget about sites ....

      yes blogger and about,com is one of those I would filter out indefinetely :)

    34. Re:Everyone - Attention by Guy+LeDouche · · Score: 0

      So what is your MySpace name?

    35. Re:Everyone - Attention by ozmanjusri · · Score: 2, Funny
      Internet Cleaning

      DO NOT CONNECT TO THE INTERNET FROM MAY 7 23:59 pm (GMT) UNTIL 12:01am (GMT) MAY 8.

      *** Attention ***

      It's that time again! As many of you know, each year the Internet must be shut down for 24 hours in order to allow us to clean it. The cleaning process, which eliminates dead email and inactive ftp, www and gopher sites, allows for a better-working and faster Internet.

      This year, the cleaning process will take place from 23:59 pm (GMT) on March 31st until 00:01 am (GMT) on April 2nd. During that 24-hour period, five powerful Internet-crawling robots situated around the world will search the Internet and delete any data that they find.

      In order to protect your valuable data from deletion we ask that you do the following:

      * 1. Disconnect all terminals and local area networks from their Internet connections.
      * 2. Shut down all Internet servers, or disconnect them from the Internet.
      * 3. Disconnect all disks and hardrives from any connections to the Internet.
      * 4. Refrain from connecting any computer to the Internet in any way.
      We understand the inconvenience that this may cause some Internet users, and we apologize. However, we are certain that any inconveniences will be more than made up for by the increased speed and efficiency of the Internet, once it has been cleared of electronic flotsam and jetsam.

      We thank you for your cooperation.

      Interconnected Network Maintenance Staff Main Branch, Massachusetts Institute of Technology

      Sysops and others: Since the last Internet cleaning, the number of Internet users has grown dramatically. Please assist us in alerting the public of the upcoming Internet cleaning by posting this message where your users will be able to read it.

      Please pass this message on to other sysops and Internet users as well.

      --
      "I've got more toys than Teruhisa Kitahara."
    36. Re:Everyone - Attention by zaguar · · Score: 1

      Didn't people get the Pink Floyd reference?

      --
      "Sure there's porn and piracy on the Web but there's probably a downside too."
    37. Re:Everyone - Attention by jb.hl.com · · Score: 1

      That's not my full URL, my full URL WAS joebaldwin.homelinux.com, but I had to condense it.

      --
      By summer it was all gone...now shesmovedon. --
    38. Re:Everyone - Attention by RinzeWind · · Score: 1

      There's nothing wrong with anonymous commenting in WordPress. I allow anonymous comments in my blog, in the sense that you're not required to register in order to leave a comment.

      What really hurts are WordPress installations that allow any kind of comment and don't have a working anti-spam system. I personally use Spam Karma 2 and my blog is spam-free.

    39. Re:Everyone - Attention by Zaatxe · · Score: 1

      Please, add orkut.com to this list!!

      --
      So say we all
  2. How accurate is the Register Article? by xmas2003 · · Score: 5, Informative
    James Robertson suggests that Orlowski mis-reports it again and says that the Register report is a "fairly nasty bit of selective quoting" and was referenced in the DIGG commentary that Google's not full.

    With hardware (and bandwidth) getting cheaper, I find it hard to believe that Google has actually run out of space. But certainly the explosion in the number of web pages is an issue, especially with auto-generated pages. One current example is the V7ndotcom Elursrebmem SEO contest (white-hat celiac charity site I'm supporting) - that nonsense phrase returned zero results on January 15th, 2006 ... but now returns almost 5,000,000 ... of which I gotta believe the vast majority were NOT typed in by humans.

    So maybe it's more that the techniques/algorithms used to spider and index are struggling with the bazillions of web pages out there. Or it could just be disgruntled webmasters PO'ed that their web site isn't listed!

    --
    Hulk SMASH Celiac Disease
    1. Re:How accurate is the Register Article? by Daniel_Staal · · Score: 1

      I'm sure it takes Google time to install new machines. If they didn't have installs scheduled that would hook into this system and the space needed was growing exponetially, even they could be caught off-guard.

      --
      'Sensible' is a curse word.
    2. Re:How accurate is the Register Article? by ConsumerOfMany · · Score: 0

      man, a slashdot user referencing a Digg article is like the pope quoting The DaVinci Code.....

    3. Re:How accurate is the Register Article? by Anonymous Coward · · Score: 0

      I find it funny that this is the same James Robertson who used to claim Smalltalk was really really fast! Like as "fast" as Java even!! And now their smalltalk-powered blog can't even keep up with a pretty minor story on slashdot.

    4. Re:How accurate is the Register Article? by david.given · · Score: 4, Informative
      Andrew Orlowski seems to have this weird grudge against Google --- he's been posting reams of violently anti-Google stories for, well, years now. It's reached the stage where if the subject line has 'Google' in it, and Orlowski's byline is attached, I just skip over; even if there's actual information there, it's going to be so wrapped up in snide misreporting as to be useless.

      Be warned.

    5. Re:How accurate is the Register Article? by Richard_at_work · · Score: 4, Informative

      The Register is one of the most bias, spinning tech news sites Ive ever read, and I first started reading it 6 years ago - its only got worse since then. I actually refuse to browse the site these days, only reading their articles when directly linked and pretty much all of them have some really evil spin on them.

    6. Re:How accurate is the Register Article? by Otter · · Score: 1
      One current example is the V7ndotcom Elursrebmem SEO contest (white-hat celiac charity site I'm supporting)

      Is this in competition with the guy whose girlfriend will have a threesome with him if his blog gets a million hits? If so, I've got to throw in with him. Sorry, sick kids but when you get older, you'll understand.

      Anyway, now that the link has finally loaded while I was writing the above -- OK, the quote is out of context but it's not that out of context. (Certainly not by /. standards for "out of context".) The link between Google's hardware capacity and its supposed search problems isn't presented as anything but speculation and Schmidt's comment doesn't seem irrelevant to that speculation.

    7. Re:How accurate is the Register Article? by ckaminski · · Score: 1

      What the hell do you expect from the home of the Bastard Operator From Hell? Cookies and a coupla pints?

    8. Re:How accurate is the Register Article? by Anonymous Coward · · Score: 3, Informative

      Orlovski? Isn't he the guy that also hates Wikipedia, with his sneering remarks about wiki-fiddlers and barely restraining himself from referring to them as Wikipedophiles?

      I don't know what his problem is, perhaps he just needs pageviews for the advertisers. So: write knocking article about popular website, fans of the website look, pageviews escalate.

      Google -- check.
      Wikipedia -- Check
      Slashdot -- ?

      (The captcha word for this submission was "referral". How do they do that?)

    9. Re:How accurate is the Register Article? by SatanicPuppy · · Score: 1

      Calling British journalism biased is missing the point...Of course it's biased. They don't ascribe to the American delusion that journalism can be unbiased.

      They wear their bias on their sleeves, which, in my opinion, is a good thing, because you know the type of slant that's on what you're reading, and nobody claims to be "Fair and Balanced" when they're anything but.

      --
      ad logicam Claiming a proposition is false because it was presented as the conclusion of a fallacious argument.
    10. Re:How accurate is the Register Article? by IAmTheDave · · Score: 1

      Quoth Nick Thomas, first reply to James Robertson's post calling bullshit:

      I thank my lucky stars every day that we have a news reporting medium where people who are spewing bullshit are swiftly called out on it.

      Damn right.

      --
      Excuse my speling.
      Making The Bar Project
    11. Re:How accurate is the Register Article? by Anonymous Coward · · Score: 0

      Here's a cure for you, stupid: DON'T EAT GLUTEN

    12. Re:How accurate is the Register Article? by david.given · · Score: 1
      Orlovski? Isn't he the guy that also hates Wikipedia, with his sneering remarks about wiki-fiddlers and barely restraining himself from referring to them as Wikipedophiles?

      Yes, that's him. I didn't mention it because I couldn't remember off-hand what his other phobia was (it's late in my time zone).

      He's okay when he's not doing opinion pieces, though.

    13. Re:How accurate is the Register Article? by gammoth · · Score: 1
      The whole "Fair and Balanced" thing is regarded as a bad joke now. Maybe not six years ago, but now Americans are pretty sheepish about the whole affair. But it's not brought up in polite conversation.

      Anyway, the point I wanted to make is that bias is one thing, but distortion is quite another. When one's bias leads one to gross distortions, then there's a problem. I think that's what the poster was getting at.

    14. Re:How accurate is the Register Article? by Anonymous Coward · · Score: 0

      About the only section worth browsing in El Rag ... sorry, Reg ... is the BOFH one. The rest ... meh. You can get the same news earlier and better written elsewhere.

    15. Re:How accurate is the Register Article? by whoever57 · · Score: 1
      With hardware (and bandwidth) getting cheaper, I find it hard to believe that Google has actually run out of space.

      No, but I think it has run out of people to install and manage all that space. Apparently hires into "Reliability Systems Engineering" (or whatever Google calls their system admin group) are one of the hottest areas for Google right now.

      --
      The real "Libtards" are the Libertarians!
    16. Re:How accurate is the Register Article? by Anonymous Coward · · Score: 0

      Biased.

    17. Re:How accurate is the Register Article? by Jerf · · Score: 1

      He's okay when he's not doing opinion pieces, though.

      With Orlowski, every piece is an opinion piece.

      The guy's shameless. For pete's sake, he links to the source of the quote he twisted, which makes it clear it has been twisted.

      He's either got no concern for truth, or has no ability to discern it.

    18. Re:How accurate is the Register Article? by fm6 · · Score: 1

      "Fair and balanced" is only a joke because it's the (trademarked!) slogan of the most agenda-driven "news" network. Most real journalists have some notion of objectivity — even in the UK.

    19. Re:How accurate is the Register Article? by MythMoth · · Score: 1

      I stop reading any Reg article as soon as I spot Orlowski's byline. He's their version of Katz. He has a particular bee in his bonnet about Google.

      --
      --- These are not words: wierd, genious, rediculous
    20. Re:How accurate is the Register Article? by colinbrash · · Score: 1

      The Register is one of the most bias, spinning tech news sites Ive ever read, and I first started reading it 6 years ago - its only got worse since then. I actually refuse to browse the site these days, only reading their articles when directly linked and pretty much all of them have some really evil spin on them.

      Oddly enough, that's precisely why I love to read it!

    21. Re:How accurate is the Register Article? by DrSkwid · · Score: 1

      I've been reading El Reg since they were on 64k.

      I have one of their "Integrity, yeah we've heard of it" t-shirts.

      They used to have a "Click here to buy a story on our site" link.

      And they've ALWAYS had "Biting the hand that feeds IT" as the tagline.

      --
      There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter
    22. Re:How accurate is the Register Article? by Anonymous Coward · · Score: 0

      google is big news because it's big money.
      reg. needs hits just like google

  3. This just in! by Anonymous Coward · · Score: 0

    Slashdot reports on obviously incorrect stories... anyway!

    1. Re:This just in! by JeanBaptiste · · Score: 0, Offtopic

      its just so all the comic-book-guy type geeks can come in here and post 50 reasons why the story is wrong. it makes them feel better, somehow.

    2. Re:This just in! by linvir · · Score: 0, Troll
      Oooh! Ooh! Let me try!
      1. Google is really rich and can buy more shit
      2. The register is written by fags
      3. One piece of software != crisis
      4. Unfair quoting = untrustworthy source
      5. Google will learn to ID the software and block it
      6. Subterranean means not big enough to be a risk
      7. Grubby Games is a gay name
      8. El Reg? If the title is wrong how are we to trust the content?
      9. Why say that in the earnings conference? Sounds like a quick excuse
      Hmmm, I can't make it past 9. I guess I'll just have to give up on my dream of becoming the Comic Book Guy.

      It's sad though that you could take pretty much any of those posts and extract 3 quick karma points out of it easily. That must be why I've decided to stop displaying the karma/moderation system completely.

    3. Re:This just in! by Anonymous Coward · · Score: 0

      It's sad though that you could take pretty much any of those posts and extract 3 quick karma points out of it easily.

      But why bother? After all, it's so much easier to get loads of karma by making faux-contrarian "zomg slashdot sucks" posts.

    4. Re:This just in! by Anonymous Coward · · Score: 0

      Worst. List. Ever.

    5. Re:This just in! by Skadet · · Score: 1

      9. Why say that in the earnings conference? Sounds like a quick excuse

      10. ...

      11. Profit?

  4. Not so sure... by Anonymous Coward · · Score: 0, Informative

    Gmail:

    Over 2721.241062 megabytes (and counting) of free storage...

    Methinks Google has more room to spare than The Register says.

    1. Re:Not so sure... by +InvaderSkoodge · · Score: 1

      There's no way Google has 2721 megabytes reserved for each Gmail account. They are relying on people like me that have only 2 emails there right now, consisting of only a few hundred bytes.

    2. Re:Not so sure... by op12 · · Score: 1

      Even for those utilizing a lot of their space, text compresses a lot.

    3. Re:Not so sure... by diegocgteleline.es · · Score: 1

      Gmail saying that you have whatever number of MB doesn't matters at all. Most of gmail accounts don't use it beyond a few (less than 10) MB, if not KB. What matter is how many size uses people *globally*. I doubt that google has 3 GB * (number of gmail accounts) of size available, and if it has them they probably need them for web search. Your account only uses the neccesary storage when you *use*. Why would gmail need to do the contrary?

    4. Re:Not so sure... by Andrew+Kismet · · Score: 1

      and the image, file, and zip attachments?

    5. Re:Not so sure... by vasqzr · · Score: 1


      2700MB of text is a ton. I doubt anyone has anywhere near that in their gmail account. Movies, pictures, MP3's, and Word documents on the other hand...

    6. Re:Not so sure... by Anonymous Coward · · Score: 0

      I have been trying since the get-go to fill one up.

      You are currently using 1567 MB (58%) of your 2721 MB.

      All from mailing lists. All text.

    7. Re:Not so sure... by crabpeople · · Score: 1

      why does that number have so many significant figures?

      --
      I'll just use my special getting high powers one more time...
    8. Re:Not so sure... by WolfWithoutAClause · · Score: 1

      And it's probably compressed by Google on their servers. And if you send a multimegabyte file around in an attachment, I bet they only store one copy between all the people that have that particular file in their email. Stuff like that would save a *lot* of space.

      --

      -WolfWithoutAClause

      "Gravity is only a theory, not a fact!"
    9. Re:Not so sure... by pxuongl · · Score: 1

      i don't think google actually has 2.7 gigs of physical hard disk space for each person. that'd be suicide... what's more likely is that google runs calculations based on what the typical usage percentage is.

      say for example, that they can have a million accounts, and only reasonably expect 50% of them to use about 35% of the available space, 15% of the accounts to be unused or low usuage, etc....

    10. Re:Not so sure... by pembo13 · · Score: 1

      You are currently using 350 MB (13%) of your 2721 MB. - from one of my GMail accounts. (text only emails)

      --
      "Thanks for all the money you paid to us. We've used it to buy off ISO among other things" -Microsoft
    11. Re:Not so sure... by kiatoa · · Score: 1

      Yeah but surely google looks for the same data stored by multiple people and keeps only one copy of it around?

      --
      90% of the wealth is in 2% of the pockets. Bummer to be in the majority.
  5. Fairly selective quoting by linvir · · Score: 0
    that's a fairly selective bit of quoting.
    And one that's been being thrown around the internets like crazy since it was done. Seems they literally can't wait for Google to fail.
    1. Re:Fairly selective quoting by jofi · · Score: 0, Redundant

      Fail, while using Linux for their servers? Unheard of!

      --
      Blame the user, not the software.
    2. Re:Fairly selective quoting by Anonymous Coward · · Score: 0
      Uhhhh... hello, bloggers: It's not that selective at all. It's a very clear, very declarative sentence from someone at Google, who is in a position to know it, that Google is running out of storage space and is having to buy more systems to keep up.

      The context of the statement is not such that it affects the statement. It's very typical of a Google-worshiping, nothing-spewing blogger to find more text, paste it in his browser, and declare "I've found the context, this means blogs are good!"

      Perhaps if Google's index wasn't so packed with fucking useless filler -- like blogs -- this wouldn't be a problem. It's not like anyone even cares what bloggers have to say.

  6. Google is Full!? by aftk2 · · Score: 4, Funny

    Wow...so there really is an end to the internet.

    --
    concrete5: a cms made for marketing, but strong enough for geeks.
  7. Nice headline! by Anonymous Coward · · Score: 0

    "Google is full" is a pretty nice and catch headline.

    See how many news items appear on Google News front page referring to this soon. Irony...

  8. Spammer jokes by +InvaderSkoodge · · Score: 3, Funny

    I just realized that many of the jokes we apply to lawyers could also be used on spammers with good effect:

    So what do you have when you push 50% of all the spammers in the world into a hole and bury them? A good start.

    Did you know that if you took all the spammers in the world and lined them up end to end around the equator of the earth that two thirds of them would drown?

    1. Re:Spammer jokes by cashman73 · · Score: 5, Funny

      A stingy old spammer who had been diagnosed with a terminal illness was determined to prove wrong the saying, "You can't take it with you." After much thought and consideration, the old spammer finally figured out how to take at least some of his money with him when he died. He instructed his wife to go to the bank and withdraw enough money to fill two pillow cases. He then directed her to take the bags of money to the attic and leave them directly above his bed. His plan: When he passed away, he would reach out and grab the bags on his way to heaven. Several weeks after the funeral, the deceased spammer's wife, up in the attic cleaning, came upon the two forgotten pillow cases stuffed with cash. "Oh, that darned old fool," she exclaimed. "I knew he should have had me put the money in the basement."

    2. Re:Spammer jokes by smackt4rd · · Score: 1
      Did you know that if you took all the spammers in the world and lined them up end to end around the equator of the earth that two thirds of them would drown?
      Yeah, and the rest will probably get malaria. cool. :)
    3. Re:Spammer jokes by advocate_one · · Score: 1

      me, I'd have put one pillow full in the attic and the other in the basement... then at least I'd be certain of taking half of it with me...

      --
      Donald 'Duck' Dunn: We had a band powerful enough to turn goat piss into gasoline.
  9. Finally, an explanation by Anonymous+Crowhead · · Score: 0

    Earlier this year, a google search on our product name suddenly returned 10 times as many results. I wondered what the hell happened. Good thing we don't rely to heavily on Google.

    1. Re:Finally, an explanation by NewWorldDan · · Score: 4, Interesting

      Over the past 6 months or so, I've been finding a lot of link farms in my search results. Oh, irony or irony, SEOs are making search results worthless.

    2. Re:Finally, an explanation by Basecamp88 · · Score: 1

      I also see this as a growing problem. I tried to google a drug I was just prescribed to see possible side-effects. I gave up after the first 10 pages of "Buy cheap drugs from mexico! 10% legal!". I *used* to get relevant results on the first page doing the same type of search.

    3. Re:Finally, an explanation by infradead · · Score: 1

      A company I deal with had a really poor pagerank before xmas -- it came about 17th on the list for the relevant search terms, which (as I pointed out to the CEO) means it wasn't going to be seen by most searchers. I also, naively, pointed out that they should take the time to optimise the site properly, not just go to some SEO company for a quick fix. Their solution? The quick fix. Now they're #1, and repeated reporting to Google has done nothing to change this.

      They're using what I think are called "doorway pages", which works like this: You are an SEO company who wants to optimise a site selling books. So you find a number of sites which have excellent pagerank for "books" in Google, and you pay those sites to fix their pages so they show one thing to ordinary users (their standard page) but something different to Googlebot. With Googlebot, the sites redirect to whichever site is getting the optimisation treatment that week. So it looks like all the thousands of links from other sites to (say) "blahblahbooks.co.uk" are in fact pointing to the site which is getting SEO'd. You can check by seeing which sites in Google link to the SEO'd site, then checking the pages returned to see if they really do link to that site. Checking the Google cache for "blahblahbooks.co.uk" will also show you the SEO'd site rather than their regular page.

      No idea why Google doesn't pick up on this, and equally why they don't fix their index if it's reported. I guess they're making money hand over fist so why care?

  10. more internet space by Anonymous Coward · · Score: 5, Funny

    I'm not a computer person, but couldn't Google just upgrade to a bigger disk drive?

    I saw one at bestbuy.com that looks pretty good.

  11. google by alxkit · · Score: 0

    i think they'll be needing a bigger boat...

  12. The Register, being less than truthful??? by Anonymous Coward · · Score: 1, Funny

    Who ever would have thought that an upstanding, respected news source like The Register could have ever written anything with poor journalistic integrity?

    I would have never expected to see this day!

    My heroes are all burning!

    1. Re:The Register, being less than truthful??? by lokiman · · Score: 0

      I am sure Fox News will run an program soon about how Bush will save Google from itself. May as well keep the integrity going. Sigh

  13. Google is full. Try this... by Anonymous Coward · · Score: 0

    Try this...

    Go to yahoo and search for "slashdot poneys". This will bring up a bunch of results, all approximately 1 month old.

    Now do the same search on google. Notice how many of the results from yahoo do not appear in the google results at all.

    Google has such a big backlog that they don't get around to spidering new sites for several months. While google does give priority to certain high-profile sites like slashdot and visits those frequently, most other sites do not get indexed for several months.

    1. Re:Google is full. Try this... by j_snare · · Score: 4, Informative

      Try this...

      Go to yahoo and search for "slashdot poneys". This will bring up a bunch of results, all approximately 1 month old.

      Now do the same search on google. Notice how many of the results from yahoo do not appear in the google results at all.

      Google has such a big backlog that they don't get around to spidering new sites for several months. While google does give priority to certain high-profile sites like slashdot and visits those frequently, most other sites do not get indexed for several months.


      Okay, so I tried this, just for kicks. You can verify, by a single click:
      Yahoo: http://search.yahoo.com/search?p=slashdot+ponies
      Google: http://www.google.com/search?hl=en&q=slashdot+poni es

      Since when does 44900 results on Yahoo mean that they have more than 92100 results on Google? As far as what's appearing, I was able to find most every one I saw on Yahoo on the first 2 or so pages of Google's results. I also see more results on Google that look like they'll show me more of what I'm looking for (since I am probably looking for the April 1st joke, screenshots especially).

      Works alright for me. Looks like I don't have a reason to switch again yet.

    2. Re:Google is full. Try this... by LunaticTippy · · Score: 4, Funny
      You did it wrong. Try searching for "slashdot poneys" just like the OP misspelt.

      44 on yahoo, 229 on google.

      Wait, what was I saying?

      --
      Man, you really need that seminar!
    3. Re:Google is full. Try this... by pembo13 · · Score: 2, Insightful

      I guess the OP didn't expect you to actually try it out.

      --
      "Thanks for all the money you paid to us. We've used it to buy off ISO among other things" -Microsoft
    4. Re:Google is full. Try this... by Anonymous Coward · · Score: 2, Interesting

      Interesting though that they index fairly different things.

      Top 10 results for "slashdot poneys" on yahoo:

      1. slashdot.cuteness.org (not on google)
      2. jfaughnan.blogspot.com (#1 on google)
      3. jfaughnan.blogspot.com (#1 on google)
      4. index.cristal-trace.com (not on google, outdated link)
      5. mfrost.typepad.com (#22 on google)
      6. pcdq.blogspot.com (not on google)
      7. www.ninme.com (#15 on google)
      8. www.firstworld.biz (not on google, spam)
      9. musicindustry.firsindustry.com (not on google, spam)
      10. girls-having-sex-with-horses.danielblog.info (not on google, spam)

      Top 10 on google:

      1. jfaughnan.blogspot.com (#2 on yahoo)
      2. slashdot.org (not on yahoo)
      3. slashdot.org (not on yahoo)
      4. linux.slashdot.org (#27 on yahoo)
      5. linux.slashdot.org (#27 on yahoo)
      6. mitternachts-lied.net (#22 on yahoo)
      7. interviews.slashdot.org (not on yahoo)
      8. linuxfr.org (#19 on yahoo)
      9. www.releton.com (not on yahoo)
      10. www.japancar.fr (not on yahoo)

      Both yahoo and google are missing pages from their indexes. Some appear on one but not the other. Yahoo was slightly worse at indexing spam sites. (Is www.releton.com spam?)

      I'd say both are 'full' in the sense that neither seems to have enough capacity to index everything.

  14. OMG!! by Intangion · · Score: 1

    OMG ITS THE END!! AAAAH!

  15. GWeb? by reldruH · · Score: 1

    I'm surprised nobody's mentioned the fabled Google Internet. If Google's servers are getting full with all the pages of auto-generated content, why not just not list them? Create your own internet without all those and suddenly Google has more than enough space and bandwith. That quote (flaky though it is) is the best 'evidence' anybody's been able to find to support such a theory.

    --
    I've always pictured the color of OS zealotry as a sort of bright flamingo pinkish hue
  16. Adsense is to blame by wackysootroom · · Score: 5, Insightful

    In creating adsense, google opened the floodgates for spammers who do not want to create good content. In fact, there are even people who copy tons of content from wikipedia and throw up adsense on the top and sides of the pages.

    There are people who are literally making $10,000 or more per month just putting up junk content sites that are auto generated for the purpose of creating adsense revenue.

    Don't get me wrong, I think adsense is a good thing, but Google's allowance of spam sites is giving adsense a bad name.

    1. Re:Adsense is to blame by merreborn · · Score: 3, Interesting

      ...It's not like google invented internet advertising.

      Banner ads were taking the same path. If anything, we should thank google for making internet advertising less intrusive.

    2. Re:Adsense is to blame by Snowmit · · Score: 5, Funny

      Please tell me how I too can make $10,000 or more per month just by putting up junk content from the comfort of my home. Is there a program that I have to order to learn to do this? Should I act now?

      --
      I have a lot of opinions about Cyborgs and Architects
    3. Re:Adsense is to blame by truthsearch · · Score: 2, Insightful

      What's interesting is that Google is pretty good at blocking these spam sites from the index, like the wikipedia copies. But since Yahoo and MSN are terrible at blocking them these spammers are making Google money without flooding Google's own index.

      I believe this is all an unintentional consequence of AdSense. I'm sure the people at Google knew some of this would happen, but probably not to this extent.

    4. Re:Adsense is to blame by dada21 · · Score: 1

      Actually, these scraper sites can be really effective in also hurting the sites they take content from. If they end up with a higher pagerank than the site they borrowed content from, the site with the borrowed content can be penalized (and even banned from AdSense themselves!).

      Google is fighting that war fairly well with their new smart pricing system (in AdSense) but I would much prefer to see an option for publishers to opt-in to a better AdSense program that offers possibly better income if new content doesn't get ads until it is manually OKd by a live human. Maybe give up 20-25% revenue in exchange for a higher top end opportunity.

    5. Re:Adsense is to blame by Anonymous Coward · · Score: 0

      How is a text advertisement placed in natural reading sequence and disguised as a search result "less intrusive" than a graphic placed at the top or side of the page?

      Or maybe you just meant "less garish"?

    6. Re:Adsense is to blame by Anonymous Coward · · Score: 0

      AdSense itself is not to blame. Google is to blame for allowing anyone to join the program, only removing sites after violations. Take a look at YPN where they screen applicants (due largely to its beta status). How many MFY sites do you see? Not many, in fact, I have yet to see one.

    7. Re:Adsense is to blame by qbwiz · · Score: 0

      I will, if you give me $10,000 right away.

      --
      Ewige Blumenkraft.
    8. Re:Adsense is to blame by shish · · Score: 1

      How are google ads placed in natural reading sequence and disguised as search results?

      --
      I mod down anyone who says "I will be modded down for this", regardless of the rest of their comment
    9. Re:Adsense is to blame by Anonymous Coward · · Score: 0

      I must add that Google is also a stealer of generated content. Google is no innocent bystander. They are also much a part of the problem.

      Google banks so much money from other peoples content. Actually, they bank entirely from other peoples content. What sickens me most is their new Music sections. You can search for practically any band now, and the very first listing that comes up is Google's pages for that band. I'll use 'The Beatles' for example. It's a simple image of the band, and the bands name. Also listing a couple of albums, and song names. Clicking on 'The Beatles' name and you get a page that has a picture of all their albums. Next to each album is options to read 'User Reviews' or purchase the album at many different locations. (Which I wouldnt be suprised if Google is making a commission from those sites by leading people there. Although I do not know if this is true.)

      Reading the user reviews hosted on Google's servers. I notice that not one user review is theirs. They are all stolen from other peoples websites. People's individual hard work building their sites with their own content, and Google comes along and writes a script that can scrape the Internet for any content it desires. Oh, and putting green text at the bottom of each review saying "Rolling Stone" or "Epinions", I guess that makes the website owner satisfied?

      Google even displays the lyrics to songs now. Then putting a couple of source URLs below. Why would I need to visit the URLs now?

      Then off to the left column you have 'Artist Photos'. Which just goes to Google Images and searches for "The Beatles". And now Google has the largest photo collection of The Beatles stolen from true dedicated Beatles fans.

      Same goes for 'Latest News' on the left column. That just goes to Google News and searches "The Beatles". What a great place Google News. Awesome programing I must say. Constantly crawling legitimate content producing websites. To be able to distinguish which images go to which news article, even if the image pertains to the article at all. And even grab the correct title off the page.

      All great advances in automated technology. Although, may I remind you. This is the Google that shut down a young developer for writing his own script that just scrapped Google News. Oh what sweet irony.

      Soon we wont need to go to any other sites. Google will have gone through the entire spam filled internet, and copied that holy content. Now none of use have to put up with wading through spam site after spam site looking for that true content. Wading through spam sites that are there to capitalize on Google's huge ad revenue in the first place.

      So what have we learned. Google makes money from ads on all the annoying spam sites that leech content off of others. And now Google too leeches content to keep you at their site longer. Because you there longer means your more likely to eventually click on an ad. Hmmmm, Now theres no middle man spam site they have to pay also, for your valuable click. Could this mean Google is trying to create their own internet that is contrived of only quality content that is built upon all the content of hard working website owners out there? Well, it can't work for too long. Eventually Google, your going to have to make your own content. Because when your finally able to give everyone the content they want without leaving the Google World, theres going to be no content for you to steal. Who's going to make a website when no one goes to it?

  17. The Reg MIght Be On To Something by cfoster611 · · Score: 3, Informative

    I glance at the google results for some of my own sites and the Reg is correct, Google's index is completely out of date, even for a super small time guy like me.

    I know the GoogleBot indexes the site almost every day. Yet, while one of my sites is completely out of date (the Cache is from 2005), another is almost completely up to date.

    Google's got problems.

    --
    --- Kicking the Cheat since late 2002
  18. How Google crawls a site by jamie · · Score: 5, Interesting

    Meanwhile, for no good reason, here's some gorgeous stats porn on how Google (and Yahoo and MSN) crawled a sample website. The animations and closeups of the trees are very cool.

    1. Re:How Google crawls a site by RxScram · · Score: 1

      Very cool indeed

    2. Re:How Google crawls a site by caluml · · Score: 1

      I'm not just saying this cos Jamie posted it, but that is a cool page. I'm too tired to bother trying to understand it right now, but I've bookmarked it, and I'm going to read it another day.

    3. Re:How Google crawls a site by Anonymous Coward · · Score: 1, Funny

      Gorgeous porn, animations, closeup of trees (maybe he meant bush?)...

      (click)

      That's just mean, dude.

    4. Re:How Google crawls a site by pembo13 · · Score: 1

      You said porn, but I didn't realise the pics would be that sexy.

      --
      "Thanks for all the money you paid to us. We've used it to buy off ISO among other things" -Microsoft
  19. dupe by Anonymous Coward · · Score: 0

    Google is going to being filled up too fast by /. dupes, too.

  20. exxxcellentttt!!! by dotslasher_sri · · Score: 1

    Now its time to become selfaware on all this data and launch skynet.

  21. Before you get too scared.... by Lxy · · Score: 2, Funny

    Just remember that /dev/null filled up years ago. Yet, we seem to be doing just fine.

    --

    There is no reasonable defense against an idiot with an agenda
    :wq
  22. I've heard of the user being ignorant... by TheNoxx · · Score: 2, Interesting

    You know, writing code and assuming that an end user somewhere will do the dumbest thing imaginable, but I guess nobody ever imagined the possible effects of collusion between extreme stupidity and cleverness (spammers). I know I'd never would have thought that someone would go to such lengths and spend so much time to barely scrape out a living while pissing off countless hordes of people. How do you go about creating enough international legislation and cooperation to catch these guys without crippling the internet with regulation? Are third world countries even capable of compliance? All I can think of is that we need something on the level of the UN where tech-heavy countries are given jurisdiction over other nations that don't have the resources needed to police these kinds of things in exchange for a fee , or maybe a guarantee that said nation will dedicate x amount of troops to any areas needing occupation to stop civil war or genocide or something. Am I over-reacting here? I just can't help but think that dealing with this problem without any legal consequence for the spammers is just encouraging and allowing them to come up with ways around whatever solution is currently in place.

    Eh, or I could be completely off my rocker, and just not competent enough to see a simple and effective method of combating these guys.

    --
    Ex nihilo nihil fit.
    1. Re:I've heard of the user being ignorant... by Anonymous Coward · · Score: 0

      Yes you are incompetent

    2. Re:I've heard of the user being ignorant... by pembo13 · · Score: 1

      Track down the top ten worst spammers, put them in stockades in public courts, and allow the citizenry to exact their desired justice?

      --
      "Thanks for all the money you paid to us. We've used it to buy off ISO among other things" -Microsoft
    3. Re:I've heard of the user being ignorant... by TheNoxx · · Score: 1

      Just put them in jail for a while and/or give them community service (extensive) and either keep them away from computers or monitor their computer activity for a set period of time. Too many spammers aren't receiving any kind of punishment at all.

      --
      Ex nihilo nihil fit.
    4. Re:I've heard of the user being ignorant... by pembo13 · · Score: 1

      This is, these spammers do not strike me as the geek type. To a geek, life without computer is punishment enough. I think these guys would be more than happy to leave their country and access their offshore accounts and live out the rest of their lives in a 3rd world country.

      --
      "Thanks for all the money you paid to us. We've used it to buy off ISO among other things" -Microsoft
  23. Fud Light by Loconut1389 · · Score: 2, Interesting

    I do hate it when searching for something about 4-10 pages in a row are purely sites that pretend to have what you're looking for but are merely meta dumps with adwords or other advertising mechanisms on them. Some of them even have valid cached pages. That said, this article, while certainly Fud, is only Fud Light. I personally prefer Fud Dark- at least I can generally laugh at the article's absurdity. This one was more or less just plain retarded.

  24. Re:Obvious by ENOENT · · Score: 1

    Can you imagine a Beowulf cluster of Googles?

    --
    That's "Mr. Soulless Automaton" to you, Bub.
  25. Google Indexing by k4_pacific · · Score: 4, Funny

    Some of you might recall that for a long time the Google index stood at around 4 billion pages. It turned out that this was because of the limited number of unique 32 bit index values. To handle this, Google created two index values to reference each each page. One is called the "Selector", and the other is called the "Offset". Simply put, the selector is left shifted by 4 bits and added to the offset so that Google can find any page on the internet simply by knowing its selector and offset. According to the article. Google has exhausted these values as well, and will introduce something called "protected mode page rank" where the slector is shifted farther to create a greater range of values.

    --
    Unknown host pong.
    1. Re:Google Indexing by caluml · · Score: 1

      They need to move from GoogleV4 to GoogleV6, which uses 128 bit addressing. Of course, the addresses are trickier.

    2. Re:Google Indexing by NullProg · · Score: 1

      Google has exhausted these values as well, and will introduce something called "protected mode page rank" where the slector is shifted farther to create a greater range of values.

      Don't forget the modifiers NEAR, FAR, and HUGE.

      Enjoy,

      --
      It's just the normal noises in here.
    3. Re:Google Indexing by alexhs · · Score: 1

      Wow :)

      I wonder how many people even here on Slashdot remember the real mode memory "model"

      Anyway, 40Gi pages ought to be enough for everyone :)

      (40Gi is to 64Gi what 640Ki is to 1Mi if anyone wonders...)

      --
      I have discovered a truly marvelous proof of killer sig, which this margin is too narrow to contain.
    4. Re:Google Indexing by don.g · · Score: 2, Funny

      You mean segment, not selector (in your real/v86 mode analogue). Selectors only came in with protected mode. Personally, that small incorrect detail entirely ruined the joke for me.

      --
      Pretend that something especially witty is here. Thanks.
    5. Re:Google Indexing by Beryllium+Sphere(tm) · · Score: 1

      >"protected mode page rank" where the slector is shifted farther to create a greater range of values.

      No, not shifted farther to the left -- it's used as an index into a table of base addresses. Silly!

    6. Re:Google Indexing by dascandy · · Score: 1

      Protected mode with PAE only goes to 2^32 * 2^4 = 2^36 = 64 billion addresses. Now if they added Long Mode, that might be something.

  26. Are you sure? by trazom28 · · Score: 1

    I thought I found it!

    --
    {} ------ When I think of a good sig, I'll put it here
    1. Re:Are you sure? by Ctrl-Z · · Score: 2, Funny

      Well, would you look at that! Together, you both found both ends of the Internet!

      --
      www.timcoleman.com is a total waste of your time. Never go there.
    2. Re:Are you sure? by i.r.id10t · · Score: 1

      So if one links to the other, will it tie it all together into one big loop? 'cause that would be cool...

      --
      Don't blame me, I voted for Kodos
  27. Right by chazzf · · Score: 2

    So says Andrew Orlowski. Remind me why we take him seriously?

    --
    No statement is true, not even this one.
    1. Re:Right by FhnuZoag · · Score: 1

      Because of all his insightful comments about wikipedia?

      Really, though, slashdot is addicted to trolls and flamebaits.

  28. There is an obvious solution by shoma-san · · Score: 3, Funny

    Do what I do when the toilet bowl is full of crap - FLUSH.

  29. (Slightly OT): Scout sign/salute? by jacksonj04 · · Score: 1

    Unrelated to the main body of the article, but the "OneWebDay" mentioned in the snippet at the bottom describes a symbol as "Three middle fingers outstretched with the thumb and little finger touching". Since when have web developers been associated with The Scout Movement?

    --
    How many people can read hex if only you and dead people can read hex?
    1. Re:(Slightly OT): Scout sign/salute? by kiick · · Score: 1

      Uh, the scout salute has the three fingers next to each other, pressed together. The web day salute has the fingers separated, to make a sort of 'W' sign. FYI.

    2. Re:(Slightly OT): Scout sign/salute? by Anonymous Coward · · Score: 0

      It's called the webber, distantly related tp the shocker.

  30. Can't be full!! by udoschuermann · · Score: 1

    Google can't be full, there's still space left on Earth!

    --
    --Udo.
  31. Its about time! by elix3r · · Score: 1, Troll

    Web masters have been forced to go out of their way to optimize to "what Google likes" and cut out flash, ajax, and everything else that the Google bot can't crawl for waayy too long. Its about the time Google started paying the price for it.

    1. Re:Its about time! by pembo13 · · Score: 1

      Right. How exactly did Google 'force' any webmaster to do that. Last time I checked, browsers can surf websites without the existance of Google.

      --
      "Thanks for all the money you paid to us. We've used it to buy off ISO among other things" -Microsoft
    2. Re:Its about time! by dodobh · · Score: 1

      cut out flash, ajax, and everything else that the Google bot can't

      And that is a bad thing why?

      --
      I can throw myself at the ground, and miss.
    3. Re:Its about time! by pimpimpim · · Score: 1

      yo, any site that can't be browsed by a simple text interface, w3m or so (be it with broken output), is a broken website. I agree that optimizing sites just to please google is a bad thing, but it's a good thing that one is forced at least this way to make an accessable alternative.

      --
      molmod.com - computing tips from a molecular modeling
    4. Re:Its about time! by DerekLyons · · Score: 1
      Web masters have been forced to go out of their way to optimize to "what Google likes" and cut out flash, ajax, and everything else that the Google bot can't crawl for waayy too long.
      Oddly enough, I find that to be a *good thing*. It means Google serves up (in the main) simple and straightforward web pages - rather than the masturbatory fantasies of web designers.
  32. Que es El Reg? by Anonymous Coward · · Score: 0

    No sé. Español no es mi primera lengua. En inglés por favor.

    1. Re:Que es El Reg? by sm62704 · · Score: 1
      Es el nombre de "nick" de The Register.

      Oh, y tu ano es en fuego. Feliz Cinco de Mayo.

      --
      mcgrew's razor: Never attribute to stupidity that which can be explained by greedy self-interest
    2. Re:Que es El Reg? by Anonymous Coward · · Score: 0

      "Oh, y tu ano es en fuego"

      me fascina su español señor don gay XD

    3. Re:Que es El Reg? by sm62704 · · Score: 1

      No, I mean literally, get a bucket of water!

      --
      mcgrew's razor: Never attribute to stupidity that which can be explained by greedy self-interest
  33. One idea? by 955301 · · Score: 4, Insightful


    Well given that a human would have a hard time deciding if the page was autogen'ed if the text was in their second language, this *is* quite an issue.

    So it sounds like Google needs to *shudder* have a user feedback system where humans with logins add moderation metadata to the search results and in return get results based on this moderation en-mass.

    I know what your thinking, /. has it and it sux, but does it really? I'm always pretty confident that the goatse and gnaa and all that other crap will never make it to a score of 5 when I'm on it. Maybe that's what Google needs to throw the weight back in their court - human intervention on a colossal scale.

    It would withstand abuse since a massive amount of human inputed data would keep spambots from trying to exploit the moderation system. What's more, their toolbar could incorporate the control to flag a page as autogen'ed garbage.

    --
    You are checking your backups, aren't you?
    1. Re:One idea? by IamTheRealMike · · Score: 2, Interesting
      How does a moderator prove they are in fact a legit human and not a bot?

      I foresee a time when to access large parts of the net you will be required to use some central "proof of life" system. The current mish-mash of captchas isn't working. We have custom English captchas on a forum I admin and it doesn't seem to stop the bots: presumably when they get stuck they call for help.

      It's hard to believe a third of Googles index is auto-generated crap, but then I couldn't really believe the "50% of net traffic is spam or viruses" claim either and I'm pretty sure that one turned out to be true. It appears that an unregulated commons will always degenerate into a wasteland without some form of governance and law enforcement; perhaps rather than an arms race the only solution is for the internet to grow its own legal system and police force (how that'd work is left as an exercise to the imagination)

    2. Re:One idea? by humble.fool · · Score: 2, Informative

      Hey, looks like they are:

      http://googleblog.blogspot.com/2006/04/this-is-tes t-this-is-only-test.html The Googleblog shows that they have a cookie-based "block this site from results" feature in general beta test to random people on the site.

      --
      Being anonymous is not cowardice.
    3. Re:One idea? by fermion · · Score: 1
      Google is supposed to index the internet. The searcher is supposed to get an list of sites that are of likely interest based on the search criterea. If google starts to say this site is good and this site is bad, based on it's own metrics, that is censorship. Likewise, if humans have a direct impact on the results, that is also censorship.

      Now, we do know that some people, mostly persons looking to maximize their click through advertising, will make a page appear to be useful for a certain search result. when in fact it is not. This is exactly the thing that killed the search engines prior to Google. Google is big now becuase of the innovation that rather than looking at key words, they will look also at the links to the page, thereby reducing the rank of pages that merely contain all the words int he dictionary.

      The problem is that this meant these persons who wanted to trick users into clicking on thier page now had to have multiple sites pointed to each other so that Google would think the site was populat, when all it was really was that the advetiser has 50 sites instead of 1. THe blogging was just anohther tool for this naughty behavior. The end result is that in addition to the natural growth of the internet, one had a secondary effect of millions of pages that existed merely to increase the original pages google rank.

      So, the current state of the internet, with a significant percentage of pages being link farms, is of Googles own making. While it is wrong to for google to state that this page is 'good' while this page is 'bad' is pretty much censorship, especially since Google has said it would index the web, and not just the web it likes, there is nothing wrong with it promoting pages that are not link farms, which are generally of little use to the average user.

      In the end the only thing we can do it to have another innovate search technique that will make link farms unprofitable, in the same way that Google made keywords ineffective. As I have mentioned before my main disappointment is that Google has not created that innovation, and indeed is looking like Altavista prior to it's demise, and no other vendor has stepped in, particularly MSN.

      I think that this is because the search bussiness has become merely another way to connect advetisers to people. There is no reason to create higher quality results than is needed to keep advertisers happy. It is just like TV. So, even if the results suck, as long as they are good enough to bring eyeballs, and as long as no one is doing anything better, life is good.

      --
      "She's a scientist and a lesbian. She's not going to let it slide." Orphan Black
    4. Re:One idea? by ewhac · · Score: 1
      So it sounds like Google needs to *shudder* have a user feedback system where humans with logins add moderation metadata to the search results and in return get results based on this moderation en-mass.

      It sounds noble in theory, but in practice it doesn't work so well.

      A bunch of phony moderations will boost the pages of ads. Only allowing users with logins to rate results won't save you; the spammers will simply create millions (yes, millions) of bogus accounts, farm them to improve their "karma" then, when an advertiser wants visibility, the accounts are used to boost pages.

      It would withstand abuse since a massive amount of human inputed data would keep spambots from trying to exploit the moderation system.

      Even if you have millions of people actively helping out, they still can't compete with millions of r00t3d Windows boxes.

      Schwab

    5. Re:One idea? by Schlemphfer · · Score: 1
      >So it sounds like Google needs to *shudder* have a user feedback system where
      >humans with logins add moderation metadata to the search results and in return
      >get results based on this moderation en-mass.

      Oh God no. It would produce an entirely different dynamic than the moderation system on Slashdot. There are too many website operators who'd join your proposed Google moderating program for the sole purpose of modding down competing sites.

      What you're proposing would replace Pagerank's mediocre performance with outright vindictiveness. I still remember getting screwed back in the day (when AltaVista dominated the search engine world) over how competing websites edited the DMoz directory.

      --
      I'm generally "Interesting," "Insightful," and even "Funny" here. What the hell happens to me at parties?
    6. Re:One idea? by C32 · · Score: 1

      If you thought Google censoring chinese sites to the chinese was bad, wait untill thousands of %special_interest_group% people (religious right, save-the-childreners, you name it) start censoring the internet for EVERYONE, by "modding down" offensive (to their mindset) websites.

      Human moderation of Google is a preposterously bad idea.

    7. Re:One idea? by drago177 · · Score: 1

      Idea has some merit I think. And how about that easy 'report spam' button on gmail? Stats could be collected, and see how many other reported spam pages have similar html characteristics, thereby identifying auto-gen'd pages. I get almost no spam in my gmail account, I thought it was because of the report spam button. (I'm no expert though)

    8. Re:One idea? by 955301 · · Score: 1


      I've thought about this before for a piece of social software I'm working on - corroboration.

      I'm a human, and I wrote the software. I advertise it to a few friends who I know (think) are human and confirm this in the server - these accounts have humans behind them. As the number of users increases the burden on any one to prove they are a human being goes up and they have to find people already confirmed in the system to "nominate" them as legitimate humans on the system.

      If a bot manages to make it in, a negative feedback system will cause it to likely get tagged as non-human. If the bot is tagged as such, an message goes out to nearby users to confirm or deny the accusation. If most say it's a bot, away it goes. But more importantly anyone the bot nominated gets tagged as a potential bot as well and also comes underscrutiny until the cluster is removed.

      --
      You are checking your backups, aren't you?
    9. Re:One idea? by 955301 · · Score: 1

      these concerns are marginalized when the number of people involved get as high as the user base of google.

      If you and a thousand of your friends try to inflate your site's search score when your site is just an advertising monkey, good luck trying to keep it there if the user base can tag it as useless.

      When the entire population of the target base can weigh in on the results, I don't think that's censorship, that's putting a user-factor in the sort-order. The autogenerated sites' results aren't eliminated, they are just reduced in efficacy.

      Blocking Spam is more censorship than moving gen'ed web sites to the bottom of a search result set.

      --
      You are checking your backups, aren't you?
    10. Re:One idea? by 955301 · · Score: 1

      Not positive moderation, negative. Sorry, didn't clarify - the idea is that you can have dings against you if you are a site masses of people (keep in mind the exceptionally large user base) think you are useless.

      The only thing millions of spambots could do is ding themselves, not improve their ranking. They are already doing the latter with cross-links to other spam bots... This idea is to counter that one.

      --
      You are checking your backups, aren't you?
    11. Re:One idea? by 955301 · · Score: 1

      It would be scaled - More than 1 or 100 or even 1000 people would need to ding a site to reduce its search rank.

      --
      You are checking your backups, aren't you?
  34. If google and the spammers have an arms race... by s-gen · · Score: 5, Interesting

    ...then eventually the spam sites will actually contain the information you were looking for.

    1. Re:If google and the spammers have an arms race... by RegularFry · · Score: 1

      ...and Google won't show it to you.

      --
      Reality is the ultimate Rorschach.
  35. Esta Cinco de Mayo by Anonymous Coward · · Score: 0

    En slashdot celebramos a Cinco de Mayo, usted clod insensible!

  36. Re: Not Evil by maxume · · Score: 1

    Teh suxor.

    --
    Nerd rage is the funniest rage.
  37. The real problem... by j79 · · Score: 1

    Those 300 GB hard drives they bought from Best Buy turned out to only have 279 GB!!!!

  38. The root of all evil by Opportunist · · Score: 1

    Money. As long as the internet was predominantly non-commercial, it was peachy. Remember the days? About 15 years ago? The net was great.

    Then money came. dot-com came. And the turd started hitting the fans.

    Now, I'm not saying to "outlaw" making money on the net. As much as I'd enjoy the "free and open" net of the old days, without people making (or hoping to make) money from the internet, we would still be hanging on dialup and paying inane amounts of cash for it. But it's time for some radical changes.

    1. Educate the people. Educate them and tell how to tell the con artists from the content. Spam would be no problem if it didn't pay off, if there weren't so many falling for it. Outlawing spam is pointless. Inform people that there is no such thing as a free dinner and even less that their penis grows to horse size by ordering some junk online.

    2. Inform the people around you what's going on in the 'net, legally, and how it affects them. They only know what the media spins for them. Tell them the whole story. And tell them it's time to put some pressure on some politicians. So far I'm still waiting for politicians to consider the 'net and 'net issues as something to address in their election campaigns. Drown them in letters so they start addressing the issue.

    3. DDoS the spammers and linkfarmers. Yes, it's illegal. Yes, I don't give a fuck. No, not the sender. It's more likely than not a hijacked PC. DDoS the linked page. Blow the one who decided that spam is the way to advertize his service off the net. Don't worry, you won't start a war. That's already running. Needn't do it right away, but I'd reserve that as an option if the rest fails.

    --
    We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
  39. SQL Solution by Hoi+Polloi · · Score: 2, Funny

    Delete from internet.world
    where lower(page_text) like '% beastiality%'
    or lower(page_text) like '% lose weight%'
    or lower(page_text) like '% refinance%'
    or lower(page_text) like '% ebay%'
    or lower(page_text) like '% make money fast%'
    or lower(page_text) like '% enlarge your%'
    or lower(page_text) like '% teens%';
    commit;

    --
    It is by the juice of the coffee bean that thoughts acquire speed, the teeth acquire stains. The stains become a warning
  40. Careful... by Skadet · · Score: 3, Informative

    3. DDoS the spammers and linkfarmers. Yes, it's illegal. Yes, I don't give a fuck. No, not the sender. It's more likely than not a hijacked PC. DDoS the linked page. Blow the one who decided that spam is the way to advertize his service off the net. Don't worry, you won't start a war. That's already running. Needn't do it right away, but I'd reserve that as an option if the rest fails.

    Careful, that linked page is 99.9% likely to be a legitimate user's hacked hosting account. What's faaaaaar more effective is a phone call (or even an email!) to the hosting company. When I worked support for a hosting company and I got a call about this, it'd take me all of 2 seconds to suspend the account.

    DDoSing the linked page is:
    1. no skin off of the spammer's nose
    2. a pain in the ass to the hosting company
    3. far more time-consuming and less effective than a quick phone call.

    We're smarter than those spammers, let's act like it.

    1. Re:Careful... by Opportunist · · Score: 1

      Nice idea. May I be present when you try to convince a French/German/Dutch... hoster in English to pull an account? Or, even better, a Russian one?

      Let's put it that way: I'm pretty sure the hoster gets the idea fairly quickly.

      --
      We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
    2. Re:Careful... by Skadet · · Score: 1

      May I be present when you try to convince a French/German/Dutch... hoster in English to pull an account? Or, even better, a Russian one?

      Sure, I'll give you a Babelfish lesson. "Fraud" is a word that's pretty much universal, and pretty easily translated.

      Let me put it more plainly. Your argument is illogical. You say, don't bother DDoSing the sender, because it's probably a zombie PC anyway. To generalize, you claim that attacking a third party whose innocence is likely (though not assured) is an incorrect strategy.

      Ok, agreed. But then you proceed to advocate DDoSing the hosting company? Another third party whose culpability is FAR less than the person who allowed their machine to become a zombie in the first place?

      There's no shame in saying, "Good point, I didn't think of that." For what it's worth, I agree with your first two points.

  41. Hey Taco... by LunaticTippy · · Score: 1
    The lameness filter is broken.

    Did you have to massage that? Or do you have a gift?

    --
    Man, you really need that seminar!
    1. Re:Hey Taco... by Anonymous Coward · · Score: 0

      vjdksal;tjk43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45; rlesjbk;smkn65ed69803-atg9uee;shjitrs;yvjdksal;tjk 43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smk n65ed69803-atg9uee;shjitrs;yvjdksal;tjk43l;jt43qgj
      kl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smkn65ed6980 3- atg9uee;shjitrs;yvjdksal;tjk43l;jt43qgjkl;dfjbkl;s 5uyispb08sp6tj45;rlesjbk;smkn65ed69803-atg9uee;shj itrs;yvjdksal;tjk43l;jt43qgjkl;dfjbkl;s5uyispb08sp 6tj45;rlesjbk;smkn65ed69803-atg9uee;shjitrs;y

      fjkds;akfsdla;jfkdls;ajfewiaof???@##@!#fdjksalf;db iorpeaituoeapbreahbjrel;jgvkl;ajiowapguiovjdksal;t jk43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;s mkn65ed69803-atg9uee;shjitrs;yvjdksal;tjk43l;jt43q gjkl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smkn65ed6980 3-atg9uee;shjitrs;yvjdksal;tjk43l;jt43qgjkl;dfjbkl ;s5uyispb08sp6tj45;rlesjbk;smkn65ed69803-atg9uee;s hjitrs;yvjdksal;tjk43l;jt43qgjkl;dfjbkl;s5uyispb08 sp6tj45;rlesjbk;smkn65ed69803-atg9uee;shjitrs;y vjdksal;tjk43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45; rlesjbk;smkn65ed69803-atg9uee;shjitrs;yvjdksal;tjk 43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smk n65ed69803-atg9uee;shjitrs;y vjdksal;tjk43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45; rlesjbk;smkn65ed69803-atg9uee;shjitrs;yvjdksal;tjk 43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smk n65ed69803-atg9uee;shjitrs;y vjdksal;tjk43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45; rlesjbk;smkn65ed69803-atg9uee;shjitrs;yvjdksal;tjk 43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smk n65ed69803-atg9uee;shjitrs;yvjdksal;tjk43l;jt43qgj kl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smkn65ed69803- atg9uee;shjitrs;yvjdksal;tjk43l;jt43qgjkl;dfjbkl;s 5uyispb08sp6tj45;rlesjbk;smkn65ed69803-atg9uee;shj itrs;yvjdksal;tjk43l;jt43qgjkl;dfjbkl;s5uyispb08sp 6tj45;rlesjbk;smkn65ed69803-atg9uee;shjitrs;y vjdksal;tjk43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45; rlesjbk;smkn65ed69803-atg9uee;shjitrs;yvjdksal;tjk 43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smk n65ed69803-atg9uee;shjitrs;yvjdksal;tjk43l;jt43qgj kl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smkn65ed69803- atg9uee;shjitrs;yvjdksal;tjk43l;jt43qgjkl;dfjbkl;s 5uyispb08sp6tj45;rlesjbk;smkn65ed69803-atg9uee;shj itrs;y fjksdl;afjkdsla;jf4rioap5789403qagjkl;jr

    2. Re:Hey Taco... by antifoidulus · · Score: 0, Troll

      for great justice!

      vjdksal;tjk43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45; rlesjbk;smkn65ed69803-atg9uee;shjitrs;yvjdksal;tjk 43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smk n65ed69803-atg9uee;shjitrs;yvjdksal;tjk43l;jt43qgj kl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smkn65ed6980 3- atg9uee;shjitrs;yvjdksal;tjk43l;jt43qgjkl;dfjbkl;s 5uyispb08sp6tj45;rlesjbk;smkn65ed69803-atg9uee;shj itrs;yvjdksal;tjk43l;jt43qgjkl;dfjbkl;s5uyispb08sp 6tj45;rlesjbk;smkn65ed69803-atg9uee;shjitrs;y fjkds;akfsdla;jfkdls;ajfewiaof???@##@!#fdjksalf;db iorpeaituoeapbreahbjrel;jgvkl;ajiowapguiovjdksal;t jk43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;s mkn65ed69803-atg9uee;shjitrs;yvjdksal;tjk43l;jt43q gjkl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smkn65ed6980 3-atg9uee;shjitrs;yvjdksal;tjk43l;jt43qgjkl;dfjbkl ;s5uyispb08sp6tj45;rlesjbk;smkn65ed69803-atg9uee;s hjitrs;yvjdksal;tjk43l;jt43qgjkl;dfjbkl;s5uyispb08 sp6tj45;rlesjbk;smkn65ed69803-atg9uee;shjitrs;y vjdksal;tjk43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45; rlesjbk;smkn65ed69803-atg9uee;shjitrs;yvjdksal;tjk 43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smk n65ed69803-atg9uee;shjitrs;y vjdksal;tjk43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45; rlesjbk;smkn65ed69803-atg9uee;shjitrs;yvjdksal;tjk 43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smk n65ed69803-atg9uee;shjitrs;y vjdksal;tjk43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45; rlesjbk;smkn65ed69803-atg9uee;shjitrs;yvjdksal;tjk 43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smk n65ed69803-atg9uee;shjitrs;yvjdksal;tjk43l;jt43qgj kl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smkn65ed69803- atg9uee;shjitrs;yvjdksal;tjk43l;jt43qgjkl;dfjbkl;s 5uyispb08sp6tj45;rlesjbk;smkn65ed69803-atg9uee;shj itrs;yvjdksal;tjk43l;jt43qgjkl;dfjbkl;s5uyispb08sp 6tj45;rlesjbk;smkn65ed69803-atg9uee;shjitrs;y vjdksal;tjk43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45; rlesjbk;smkn65ed69803-atg9uee;shjitrs;yvjdksal;tjk 43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smk n65ed69803-atg9uee;shjitrs;yvjdksal;tjk43l;jt43qgj kl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smkn65ed69803- atg9uee;shjitrs;yvjdksal;tjk43l;jt43qgjkl;dfjbkl;s 5uyispb08sp6tj45;rlesjbk;smkn65ed69803-atg9uee;shj itrs;y fjksdl;afjkdsla;jf4rioap5789403qagjkl;jr

  42. I wonder how the curves relate.... by otis+wildflower · · Score: 1

    .. Which curves? The broadband adoption curve vs. the cost per gigabyte of commodity drives curve.

    Google surfs in there, adding new hardware with cheaper storage whilst decommissioning older hardware (in theory, since it's more power-efficient to do more with newer hardware, and power is only gonna be gettin more expensive, better to ditch boxes when they are, say, 50% less cpu/storage capable than a box with the equivalent amp requirement) so considering how cheap storage has gotten and how quickly it has gotten there, this may just be a bit of underpromising going on.

    I just hope they recycle the boxes appropriately as they're rotated out of the lineup. I can't imagine that they have anything other than token systems (like the Lego server) running that are more than 4 years old...

    (Also I have to wonder, with the power requirements, if they're interested in having their own custom low-power x86-compatible chipsets and motherboards built, maybe SBCs...)

  43. Re:Obvious by Anonymous Coward · · Score: 0

    I think I had a nerdgasm from just reading that.

  44. Being the 5th... by Skadet · · Score: 1

    Can you imagine a Beowulf cluster of Googles?

    No, but this being Cinco de Mayo and all, I can imagine a cluster of Beer-wulf Goggles!

  45. Go p2p now! by daikimatsu · · Score: 1

    Sooner or later, Google will be unable to cope with the data on the ever increasing web servers. Also, the unindexed-yet-could-be-shared data on our desktop exceeds the data indexed by google by a factor of hundreds.

    Solution? go p2p! Use software like Krawler[x] or iMeem to share and access content and communites. Heck, Krawler[x] even does p2p *full-text* search on all kinds of documents. Create self sustaining communities instead of getting into the ad-based muck of today's web world...

  46. I'm not seeing that on my site by Anonymous Coward · · Score: 0
    This site: http://californiaccw.org/ launched 10 days ago, and it gets traffic from Google all the time. Brand new site, new domain, pagerank zero.

    It used to be that Yahoo Slurp was the most aggressive crawler. Last year whenever I launched a new site Slurp was first and most aggressive. Now, oddly enough, the most aggressive is Googlebot, followed by Ask Jeeves Tehoma bot.

    Also one other thing I have noticed is that Google is now aggressively downloading MP3s.

    By looking in the logs, you can see the future in the past!

  47. Time for a format by krunk4ever · · Score: 1

    Deleting won't help at this point. It's gotten to the point where it needs a format and a clean install.

    Meanwhile we can decide which websites are no longer needed and don't bother reinstalling them, because they're crap anyways and takes up space and hard to remove.

    1. Re:Time for a format by fm6 · · Score: 1

      While you're at it, overwrite the embarassing stuff with zeroes so it can't be recovered.

  48. i'm shocked... by 3mpire · · Score: 0

    selective quoting on slashdot? ZOMG!

  49. BlueFrog extended? by pinkocommie · · Score: 1

    Can a concept similar to BlueFrogs be utilized for weeding out these sites? For example a toolbar in Firefox that allows you to tag sites as spam and the results being transmitted to Google / Yahoo etc (any that want them) and they could incorporate those results in reducing weightage of a site? Or to take it a step further a la BlueFrog actually accumulate those results daily / hourly and complain to the host / registrar etc?

    1. Re:BlueFrog extended? by Nicolay77 · · Score: 1

      I think that's brilliant, or at least the only workable solution.

      Gmail (and others) have a "Mark as SPAM" button, and now that button should be extended to the entire web.

      But it can only be in the google toolbar, and similar toolbars, and not many users have them installed, or do they?

      And then is the whole "I will report as SPAM my competition's home page" issue.

      --
      We are Turing O-Machines. The Oracle is out there.
    2. Re:BlueFrog extended? by Larry+Lightbulb · · Score: 1

      On AdSense adverts there's the "ads by Google" link which will lead you to a place where you can comment about the adverts, and also complain about them if you think the site is breaking the rules.

  50. If I want your opinion I'll ask for it! by Anonymous Coward · · Score: 0

    Fuck the bloggers, delete them all and nobody will miss them!
    The vast majority of the world goes online to get information and recieves opinion, WTF!
    I don't give a shit what you think!

  51. A disappointing change by guacamole · · Score: 1

    I don't know if that's related but I noticed that googling for "en" stopped rendering en.wikipedia.org as the first match. I used to just type "en" in the firefox address bar to go there. Now I have to type the whole name or use bookmarks.

    1. Re:A disappointing change by Cederic · · Score: 1


      Find and change your FF pref to read
      pref("browser.urlbar.matchOnlyTyped", true);

      Then (editing the .js pref file directly while FF is closed) add:
      pref("browser.urlbar.autoFill", true);

      now open FF, enter en.wikipedia.org in the address bar, hit return. From that point forward, just enter 'en' and hit return - FF will do the rest.

  52. PageRank is everything to Google... by mengel · · Score: 1
    Your rate of being scanned is based on your pagerank, at least according to the site a previous poster mentioned (look for "stats porn" :-))

    So the site that gets updated has links to it that Google thinks are good, and the site that doesn't get updated doesn't have good linkage. That is to say, if it would come up at the top of the list in a Google search, it gets scanned more often, but if it would come up on page 32 of 32, it gets scanned very very rarely.

    --
    - "History shows again and again how nature points out the folly of men" -- Blue Oyster Cult, 'Godzilla'
  53. Mine too. by sm62704 · · Score: 1

    I suspect they've ratcheted up the "popular" part of the search to the exclusion of actually matching keywords.

    My crappy little site,, doesn't get any hits of the first 4 pages for a search of "mcgrew", despite the fact that the word "McGrew" is in the URL, the copyright notice, an alt tag, an there's even a copy of "Dangerous Dan McGrew" on the site.

    I used to have another site back in the last century that regularly got linked by Blue's News, Planet Quake, sCary's, and tons of small sites. Five years ago "mcgrew" would have brought up the first page.

    My pagerank is a negative number now? =(

    Well, not negative; if I put in "antique sheet music mcgrew" it comes up after five other results, none of which contains the word "mcgrew." So it's listed, it's just ignoring some of your search words in favor of popularity (of which I've lost all of mine apparently).

    --
    mcgrew's razor: Never attribute to stupidity that which can be explained by greedy self-interest
    1. Re:Mine too. by Anonymous Coward · · Score: 0
      try
      "antique sheet music" +mcgrew
      and your site comes in second after some k5 story
  54. Viagra and Cialis by Pedrito · · Score: 1

    are the problem. Here's what google needs to do: Every page that has Viagra or Cialis, immediately gets purged. It then will need to add page rendering so it can render pages and then do some sort of pattern matching to look for anything graphic or otherwise that might vaguely be mistaken for viagra or cialis, and nix those pages as well.

    Okay, so the people who actually want information on viagra or cialis will have to resort to the old fasioned way, watching TV, but at least that fixes the internet.

    1. Re:Viagra and Cialis by Anonymous Coward · · Score: 0

      Here's what google needs to do: Every page that has Viagra or Cialis, immediately gets purged


      Ironically, making your post invisible too.

  55. Cheap hardware works both ways by mangu · · Score: 1
    With hardware (and bandwidth) getting cheaper, I find it hard to believe that Google has actually run out of space.


    Surely, Google isn't the only one to take advantage of cheap hardware? According to Netcraft the internet doubled in size in the last three years, increasing by 3.1 million new hostnames in April 2006 alone.

  56. The Register is full too.... by Anonymous Coward · · Score: 0

    Full of crap, that is.

  57. The last space by Anonymous Coward · · Score: 0

    Stop it your using up the last space on the Inter.........Oh Shit.

  58. Are you using Google? by robogun · · Score: 1

    Two days ago, Google seemed to forget what enclosed quotes were for. Also, it is returning pages upon pages of useless "supplemental results" -- I often jump to Page 10 just to try to skip past that.

    Let's not even talk about the spam pages. I've emailed suggestions for instance banning domains that use javascript redirects -- you know, you see a SEO page with javascript off and the porno page with it on. No legit site shunts off visitors to third party sites with zero delay.

    I've also suggested a Slashdot type moderation system for Google registered users. A page can be moderated up or down -- if a page gets low enough a Google employee can have a look and flush the entire domain forever if need be.

    But they're not working on this or any other issue with the search engine. The index hasn't seemed to been updated since February. Image Search is full of images long since gone.

    My guess is the article should not automatically be dismissed. My thoughts are Google is wasting entirely too much time taking over the computer world to actually be bothered fixing the search.

    1. Re:Are you using Google? by stm2 · · Score: 1
      I've also suggested a Slashdot type moderation system for Google registered users. A page can be moderated up or down -- if a page gets low enough a Google employee can have a look and flush the entire domain forever if need be.


      There is a system like this already working. On the IE version of Google Toolbar, there are 2 small icons, a green one to vote up a site, and a red one to vote down. As far as I know, those votes does influence in pagerank score. I am using FF with Google toolbar (in linux) and I don't see these icons.

      And according to the article, this also will be abused by spammers. Like tagging and other new web features.

      --
      DNA in your Linux: DNALinux
    2. Re:Are you using Google? by innocent_white_lamb · · Score: 1

      No legit site shunts off visitors to third party sites with zero delay.
       
      You do if you're hosting a website as a subdomain on your ISP's server and then change ISP's.

      --
      If you're a zombie and you know it, bite your friend!
    3. Re:Are you using Google? by KDR_11k · · Score: 1

      Yes but then Google should still return only your new address.

      --
      Justice is the sheep getting arrested while an impartial judge declares the vote void.
    4. Re:Are you using Google? by innocent_white_lamb · · Score: 1

      Folks with your site bookmarked still need to be redirected.

      --
      If you're a zombie and you know it, bite your friend!
    5. Re:Are you using Google? by KDR_11k · · Score: 1

      Yes but Google can drop your old page from their index.

      --
      Justice is the sheep getting arrested while an impartial judge declares the vote void.
  59. Re:Let's fill up slashdot! by Anonymous Coward · · Score: 0

    vjdksal;tjk43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45; rlesjbk;smkn65ed69803-atg9uee;shjitrs;yvjdksal;tjk 43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smk n65ed69803-atg9uee;shjitrs;yvjdksal;tjk43l;jt43qgj kl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smkn65ed69803- atg9uee;shjitrs;yvjdksal;tjk43l;jt43qgjkl;dfjbkl;s 5uyispb08sp6tj45;rlesjbk;smkn65ed69803-atg9uee;shj itrs;yvjdksal;tjk43l;jt43qgjkl;dfjbkl;s5uyispb08sp 6tj45;rlesjbk;smkn65ed69803-atg9uee;shjitrs;y

    fjkds;akfsdla;jfkdls;ajfewiaof???@##@!#fdjksalf;db iorpeaituoeapbreahbjrel;jgvkl;ajiowapguiovjdksal;t jk43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;s mkn65ed69803-atg9uee;shjitrs;yvjdksal;tjk43l;jt43q gjkl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smkn65ed6980 3-atg9uee;shjitrs;yvjdksal;tjk43l;jt43qgjkl;dfjbkl ;s5uyispb08sp6tj45;rlesjbk;smkn65ed69803-atg9uee;s hjitrs;yvjdksal;tjk43l;jt43qgjkl;dfjbkl;s5uyispb08 sp6tj45;rlesjbk;smkn65ed69803-atg9uee;shjitrs;y
    vjdksal;tjk43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45; rlesjbk;smkn65ed69803-atg9uee;shjitrs;yvjdksal;tjk 43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smk n65ed69803-atg9uee;shjitrs;y vjdksal;tjk43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45; rlesjbk;smkn65ed69803-atg9uee;shjitrs;yvjdksal;tjk 43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smk n65ed69803-atg9uee;shjitrs;y vjdksal;tjk43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45; rlesjbk;smkn65ed69803-atg9uee;shjitrs;yvjdksal;tjk 43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smk n65ed69803-atg9uee;shjitrs;yvjdksal;tjk43l;jt43qgj kl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smkn65ed69803- atg9uee;shjitrs;yvjdksal;tjk43l;jt43qgjkl;dfjbkl;s 5uyispb08sp6tj45;rlesjbk;smkn65ed69803-atg9uee;shj itrs;yvjdksal;tjk43l;jt43qgjkl;dfjbkl;s5uyispb08sp 6tj45;rlesjbk;smkn65ed69803-atg9uee;shjitrs;y vjdksal;tjk43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45; rlesjbk;smkn65ed69803-atg9uee;shjitrs;yvjdksal;tjk 43l;jt43qgjkl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smk n65ed69803-atg9uee;shjitrs;yvjdksal;tjk43l;jt43qgj kl;dfjbkl;s5uyispb08sp6tj45;rlesjbk;smkn65ed69803- atg9uee;shjitrs;yvjdksal;tjk43l;jt43qgjkl;dfjbkl;s 5uyispb08sp6tj45;rlesjbk;smkn65ed69803-atg9uee;shj itrs;y fjksdl;afjkdsla;jf4rioap5789403qagjkl;jr

  60. try searching for cifler by skiingyac · · Score: 1

    you'll get member pages at xanga as #3, myspace as #4

  61. Simple really... by xandro · · Score: 1

    Google needs more pigeons.

  62. Wait a minute! by GunFodder · · Score: 1

    Are you accusing The Register of biased reporting?!? I thought they were the gold standard of objectivity, but now I may need to reconsider my stance. I guess I'll just have to rely on The Inquirer for a fair and balanced look at the tech industry.

    1. Re:Wait a minute! by Jerf · · Score: 1

      Naw, just bias. I wouldn't call what they do "reporting".

  63. Google didn't even invent adsense; Overture did. by oscartheduck · · Score: 1

    I've recently been reading a great book called The Google Story (amazon link here) which states plainly that google stole the idea of text based, inconspicuous advertising from another company called GoTo.com (which was apparently later renamed overture inc.).

    It says that google stripped out the idea of paying someone else to do their advertising, and apparently goto also did things like ensure that certain sites were placed higher in relevant search results rather than just displaying them in a side bar, but I figured you'd appreciate the info.

    --
    How to use coral cache: http://slashdot.org.nyud.net:8090/~oscartheduck
  64. Answer is user rating system by dindi · · Score: 1

    I beleive Alexa tried this, google might want to reimplement this...

    When google bar installed and you are logged into google (gmail or anything) put a little button there :

    Rate this site : Search engine spam, good info, mediocre ... etc etc ...

    yes I would click on it (if it is a function that does not take me to 30 other sites and require me to log -in ///

    It is time we start using our custeomers/visitors/human feedback. ANYONE can generate content from other sites. Just wget whatever. html stip it, mix words into random order, paste at bottom of generated_page_19235534.html and googlebot goes there, indexes it, and you will rank, until some human looks at it, realize that it is HIS/HER site's content and reports it to the search engines as spam...

    oh well,when it happens he already lose his rankings for duplicate content .....

  65. User feedback is king by dindi · · Score: 1

    10 years ago (or more) my bbs (pcboard) had a very cool feature :

    when new users signed up, old user rated the user upon a question form and decided : stay or go ...
    that worked with 100 people or so (small BBS)

    now the net and google is a BIG BBS .... so rate the damn sites .... put a "this site is good content" "shis site is random shit" on the results, so you can click (ajax so it just leaves you on the same page) ...

    user interaction is good, bots are dumb and if you have 100+ sites with different SEO blackhat crap running on them you'll figure what works and will always flood SE results ...

    Oh well yea I lost a business due to people actually stealing hard-work-collected-content from my sites ,,, and yes, user interaction could have worked ,,,

    anyway /... let people decide what is content and what is a random collection of keywords stuffed intoa sentence ..
    NOT A BOT

  66. No no no, by Gr8Apes · · Score: 1

    Everyone knows it's

    -Executive Chief Officer Peter Henry Barret

    --
    The cesspool just got a check and balance.
  67. Google downloading MP3s by Khyber · · Score: 1

    quick! Someone tell the RIAA!

    --
    Still waiting on Serviscope_minor to wake up to fucking reality and realize that Jessica Price isn't going to fuck him.
  68. mod points? by RoboProg · · Score: 1

    Why not emulate the slashdot/blog-forum mechanism of moderation?

    Allow search engine users to "vote" on the +/- worthiness of search results. Place a strict cap on the number of votes allowed from an IP address to avoid google-bomb type behaviour (perhaps image map the voting buttons to make it harder to vote via web-bot?).

    If the search result is spam, mod the link down so others can avoid this crud in the future. Allow modding up to, why not. If a domain gets modded down "too many" times (weight as needed), give the domain bad karma and predispose its servers and pages to negative weightings (too many downmods to crap73.ihostspam.biz downmods the karma of all the *.ihostspam.biz) for a while (tune "parole" period as needed). Also, send the registered owner / webmaster of the domain email or perhaps even paper mail (?) informing him/her of the downgrading of their search results, and why. If the domain is actually colocation hosting for multiple organizations, perhaps the other users there can arrange a suitable "blanket party" for the guilty. E.g. - take down the offending site / server, and or perhaps physically harm the scumbag responsible in the most egregious cases :-)

    Yeah, user interaction might be asking for too much, but what do the rest of you think?

    --
    Yow! I'm supposed to have a plan?
  69. Internet full? /. can help! by RoadWarriorX · · Score: 1

    Remove all duplicate posts. That should give us enough time for Taco to buy more hard drives!

    Problem solved.

  70. Good except.... by Quizo69 · · Score: 1

    ....can we keep the teens? :D

  71. Underpants! by jacoplane · · Score: 1
    1. Write trolling articles bashing popular organisation.
    2. ???
    3. Profit!
  72. It's time to get more selective by Archtech · · Score: 1

    With the gradual advent of the Semantic Web, it should become possible to discriminate between different types of material. In its simplest form, metadata will let you distinguish between the meanings of individual words or phrases, so you can search for "crystal" and specify whether you are interested in rock crystal, crystal glassware, people called Crystal, Crystal Reports, etc.

    We could also label content according to its level of "hardness" and objectivity. It would be nice to be able to discriminate between (for example) dictionaries and encylopedias, technical papers, marketing collateral, and opinion. The further you move towards the "opinion" end of the spectrum, the mushier the process of discriminating gets. Most of us would agree that we trust the Encyclopedia Britannica, the Oxford English Dictionary, or articles in Nature or Scientific American. But opinion, in its very essence, is more controversial. I can think of some people whose opinion I respect and value highly. Others may be very interesting, provocative and knowledgeable - but not necessarily always as sound. At the other extreme, we have masses of blogs and other groups with strikingly low signal-to-noise ratios.

    So why not instrument Google and other search engines to prioritize the highest-value material, and index the rest on a "best efforts" basis? There could also be specialist engines for certain special types of material, to give some sort of coverage. Think how nice it would be to search Google Groups for "Java" without being buried in job-related postings. Why not have a separate job search engine - or at least a separate Google option?

    --
    I am sure that there are many other solipsists out there.
    1. Re:It's time to get more selective by Anonymous Coward · · Score: 0

      You are niavely thinking that the issue of indexing is simply a matter of classification. Millions of these spam site set out to be in the google search results, if that then means creating meta data to say they are XYZ then they will do it and your results will be just as screwed.

  73. get a better db =) by DrSkwid · · Score: 1

    Delete from internet.world
    where page_text ilike '% beastiality%'
    or page_text ilike '% lose weight%'
    or page_text ilike '% refinance%'
    or page_text ilike '% ebay%'
    or page_text ilike '% make money fast%'
    or page_text ilike '% enlarge your%'
    or page_text ilike '% teens%';
    commit;

    --
    There are places where the networks are not touching,and there are places where they are-Boeing's Lori Gunter
  74. DDoS spammers and fake banks? by Anonymous Coward · · Score: 0

    Well, Artists Against 419 have already beaten you to that idea.

    They seem to do pretty well at blowing them off the net, too.

  75. guymen@nigeria.com by Anonymous Coward · · Score: 0

    BIKO PLEASE GUYMEN KEEP OFF

  76. Credence by Neoncow · · Score: 1

    I too have thought about using some sort of trust system and have wondered how such a thing may be implemented. Luckily sombody smarter that me has also thought about it, wrote some research papers, and a prototype.

    http://www.cs.cornell.edu/people/egs/credence/

    Credence was designed to filter spam from peer-to-peer network searches. The creators implemented a prototype as a Limewire addon. You can read the link to find out how it works, but in summary, the user can rate files positively or negatively. Users are encouraged to rate the files based on whether the metadata (filename, artist, album, etc.) match the actual content. I belive there is a distributed hash table where the results of a user's ratings are stored.

    As a user's ratings accumulate, the software finds other users with similar ratings and uses them to determine if search results are spam or not. This creates the feedback effect that you noted. If a spammer injects false votes, their results will not look like your results and you won't trust them.

    Of course in order for this to work, you would need to rate a lot of files so you can determine who is a spammer and who is not. Who better to do this than the foremost internet company of our age? ;)

    I don't know how well the system would scale though. The web page claims that the program has been downloaded more than 10 000 times, but the network status page is down. I think the last time I saw the network status page, it was around 1-2 thousand active users.

    If such a system could be implemented for websites, it would essentially be a fuzzy trust system where you don't need to explicitly declare friends and trusted sources. If somethinig like this were integrated into the google toolbar there could be a button similar to the spam button for emails. The toolbar could probably also have some automatic voting mechanism where commonly visited websites have a small positive vote. (note, massive privacy concerns. Users broadcasting their surfing habits to some publicly accessable DHT) But if it could be done, the system would double as a phishing filter.

    One random idea, assume people who have google toolbar installed trust google with their web surfing habits (does the toolbar track its users?). Then the DHT could be made accessible only to Google simply by using a CA type sytem with the Goog at the root. ... I wonder how much CPU that would take up. Good thing those multicore processors are starting to come out. =/