Slashdot Mirror


Confirmed Gmail / Google App Outage

mbone writes "Earlier today there was a confirmed Google outage which got a lot of attention from network operators. From a post to NANOG after everything calmed down: 'Google ack'd a maintenance on their core network did not go as planned-Forced traffic to one peer link that was unable to handle all the traffic. Maintenance has been rolled back. Issue has been restored.' This is exactly what makes me nervous about cloud computing and data storage. It's bad enough when I screw up a config and it takes down my mail, but what about when it happens to the entire globe at once?" Several readers also point to CNET's coverage of the outage. Update: 05/14 19:25 GMT by T : CWmike adds this: "Steven J. Vaughan-Nichols writes that what may be happening is a massive DDoS attack. Based on the size of the attack that would be needed to interfere with Google, I believe that it's quite likely to be the result of an attack from the controllers of the Windows worm, Conficker. Another theory that has been put about — that the problem was due to AT&T NOC routing problems — does not appear to hold water, writes Steven." Update: 05/14 21:01 GMT by T : Google's put up a low-detail explanation on their blog that says "An error in one of our systems caused us to direct some of our web traffic through Asia, which created a traffic jam. As a result, about 14% of our users experienced slow services or even interruptions."

53 of 189 comments (clear)

  1. comments from google by JWSmythe · · Score: 5, Funny

        In comments from Google Admins, they said "oops." :)

    --
    Serious? Seriousness is well above my pay grade.
    1. Re:comments from google by TommydCat · · Score: 5, Funny

      We apologise for the fault in the comments. Those responsible have been sacked.

      --
      This comment does not necessarily represent the views and opinions of the author.
    2. Re:comments from google by Red+Flayer · · Score: 3, Informative

      Yes, but the Admins' "Oops" comments were a cover-up. This outage was not due to an error during maintenance.

      This outage was a sign that Gatus and Joba are having some success in their quest to thwart Googol the Destroyer; apparently, not only are they getting converts in their efforts to have all the world's sorcerors collaborate on the One True Operating System with Global Search, but they (or someone else) has launched an effort to delay the work on the Rite of a Million Targeted Ads by Googol the Destroyer and his acolytes.

      Who is responsible for the guerrilla attacks on Googol the Destroyer, his acolytes, and his crack team of evil underlords? How are Gatus and Joba faring on their recruitment of the world's sorcerors? What has Stallmanx been working on in his secret laboratory*? Will we ever find out what lies beneath his Beard of Druidic Prowess? Answers to these questions and more will be revealed in the next two episodes!

      Remember, kids, tune in to next week's episode (look for it Tuesday or Wednesday) of Googol the Destroyer!

      --
      "Trolls they were, but filled with the evil will of their master: a fell race..." -- J.R.R. Tolkien on Olog-hai
    3. Re:comments from google by Anonymous Coward · · Score: 4, Insightful

      Thanks for ruining it. Cultural references are MUCH funnier when not immediately followed up by some smartass going "I got the joke, guys! It's funny, see, he's paraphrasing a line from !"

      Don't do that anymore. Please.

    4. Re:comments from google by danieltdp · · Score: 4, Funny

      Ni!

      --
      -- dnl
    5. Re:comments from google by Piranhaa · · Score: 4, Funny

      I thought that what they said was more along the lines of "Hey, it's Beta!"

    6. Re:comments from google by EricTheGreen · · Score: 4, Funny

      I find your lack of faith disturbing.

    7. Re:comments from google by Anonymous Coward · · Score: 4, Insightful

      "The ability to quote is a serviceable substitute for wit." --- W. Somerset Maugham

    8. Re:comments from google by Anonymous Coward · · Score: 4, Funny

      tl;dr

    9. Re:comments from google by Veggiesama · · Score: 2, Funny

      There are FOUR lights.

    10. Re:comments from google by Ekimus · · Score: 2, Funny

      There are FOUR lights.

      but: There is no spoon!

      --
      You are not free to read this message, by doing so, you have violated my licence and are required to urinate publicly. T
  2. Google Voice Issues by 0100010001010011 · · Score: 4, Informative

    My Google voice account went all sorts of haywire.

    1) Text messages sent from the web got duplicated. One person got near 10 duplicates in quick succession. I also got duplicate messages back.
    2) My number doesn't work. If you call it you get a "Currently unavailable"
    3) A few calls that came in before the outage aren't showing up in the Received/Missed calling list.

    1. Re:Google Voice Issues by MozeeToby · · Score: 4, Interesting

      I've noticed some inconsistencies on my companies finance.google page. It seems to be giving two different values for gains and losses for the day, the one on the graph is correct but the one at the heading is not. It also lists our company as one of the related companies, something that it has never done before.

      I've got to wonder just what the hell happened here. Major and unusual issues across nearly all of Google's services? This isn't going to be good for Google's brand image.

    2. Re:Google Voice Issues by MickyTheIdiot · · Score: 5, Funny

      I noticed that when I did an images.google.com search for "hot anime chicks" it showed me a picture of Cowboy Neal.

      So yeah, it did cause issues on all of Google's services. It also soiled the inside of my trash can with puke.

  3. We have to take an deep breath... by WarwickRyan · · Score: 5, Funny

    ...and take an stroll to the great big place known as "outside".

    1. Re:We have to take an deep breath... by Just+Some+Guy · · Score: 4, Funny

      Wow, that's literally what we did. "Hey, Internet's broken. Walk?", and we went outside and got a little sun and fresh air. Thanks, Google!

      --
      Dewey, what part of this looks like authorities should be involved?
    2. Re:We have to take an deep breath... by Red+Flayer · · Score: 4, Insightful

      Or you know, do what most people do when systems go down... work on things not requiring those systems.

      What's that? All your work systems are dependent on a single outside vendor?

      Heh. Heh heh. Brilliant.

      --
      "Trolls they were, but filled with the evil will of their master: a fell race..." -- J.R.R. Tolkien on Olog-hai
  4. e-mails down? by Anonymous Coward · · Score: 5, Funny

    call me....

    1. Re:e-mails down? by PhxBlue · · Score: 2, Funny

      call me....

      I can't ... all my contacts' phone numbers are stored in GMail!

      --
      !#@%*)anks for hanging up the phone, dear.
  5. Hey Chicken Little by llZENll · · Score: 5, Funny

    And yet somehow miraculously we are all still alive. The sky is not falling!

  6. Everyone at once is better though by Nick+Ives · · Score: 3, Insightful

    When it's just your mail server down, everyone else gets annoyed at you because you're not {gett,receiv}ing mail they're {sending, expecting from} you. When the cloud is down, everyone can just chill and be thankful that they're not going to log on to find a whole stream of new emails.

    This sucks for docs though but using a completely cloud based doc solution is a bit mental. Even if you're mobile it's best to have a local copy to save on battery life.

    --
    Nick
    1. Re:Everyone at once is better though by rho · · Score: 3, Insightful

      It also sucks for the Web in general.

      Google was so fucked that a lot of pages that had Google ads, or Google Analytics were slow to load or not loading at all.

      --
      Potato chips are a by-yourself food.
    2. Re:Everyone at once is better though by Nick+Ives · · Score: 4, Insightful

      Browsers should be smarter about that. Maybe if they remembered that certain hosts are down and so stop trying to load scripts from them? They could periodically retry unreachable script-hosts in the background and then ask the user if they wanted to reload all relevant tabs.

      The problem with remotely hosted scripts isn't just limited to Google or cloud apps, it's a more general issue and browsers should be able to handle it with grace.

      --
      Nick
    3. Re:Everyone at once is better though by Krneki · · Score: 3, Insightful

      Noscript is the solution to this.

      There is a reason we don't like all the nasty stuff loading in the background.

      --
      Love many, trust a few, do harm to none.
  7. It's a feature, not a bug by ColdWetDog · · Score: 3, Insightful

    It's bad enough when I screw up a config and it takes down my mail, but what about when it happens to the entire globe at once

    If everybody goes down, nothing happens and you just go outside (beyond the doors, out into the bright white light) and enjoy your day until 'they' fix it.

    What's not to like?

    --
    Faster! Faster! Faster would be better!
    1. Re:It's a feature, not a bug by geekoid · · Score: 4, Funny

      Don't fall for it gang.
      Clearly he is trying to get you to go beyond the safe perimeter to eat your brains.

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
  8. Then download your google mail by LingNoi · · Score: 4, Insightful

    This is exactly what makes me nervous about cloud computing and data storage. It's bad enough when I screw up a config and it takes down my mail, but what about when it happens to the entire globe at once?"

    If it bothers you then use a mail client to download your mail from Google. As someone that has been using my gmail account all week I didn't even notice a problem, the whole thing seems overblown.

    1. Re:Then download your google mail by Genocaust · · Score: 4, Informative

      You don't even need to run POP to do this; just use the gears widget for offline access.

      --
      It could be that the only purpose of your life is to serve as a warning to others.
    2. Re:Then download your google mail by Botia · · Score: 2, Insightful

      The problem wasn't just mail. Any site that used Google for web statistics, mapping, or other services that Google offers was affected. For example, certain online banking systems use Google Analytics. These were affected.

    3. Re:Then download your google mail by Ash+Vince · · Score: 3, Informative

      I know it is poor form to reply to your own posts but I have just read the full article above and discovered that us in the UK seemed to be ok. Also not affected was the West coast apparently.

      Maybe someone told Google I was on holiday tomorrow and needed a nice quiet day to clear my desk :)

      --
      I dont read /. to RTFA, I read /. to offend people in ignorance.
  9. Mail Servers by Aladrin · · Score: 5, Insightful

    Having run my own mail server, and used mail servers run by companies I work for, I'll -gladly- take GMail's track record for reliability. Even with no 'guarantee', it's been a hell of a lot better than anything else I've experienced.

    And what's -really- the difference between a server going down locally that affects you and a server going down globally that affects you? Nothing.

    --
    "If you make people think they're thinking, they'll love you; But if you really make them think, they'll hate you." - DM
    1. Re:Mail Servers by ACMENEWSLLC · · Score: 5, Informative

      >>And what's -really- the difference between a server going down locally that affects you and a server going down globally that affects you? Nothing.

      Actually, I disagree. There is a difference. If it's local and I own it, I have to fix it. If it's outsourced and Google owns it, I sit back and let Google fix it. Which is nice.

      ThePlanet.com had a bad switch install a few days ago which brought down part of our cloud. Our website was down, as was our access to Google DNS gave an IP down there for Google. If you look at the last year, the cloud solution has had a better uptime than what I was providing computing in planned maintenance, patching, updates and all.

      It was nice to leave at 5pm, knowing ThePlanet would fix the switch and get us back up. And they did. It's a lot easier to gripe about the cloud being down and sit back, than to manage and fix your own local servers switches and such. When you get to managing hundreds of servers, it becomes time to know what to outsource.

    2. Re:Mail Servers by Cro+Magnon · · Score: 5, Insightful

      The flip side is, if it's local and important to me, I'll make fixing it a priority. If it's important to me and I DON'T have control, I just have to hope that it's reasonably important to whoever can fix it, or I'm screwed.

      --
      Slow down, cowboy! It has been 4 hours since you last posted. You must wait another few hours.
  10. Google vs. Twitter...Once in an e-lifetime. by geekmux · · Score: 4, Funny

    Take a good look kids. Google was down and Twitter was up. This only happens once in every 3,271 days. You probably won't see it again, at least in Twitters lifetime...

    1. Re:Google vs. Twitter...Once in an e-lifetime. by larry+bagina · · Score: 4, Funny

      not only that, but slashdot wasn't fucked up by some web 1.9998743234722415086 "improvement."

      --
      Do you even lift?

      These aren't the 'roids you're looking for.

    2. Re:Google vs. Twitter...Once in an e-lifetime. by SBrach · · Score: 5, Funny

      Using a Pentium I see.

  11. The news is not that Google went down by cwgmpls · · Score: 4, Insightful

    Anyone who has ever used or administered a mail server has experienced a mail server going down. This is not news.

    What is news is that Google Mail has been up for so long until now. And current accounts seem to indicate the outage lasted about one hour.

    One hour of down time after five years of steady service is good enough for me. It is better than any other mail server I have ever used.

    1. Re:The news is not that Google went down by teknopurge · · Score: 4, Interesting

      You mistakingly act as if this is the first time google has had an outage in 5 years. Try again. Some more too.

      Over the years there have been countless issues with google - from gmail being down to apps not working, though it tends to to affect everyone, but subsets of users.

      Some of the google issues have to do with mailboxes getting lost and reassigned, etc. If it doesn't happen to you, it doesn't count as an issue, according to your logic.

    2. Re:The news is not that Google went down by sharkey · · Score: 5, Funny

      It'll get better once it's out of Beta.

      --

      --
      "Outlook not so good." That magic 8-ball knows everything! I'll ask about Exchange Server next.
  12. blacklash against the cloud? by recharged95 · · Score: 4, Interesting
    In the end, who the F* cares if a cloud service goes down?

    If a life is not lost, there are no worries with cloud computing (hence, cloud computing should be used for non-life critical services, gmail is a perfect example).

    Of course, VCs may have lost revenue, Capitalists may sweat from loss stock trades, teenagers may lose that one twitter about how cool Miley is to them, some adult may not get that date tonight from craigslist, you may miss that one Hulu commercial, some K-12 kid may not be able to send out his homework, some college kid can't access his pirate bay music lists, or the USPoTC may miss that extra minute to promote his stimulus bill.

    In the end, I hope cloud services shows us that we are not slaves to time. The human race has advanced enough to know that already. And really, if "the cloud" is down for an hour, maybe you should go outside and enjoy the wonders of nature and peace for once, or talk to someone physically. It begs to ask the question: "can it wait?"

    1. Re:blacklash against the cloud? by againjj · · Score: 3, Interesting

      For good or for ill, the Internet has become rather important for the functioning of society, and it is only getting more so as time goes by. Compare it to any other piece of infrastructure.

      Recently here in the bay area, we lost part of the MacArther Maze (the interchange of 580, 880, and 80 on the Oakland side of the Bay Bridge). You can trivialize by saying that the tool plaza may have lost revenue, the bus line may sweat from loss of fares, some adult may not get that date tonight to the SF restaurant, you may miss that one baseball game, some K-12 kid may not be able to get to the zoo, etc., or you can recognize that the bay bridge is one DAMN IMPORTANT piece of infrastructure that makes waves if it is down.

      There is a lot that relies on cloud services, many more than you may realize. That is why there are binding QoS contracts. When something goes down, it costs money and time. While you can route around the damage, or maybe take a vacation for the day, that does not mean that failures are unimportant. When you say, "If a life is not lost, there are no worries with cloud computing", you trivialize any loss other than life. The recent housing downturn didn't cost lives, but it did cost jobs, homes, and retirement incomes, to name a few. Sorry, when a major Internet service goes down, someone had better "the F* care".

  13. Re:Big Deal by westlake · · Score: 4, Insightful

    Considering the amount of usage google sees, a minor interruption like today's issue is nothing that worries me much at all.

    But usage is precisely the point.

    I lost access to Search, News, E-Mail...

    Everything Google.

    To a casual user at home this doesn't matter - but try explaining a global blackout of Google to your boss.

    Google is the poster child for the web-based app.

    Computing in the cloud.

  14. Forcing denial of service on unrelated sites by roc97007 · · Score: 3, Interesting

    If we're talking about the same outage that caused google advertisements to hang forever this morning, it caused access to many unrelated websites to hang, including slashdot itself. This seems like a really bad single-point-of-failure issue. If a site can't display ads, shouldn't it come up anyway?

    It's bad enough that I have to wait tens of seconds for Captcha content to pop up long after a login page has loaded.

    This is starting to get annoying. If this is "cloud computing", I'd rather stay on earth.

    --
    Oliver's law of assumed responsibility: If you're seen fixing it, you will be blamed for breaking it.
  15. The "cloud" in cloud computing is the internet... by GPLDAN · · Score: 4, Interesting

    When done correctly, the "cloud" is the internet itself. Google has network design issues, some of their key services only have a couple of ingresses into Tier-1 providers:

    http://en.wikipedia.org/wiki/Tier_1_carrier

    I don't work for them, i don't hold their stock, and I am not (currently) a customer, so I have no skin in their game, but Internap as a BUSINESS MODEL, becomes more important.

    If you are a major company that comes to rely HEAVILY on Cloud Services, you want to insure that you have on-ramps into several Tier-1 providers ALL AT ONCE, without having to contract individually with 4 or 5 of them yourself. I predict more companies will mimic this model of aggregation, essentially handling the business of BGP optimization for customers, and handing customers 2 redundant pipes and saying "hey, don't worry if San Fran has an earthquake and these peering points blow up, we'll get you out via this Tier-1 backbone over to your cloud computing provider's service via this backbone within seconds. Let us handle that."

    Especially with ISPs that get into pissing matches, like when Cogent and Telia got into it, and cut each other off. If you had Cogent as your only ISP, you were screwed if you wanted to get to a bunch of Swedish sites, because Cogent's CEO was trying to play chicken over some tariff rates. The cloud computing model will no longer tolerate that, it's not just some website, it's a BUSINESS function.

    that's my take at least.

  16. Re:Big Deal by rAiNsT0rm · · Score: 4, Insightful

    Finally someone commenting with some sense. It kills me to read all the "Great Job! Google!" and "Bravo!" comments. This exposes a serious flaw in planning, design and change management of a very heavily relied upon resource.

    There is nothing to give kudos for here. Gotta love blind loyalty.

    --
    http://teasphere.wordpress.com - A little spot of tea
  17. Re:Big Deal by OverlordQ · · Score: 2, Insightful

    But usage is precisely the point.

    I lost access to Search, News, E-Mail...

    but try explaining a global blackout of Google to your boss.

    Well that sounds like your problem really. Guess you shouldn't sold him on that single point of failure idea then huh?

    --
    Your hair look like poop, Bob! - Wanker.
  18. Phone... Home... by N!NJA · · Score: 2, Funny

    strange. my Firefox 3.0.10 got somehow affected by this outage. it just refused to open! it loaded about 30Mb of data to RAM but went nowhere from there. the browser window never appeared. and i tried to re-launch it several times, but for no avail! very odd.... anyone else had problems with it? Opera -- although not able to open Google.com -- opened fine!

    Is Firefox tied to Google like E.T. was tied to Elliot?

  19. Re:Big Deal by geekoid · · Score: 2, Insightful

    OK, search and news is gone? you do know there are other services besides Google, right?

    You also assume in house systems would never go down.

    "Boss, Google is having a slow down on the west coast."

    There done.
    Unless you made a stupid promise saying a system will never go down and now are getting called out onto the carpet it should be a problem.

    --
    The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
  20. Re:Big Deal by geekoid · · Score: 2, Insightful

    "This exposes a serious flaw in planning, design and change management of a very heavily relied upon resource"

    true. But there record over all is very good.
    In house system go down as well.

    Yeah, it's a screw up. Just like anything can screw up.

    --
    The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
  21. Re:Big Deal by jdenver · · Score: 2, Interesting

    Well good thing we still had access to Twitter! ;-)

    I was following the #googlefail channel I found from the InformationWeek story and found a link to some cool response time graphs from the outage.

    There's also a really great Wired article with graphs from a Tier-1 provider showing the incredible drop in network traffic (by about 15Gbps) during the outage.

  22. Re:It must have been the Klingons by mbone · · Score: 4, Informative

    ack'ed is short for acknowledged, by way of TCP (which sends ACKs and NACKs). In the networking world, saying ACK as shorthand is pretty common.

  23. Re:The only real question is... by trouser · · Score: 2, Funny

    Seems unlikely but I'd better check.....fap fap fap fap fap...fap......faaaaaaap.
    No, the porn is still working.

    --
    Now wash your hands.
  24. I knew it... by robbrit · · Score: 3, Funny

    Somebody must have typed "google" into Google. It's the only possible explanation.