Google Apps Gets a 99.9% Guarantee

← Back to Stories (view on slashdot.org)

Google Apps Gets a 99.9% Guarantee

Posted by kdawson on Sunday November 2, 2008 @11:46AM from the outlook-cloudy-try-again dept.

David Gerard passes along a posting on Google's official blog announcing that they have extended the three-nines SLA for the Premier Edition of Google Apps from Gmail alone to also cover the Calendar, Docs, Sites, and Google Talk services. 99.9% uptime translates to 45 minutes a month of downtime, and the blog post puts this in context with Gmail's historical reliability, which has been between three and four times as good over the last year (10-15 min./mo.). It also claims, based on research by an outside group, that Gmail's historical reliability beats that of in-house hosted solutions such as Groupwise and Exchange, on average. Reader Ian Lamont adds an article in The Standard that digs down into the details of the SLA, revealing for instance that outages of less than 10 minutes aren't counted against the monthly 45 minutes.

22 of 155 comments (clear)

Umm... by Sylos · 2008-11-02 11:50 · Score: 3, Insightful

so if I have 60 1 minute downtimes, I'm keeping within the 99.9% uptime range? I call shenanigans.

--
'Number-memorizing Chinese people.'-Anon
1. Re:Umm... by Creepy+Crawler · 2008-11-02 11:53 · Score: 4, Insightful
  
  Most likely it's the time for node crash detection and load balancing to take effect.
  If service is that bad or intermittent, nobody would buy service there.
  --
  
  Mod parent up! by Anonymous Coward (Score:1) Thurs, Nov 31, @13:37
2. Re:Umm... by ILongForDarkness · 2008-11-02 12:09 · Score: 4, Interesting
  
  Well if they cache the current session locally and it is just the connection to the back end that you lose temporarily I think it would be alright. Losing data sucks. That said who uses desktop suites without a crash? "Hopefully" (not sure if that is the right word to use when referring to an outage), they manage to have the downtime clumped together and planned in non-peak hours for the region (say upgrades done first Saturday of the month at midnight or something).
  My big concern with this type of offering is it increases a companies dependence on their internet line. If your network is down not only can't retrieve files, email or browse, you now can't work on productivity software either. Essentially if your doing a job that requires a computer in this environment you can't work whenever the internet or network has a hickup. I like having something else to do in the rare instances where the network isn't working right.
  Add to that the fact that wireless/laptops are becoming of larger importance in companies (and wireless is flaky at the best of times IMHO) you're really courting disaster not just in terms of outages but in terms of accidental data loss. Say your not so gifted technologically colleague decides to walk over to your desk with their laptop to show you the spreadsheet they've been working on. They get out of range of the router that they were using and presto session time out and the chance of data loss.
Re:Wait.. by mikael_j · 2008-11-02 11:53 · Score: 5, Informative

It's called a cluster, "The cloud" is a really annoying buzzword for software as a service.
/Mikael

--
Greylisting is to SMTP as NAT is to IPv4
What about internet downtime? by Dan+East · 2008-11-02 11:54 · Score: 4, Insightful

Yes, but what is the average company's internet downtime verses their LAN downtime for a single-campus outfit?
So instead of LAN / Exchange Server (or whatever is being used) you now have LAN / WAN / Google downtime. WAN gateway downtime is probably the weakest link in the chain, so wouldn't the total downtime be greater using something internet based?

--
Better known as 318230.
1. Re:What about internet downtime? by vadim_t · 2008-11-02 13:08 · Score: 4, Insightful
  
  With an internal server, the mail you got it stays there so you can still read it, and compose replies. With an internal SMTP you can queue emails for delivery even if they don't get out (nice for laptops that may not stay around until the connection comes back). With an internal IM server you keep being able to talk to people inside the company, and can depending on the server, can queue messages until the connection comes back.
  Now if you happen to use say, gmail, then you're out of luck. You can't read your mail, can't compose replies, can't IM people in the next room. All you can do is sit there and wait for somebody to fix the problem.
2. Re:What about internet downtime? by mysidia · 2008-11-02 13:20 · Score: 3, Insightful
  
  So instead of LAN / Exchange Server (or whatever is being used) you now have LAN / WAN / Google downtime. WAN gateway downtime is probably the weakest link in the chain, so wouldn't the total downtime be greater using something internet based?
  E-mail is internet based and isn't going to work if your WAN is down, regardless (you can't e-mail anyone, or receive e-mail from other people).
  One of the costs of using a service like Google Apps is the increased need to design a proper resilient network at your site that won't go down.
  If you are multi-homed and have dual WAN links that take an independent path, with a standby router, and ensure your ISP provides redundancy, and your network is properly designed according to network industry standard and respected network equipment manufacturer's best practices: then a failure of your internet connection is unlikely.
  Much less likely than the probability of failure of a single mail server.
  The cost of internet link failure or congestion is significant for companies that rely on internet-based resources and online communications for productivity.
  For companies that conduct eCommerce, it is unthinkable to have the website going down, or to not have planned enough capacity for the network connection to meet all anticipated needs in a failure scenario. Bad connectivity is already costly, even without relying on application service providers for business apps.
  In a well-designed setup, the WAN itself should not much reduce that 99.999% figure. Although yes, there are some new failure modes introduced.
  Loss of connectivity to Google, for example, even if the network is otherwise working. Some unexpected Tier1 depeering ala. Sprint/Cogent may cause issues on rare occasion.
Re:Wait.. by Anonymous Coward · 2008-11-02 11:58 · Score: 4, Insightful

Google is a company. Saying "Google doesn't have 100% uptime" makes as much sense as saying "Microsoft takes 40 minutes to install". What specifically are you trying to say?
3 9's is meaningless without customer support by syousef · 2008-11-02 12:03 · Score: 4, Interesting

The 99.9% guarantee is great, if there's someone to talk to who'll actually look at the problem when those three 9s aren't met. Otherwise it's marketing propaganda.

--
These posts express my own personal views, not those of my employer
Server uptime is not the issue. by B5_geek · 2008-11-02 12:04 · Score: 3, Informative

The issue is your internet connection AND your ISPs connection to the world. Your connection to the world is more likely to go down before a Google cluster would. Think of how often Telco's, ISP, and major hubs go down. This is the point behind having LOCAL copies of apps/servers/services, the odds that the hub/switch dies (with nothing else inhouse to patch around) is very slim compared to the odds of internet connectivity going south.

--
"The price good men pay for indifference to public affairs is to be ruled by evil men." ~Plato (427-347 BC)
1. Re:Server uptime is not the issue. by Predius · 2008-11-02 12:12 · Score: 5, Informative
  
  As a commercial user of Google Apps, I have observed this not being the case. GMail does go down, and the cause is not our connectivity. What's worse is when there is a problem, all the 'phone support' does is tell you to post on their forums... not impressed.
2. Re:Server uptime is not the issue. by Predius · 2008-11-02 12:23 · Score: 3, Insightful
  
  Gee... you don't think I haven't brought it up, multiple times, with data? I pointed out the pitfalls before we jumped in, and we got bit. If I had control we'd be off GMail, but it's not my final decision.
  That doesn't make my observation any less salient.
Nothing has 100% uptime by EsJay · 2008-11-02 12:25 · Score: 3, Insightful

If your organization will fail without 100% email uptime - bon chance in the real world, mon friend, bon chance.

Make sure your users have a phone directory available on their local PCs (or paper copies on their cubicle walls). Have a phone tree notification system scheme in place in case the network is REALLY down.

And prepare for the troublesome PRODUCTIVITY SURGE when your users cannot reach the Internet!
1. Re:Nothing has 100% uptime by tomhudson · 2008-11-02 14:23 · Score: 4, Funny
  
  It's "bonne chance"...
  
  ... his Google Apps spellchecker only has a 99.9% SLA, you ignorant clod!
What I actually posted by David+Gerard · 2008-11-02 12:39 · Score: 4, Funny

was their claim that this is 4x less outages than on-site-maintained Exchange or GroupWise.
(Notes, of course, gets 45 minutes of uptime a year.)

--
http://rocknerd.co.uk
Wow, that's pretty terrible by yttrstein · 2008-11-02 13:02 · Score: 4, Informative

I achieved four nines (%99.99) 8 years ago with Netscape's broken mail server "Suite Spot" running on a (at the time) three year old Sun E450 with 4 gigs of RAM. As I recall, it served about 120,000 clients on a large cable network in Chicago.

This whole "new web" thing is very pretty, but it seems like about three steps back to me.
1. Re:Wow, that's pretty terrible by hax0r_this · 2008-11-02 13:44 · Score: 4, Insightful
  
  That may be true, but what you were able to achieve and what you guarantee clients you will achieve are two very different things.
Re:Wait.. by game+kid · 2008-11-02 13:02 · Score: 3, Informative

It's a King Arthur cloud, maaan. Get with the times!

--
You can hold down the "B" button for continuous firing.
Re:Wait.. by TooMuchToDo · 2008-11-02 13:10 · Score: 3, Insightful

On a related subject, next person who says "in the cloud" is going to get cockpunched. As parent said, there are no clouds, just highly available clusters.
Re:Wait.. by moosesocks · 2008-11-02 13:15 · Score: 4, Informative

There'd be no need for a Beowulf-type cluster in this case.
Have a bunch of machines running identical instances of Apache, and randomly fire requests at them individually. This balances the load, and ensures that the servers themselves aren't a single point of failure.
It's quite a bit more complicated than this in reality, although you should get the basic idea.
Beowulf is typically used for clusters that seek to emulate a supercomputer (usually for scientific number-crunching), rather than a server. For this reason, something like Google's setup would more typically be referred to as a "server farm"

--
-- If you try to fail and succeed, which have you done? - Uli's moose
Re:Wait.. by Anonymous Coward · 2008-11-02 13:21 · Score: 3, Funny

Yeah, punch those bastards. Punch 'em so hard they'll go flying up high in the sky. In the cloud, even.
Microsoft has 5 nines ... by tomhudson · 2008-11-02 14:19 · Score: 3, Funny

0.00099999.
Hey, it's five nines ... and with all the "exceptions" and bogus metrics in google's SLA, they're not offering 3 nines.