Slashdot Mirror


'Why You Should Not Use Google Cloud' (medium.com)

A user on Medium named "Punch a Server" says you should not use Google Cloud due to the "'no-warnings-given, abrupt way' they pull the plug on your entire system if they (or the machines) believe something is wrong." The user has a project running in production on Google Cloud (GCP) that is used to monitor hundreds of wind turbines and scores of solar plants scattered across 8 countries. When their project goes down, money is lost. An anonymous Slashdot reader shares the report: Early today morning (June 28, 2018) I receive an alert from Uptime Robot telling me my entire site is down. I receive a barrage of emails from Google saying there is some "potential suspicious activity" and all my systems have been turned off. EVERYTHING IS OFF. THE MACHINE HAS PULLED THE PLUG WITH NO WARNING. The site is down, app engine, databases are unreachable, multiple Firebases say I've been downgraded and therefore exceeded limits.

Customer service chat is off. There's no phone to call. I have an email asking me to fill in a form and upload a picture of the credit card and a government issued photo id of the card holder. Great, let's wake up the CFO who happens to be the card holder. What if the card holder is on leave and is unreachable for three days? We would have lost everything -- years of work -- millions of dollars in lost revenue. I fill in the form with the details and thankfully within 20 minutes all the services started coming alive. The first time this happened, we were down for a few hours. In all we lost everything for about an hour. An automated email arrives apologizing for "inconvenience" caused. Unfortunately The Machine has no understanding of the "quantum of inconvenience" caused.

321 of 508 comments (clear)

  1. Sorry, but... by Known+Nutter · · Score: 5, Insightful

    If millions of dollars are on the line, you should be running your own systems. Seriously. I'm not an IT expert, data infrastructure guy or anything. I'm just a dumb nerd, and I know that. Never trust your data to a third party when millions are at stake -- let alone critical infrastructure reliability.

    --
    Beware of the Leopard.
    1. Re:Sorry, but... by snapsnap · · Score: 5, Interesting

      As if servers doing down can't happen if you host it yourself. Also, this isn't a problem limited to Google. We've been getting a lot of emails from Amazon lately with the subject " [Retirement Notification] Amazon EC2 Instance scheduled for retirement."

    2. Re:Sorry, but... by Nutria · · Score: 5, Insightful

      As if servers doing down can't happen if you host it yourself.

      But then you're in control, instead of having to rely on some amorphous, anonymous monster that only allows communication via automated email.

      --
      "I don't know, therefore Aliens" Wafflebox1
    3. Re:Sorry, but... by El+Cubano · · Score: 4, Informative

      I came here to say essentially the same thing. Three letters come to mind: S....L....A....

      If you do not have an agreement with the provider that indicates the measures they will take to restore service in the event of an outage, and the escalating penalties on the provider if they fail to restore service according to the agreement, then you really have no business running a revenue-generating production service with that provider. Or, you don't have a business at all, just a hobby.

    4. Re:Sorry, but... by Known+Nutter · · Score: 4, Insightful

      Of course servers go down. But if they are your own, you have many more levels of control than if they're in the "cloud" -- redundancy, as an obvious example. Or at least you have keys to the goddamn data center and don't have to provide the CFO's p-card to resolve the problem.

      In the case of TFA, it sounds like a consumer-grade solution was applied to an enterprise-level problem. Maybe that's Google's fault, maybe that's the customer's fault -- sounds like there's plenty of stupid to go around.

      --
      Beware of the Leopard.
    5. Re:Sorry, but... by Anonymous Coward · · Score: 5, Insightful

      That's why you also configure a failover at a secondary location. This server punching guy sounds like a damn idiot. With millions on the line why is there no disaster recovery plan outside of "call the CFO for his credit card?"

    6. Re:Sorry, but... by oic0 · · Score: 1

      I work in banking and we host almost everything internal for security reasons and also for because of the same crap the OP is talking about. Its getting harder and harder to stay that way though. So many services you need to remain competitive these days are being offered cloud only now. Its BS!

    7. Re:Sorry, but... by aaarrrgggh · · Score: 1

      Moreover... build your solution with redundancy if you are going cloud! There are times it is very hard and/or expensive, but you need an order of magnitude better reliability/availability options if you don't have an SLA.

    8. Re: Sorry, but... by Anonymous Coward · · Score: 1

      Why is it "getting harder" to host your own solutions? Have servers and/or networking changed that drastically all of a sudden? Or is "getting harder" code for the it's harder to convince the bean counters even though there are literally thousands of examples of how third party cloud hosting can go wrong?

    9. Re:Sorry, but... by Junta · · Score: 1

      The failure modes in a hosting service are more insidious, when they don't want your services to be up, they take them down, all at once. No amount of being good about your availability zones and such will stand up to the reality that your vendor can shut down *everything*.

      Exacerbated by the reality they have so *much* activity, they can't have humans police it so instead they have automation systems that are prone to false positives.

      When you host yourself, the fundamental operational technology failure modes you can deal with. Of course if your network provider decides something similar, they can mess you up.

      In the situation given in the summary, local monitoring of each power facility would not carry the "some other company disables you" problem. This is why I get bothered when people make some cloud service a prereq for relatively simple voice recognition. That sort of workload can be handled locally nowadays, the dependency just makes the feature more fragile.

      --
      XML is like violence. If it doesn't solve the problem, use more.
    10. Re:Sorry, but... by Junta · · Score: 1

      If you do a second location, you may be inclined to go to another "availability zone" at the *same* provider, and then watch all your redundancy go poof.

      Some may say "oh that's why you should have *mulptiple* cloud vendors too", which greatly increases cost and complexity when even *one* vendor is more expensive than owning your own equipment.

      --
      XML is like violence. If it doesn't solve the problem, use more.
    11. Re:Sorry, but... by Junta · · Score: 4, Insightful

      Ah, but SLAs cost money, and part of the cloud movement has been "see, these vendors can make hosting cheaper than it used to be", precisely because all they have to do when they screw you over is say "whoops, sorry", and as such don't need to invest in *too* much resiliancy and don't have to worry too much about liability.

      --
      XML is like violence. If it doesn't solve the problem, use more.
    12. Re: Sorry, but... by Anonymous Coward · · Score: 1

      That's exactly the pitch and timber I was hearing in my head when reading it

    13. Re:Sorry, but... by umghhh · · Score: 1

      It is of course Google's fault. But also the guys who did system design as even if you can migrate the VMs in case of HW failure you cannot migrate between google servers if google thinks you are evil. What one should be doing is having multiple instances working in HA setup in case one goes off the other part of the brain goes from hot standby and takes over. If more is at stake more redundancy is needed. So in this case you need two different infrastructure subcontractors and HA done between them as well as internally with hot SBY taking over as soon as the other side dies - that is complicated. From what I have seen telecom is using similar setups of hierarchical redundancy on the process level, in machine itself and then geographical if the system is critical. On top of that the same is done in test systems. If you want security you need at least 3 systems : 1 active and 2 SBY - this mainly because in case of maintenance you lose one instance. Just remember this - high redundancy means usually less utilization - your accounting will not like it. At the end of the day you have to make calculation what you can pay and what risk does that mean.

    14. Re:Sorry, but... by Chris+Mattern · · Score: 1

      The failure modes in a hosting service are more insidious, when they don't want your services to be up, they take them down, all at once. No amount of being good about your availability zones and such will stand up to the reality that your vendor can shut down *everything*.

      Your vendor can't shut down everything if they don't *have* everything. That's why any cloud service redundancy worthy of the name will have backups sourced to other vendors.

    15. Re:Sorry, but... by Joce640k · · Score: 4, Insightful

      Google cloud isn't supposed to be enterprise grade? I bet that's news to google.

      If you read the fucking summary it doesn't say the servers went down, it says that some robot shut them down when it detected "suspicious activity". No review was done, nobody at google called the customer, nothing.

      This is clearly 100% Google's fault.

      --
      No sig today...
    16. Re:Sorry, but... by Joce640k · · Score: 1

      That's why you also configure a failover at a secondary location.

      Isn't that exactly what the Google cloud is sold as - a huge 'cloud' of servers for massive redundancy?

      Here, read the sales blurb: https://cloud.google.com/solut...

      --
      No sig today...
    17. Re: Sorry, but... by tepples · · Score: 4, Informative

      Why is it "getting harder" to host your own solutions?

      Many applications are not even available for purchase of a copy to run on your server. Instead, they are available exclusively on service as a software substitute (SaaSS) terms, namely that the application runs on an application service provider's server. One example of these is the server for any major MMO game.

    18. Re:Sorry, but... by Luthair · · Score: 1

      Unless you're an extremely large company you probably don't have your own data center so you're beholden to your colo, and could have any number of issues there too.

    19. Re:Sorry, but... by Known+Nutter · · Score: 1

      Google cloud isn't supposed to be enterprise grade? I bet that's news to google.

      Maybe it is news to Google...

      https://cloud.google.com/solut...

      Why cloud storage makes sense for small and medium businesses...

      --
      Beware of the Leopard.
    20. Re:Sorry, but... by El+Cubano · · Score: 1, Insightful

      Google cloud isn't supposed to be enterprise grade? I bet that's news to google.

      ... it says that some robot shut them down when it detected "suspicious activity". No review was done, nobody at google called the customer, nothing.

      Those two statements seem, at least to me, clearly contradictory. I can understand summarily shutting down a customer on a residential Internet connection, or a small business shared web hosting provider. However when providing an "enterprise grade" service, you should be prepared to give your customers the benefit of the doubt. About the only instances I can think of for an enterprise service to shut down a customer is if they are greatly exceeding their allocated resources and/or the activity associated with the customer is actively in the process of harming other customers. In both of those instances, though, the right thing to do is attempt to contact the customer first. Of course, if Google attempted to contact the customer and could not get a hold of him (perhaps because the contact person was listed as someone who has changed employ, or because their phone was off, etc.) we would not know that from this individual.

      This is clearly 100% Google's fault.

      Based on what we know, that seems like an accurate statement.

    21. Re:Sorry, but... by Anonymous Coward · · Score: 1

      There's redundancy in the cloud, it just looks different, TFA didn't have it obviously. Bad things would happen to the author of TFA with on site servers as well, I think we know why too.

    22. Re:Sorry, but... by AmiMoJo · · Score: 5, Insightful

      Google's cloud services are enterprise grade if you pay enterprise prices for them.

      If you pay for it on a credit card assigned to the CFO then you are not an enterprise and you are not paying enterprise prices.

      They chose a cheap, no-SLA no-support service, probably because it was cheap. Then they get upset that they aren't receiving the support they didn't pay for.

      --
      const int one = 65536; (Silvermoon, Texture.cs)
      SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
    23. Re:Sorry, but... by Darinbob · · Score: 1

      Agreed, the title should be "Why You Should Not Use The Cloud."

    24. Re:Sorry, but... by tepples · · Score: 1

      The brochure covers up the fact that a massive "cloud" at one provider doesn't help with the single point of failure that is one provider's automated policy enforcement.

    25. Re:Sorry, but... by Anonymous Coward · · Score: 5, Insightful

      As if servers doing down can't happen if you host it yourself.

      But then you're in control, instead of having to rely on some amorphous, anonymous monster that only allows communication via automated email.

      Exactly right. If your servers go down, YOU can fix them immediately and you don't have the problem of trying to get in touch with some support person who may or may not respond quickly, or may not respond at all if you can't even figure out how to get in touch with them.

      That said, this was a case of a company trying to be cheap, and instead of an enterprise account they tried to run critical applications on what is supposed to be a consumer-level account, which is why they started getting e-mails from Google about "suspicious activity".

    26. Re:Sorry, but... by rudy_wayne · · Score: 4, Insightful

      If you read the fucking summary it doesn't say the servers went down, it says that some robot shut them down when it detected "suspicious activity". No review was done, nobody at google called the customer, nothing.

      This is clearly 100% Google's fault.

      Yes, Google was a little too quick to pull the trigger. They should have sent out a couple of e-mails first before shutting things down.

      But

      The "suspicious activity" was due to the cheapfuck customer running heavy, 24/7, critical applications on what is supposed to be a consumer-level account.

      You get what you pay for.

    27. Re:Sorry, but... by Anonymous Coward · · Score: 5, Informative

      I can understand summarily shutting down a customer on a residential Internet connection, or a small business shared web hosting provider. However when providing an "enterprise grade" service, you should be prepared to give your customers the benefit of the doubt. About the only instances I can think of for an enterprise service to shut down a customer is if they are greatly exceeding their allocated resources and/or the activity associated with the customer is actively in the process of harming other customers.

      Cheapfuck customer was running enterprise-level applications on what was supposed to be a consumer-level account. That's why Google Bots detected "suspicious" actitivy and shut them down. If they hadn't tried to be cheapfucks and were using an enterprise account this most likely wouldn't have happened.

    28. Re:Sorry, but... by gweihir · · Score: 1

      I am an IT expert and you are completely correct. Mission critical infrastructure does not belong on an anonymous ElCheapo cloud that will pull crap like this. Sorry to the people affected by this, but it is 99% their fault for really messing up the decision about the appropriate infrastructure.

      --
      Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
    29. Re:Sorry, but... by gweihir · · Score: 1

      Indeed. And if things are critical, you can do something about problems, instead of dealing with some anonymous robot.

      --
      Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
    30. Re:Sorry, but... by gweihir · · Score: 5, Interesting

      Ah, yes. And that is at the core of the problem. But since this apparently happened before and they did not learn from that, my guess would be they have far larger problems than the known unreliability of cheap cloud services.

      --
      Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
    31. Re:Sorry, but... by stephanruby · · Score: 1

      Or at least you have keys to the goddamn data center and don't have to provide the CFO's p-card to resolve the problem.

      One data center, even your own, doesn't help if there is a hurricane, an earthquake, a snowstorm, a backbone fiber cut, a wildfire, a power outage, sabotage, a bug, a mistake, etc.

      You need to have contingency plans of your own, whether it's on your own premises or whether it's on the cloud.

    32. Re: Sorry, but... by Anonymous Coward · · Score: 1

      'Architect' is not a verb.

    33. Re:Sorry, but... by Zenin · · Score: 4, Insightful

      That Amazon message is very different. It's basically telling you (days or weeks in advance!) that there's a serious hardware failure and the underlying hardware needs to be pulled from service.

      It's literally as simple as a reboot to move to new hardware. You can even catch the notification easily with a CloudWatch Alarm and trigger the purpose-built auto-recovery action to do it for you the moment the instance goes into that state. Or use an AutoScale group and it'll just cycle the hardware out for you w/o any downtime or manual action. TMTOWTDI

      If you aren't living by the motto, "Everything Fails, All The Time", then you're simply doing Cloud wrong. To be fair, even if you're entirely on your own physical hardware in a datacenter...you're still Doing It Wrong if you aren't counting on your hardware to always be failing all the time.

      --
      My /. uid is better then your /. uid
    34. Re:Sorry, but... by squiggleslash · · Score: 1

      What do you mean "sorry, but?" You're agreeing with TFA.

      --
      You are not alone. This is not normal. None of this is normal.
    35. Re: Sorry, but... by Anonymous Coward · · Score: 1

      Designed properly this is not an issue. Cloud is nothing like on prem. If you are not ready then stay on prem

    36. Re: Sorry, but... by Junta · · Score: 1

      Well, one it is an oversimplification for me to imply it's always cheaper to own your own, but in a lot of cases it is cheaper. The fundamental reality is that it is renting vs. owning, and in renting the owner is having to charge to have profit above and beyond what is theoretically possible to do yourself. Of course, the gap between theory and practice is huge, but in some cases it can be close.

      Three major contributors that differ to host in house versus cloud:
      -Equipment
      -Labor
      -Real estate/locality.

      On equipment, some major enterprisey types save money largely because the cloud providers are willing to cheap out on hardware more than they did, and to skip service and warratny fees. In other words, they got traction by gouging customers ever so slightly than the "accepted" vendors. Of course, the providers of servers to those cloud datacenters are perfectly happy to sell to your business, and at small scale the delta is negligible.

      Labor savings are, well, overstated. The good news is they are taking care of the physical infrastructure, but in its place you have at least as complicated 'cloud' infrastructure to manage.

      Real estate gets to be be the more complicated situation. If your ambitions for your business are modest, then you may not derive value over having the ability to place in datacenters all around the world. If you are a pure 'software/information' play with multi-national ambitions, then it's probably cheaper to get reach through a cloud provider. If however you deal with physical goods, then you get sufficiently embroiled in all sorts of complexities in dealing with geographies anyway, that cloud provider might not be sparing you from much.

      --
      XML is like violence. If it doesn't solve the problem, use more.
    37. Re:Sorry, but... by Junta · · Score: 1

      Yes, but at the exact same time you see articles saying to "go all in with ".

      By the time you get to a multi-cloud provider, are you really getting the value for your dollar as you relinquish control? It multiplies the cost of your infrastructure to cope with an entirely new non-technical burden. Also if you are not in the US, using all of the top cloud providers (Amazon, Microsoft, Google, IBM) still puts you at risk of the government messing with your business (doesn't need to be even vaguely criminal looking, some international tariff//tax gets levied and you could be screwed). Yes those companies do their best to have somewhat sovereign subsidiaries, but the head of the company is ultimately US jursidiction...

      --
      XML is like violence. If it doesn't solve the problem, use more.
    38. Re: Sorry, but... by LynnwoodRooster · · Score: 1

      Grammar Nazi - even an AC one - should at least be correct. See definition 4.

      --
      Browsing at +1 - no ACs, I ignore their posts. So refreshing!
    39. Re:Sorry, but... by Peter+P+Peters · · Score: 1

      But then you're in control, instead of having to rely on some amorphous, anonymous monster that only allows communication via automated email.

      What you said just now sounds like my IT dept, so I'll take the cheap and easier 'amorphous, anonymous monster that only allows communication via automated email' over the expensive onsite version.

    40. Re:Sorry, but... by Peter+P+Peters · · Score: 5, Informative

      Google's cloud services are enterprise grade if you pay enterprise prices for them.

      If you pay for it on a credit card assigned to the CFO then you are not an enterprise and you are not paying enterprise prices.

      They chose a cheap, no-SLA no-support service, probably because it was cheap. Then they get upset that they aren't receiving the support they didn't pay for.

      Exactly this. I don't use Google Cloud, but my 'enterprise' AWS service comes with a dedicated account manager and architect who I can call at anytime for help.
      Sounds like this guy cheaped out then is bitching because he received cheap service.

    41. Re:Sorry, but... by RickRussellTX · · Score: 1

      And don't forget about building your own data center around that infrastructure that's going to satisfy the many specific requirements of your various government, financial, etc. customers.

    42. Re:Sorry, but... by RickRussellTX · · Score: 4, Informative

      if your servers go down, YOU can fix them immediately

      Spoken like someone who has never had to fix a server immediately :-)

      One of the strong points of cloud computing is the infrastructure to shift load to accommodate failing hardware. To reproduce that capability with your own hardware & infrastructure requires a tremendous amount of planning and capital investment: in power, servers, and network. It's almost never a simple matter of "fix the server immediately".

    43. Re:Sorry, but... by BronsCon · · Score: 1

      You mean if your multiple redundant ISPs go down; in which case, most of the world is probably offline just the same. Oh, and you have an SLA with those providers, because you're serious about your business, and they cover your losses.

      Before you spring up some bullshit about power outages... You mean multiple geographically diverse datacenters all lost grid power and exhausted their backups simultaneously? If it takes less than that to take you offline, you're doing it wrong; and if that happens, your website is, most assuredly, near the bottom of the list of things people other than you are giving a shit about.

      --
      APK quotes people (including myself) without context and should not be trusted. Just thought you should know.
    44. Re:Sorry, but... by BronsCon · · Score: 4, Informative

      Because I know someone's going to insist this is not the case and ask how you know differently, I'll step in and say it now: credit card verification.

      In the rare cases where enterprise services are paid for with a credit card, any and all verification for the $10k+/mo in charges is done when the contract is signed in person.

      --
      APK quotes people (including myself) without context and should not be trusted. Just thought you should know.
    45. Re: Sorry, but... by BronsCon · · Score: 1

      Do you know of anyone who runs a business on the basis of selling access to someone else's MMO?

      --
      APK quotes people (including myself) without context and should not be trusted. Just thought you should know.
    46. Re:Sorry, but... by ras · · Score: 2, Insightful

      If millions of dollars are on the line, you should be running your own systems.

      Apparently Netflix and friends (who use these services) don't agree. Neither do I.

      Running your own system doesn't fix anything. Yes, you won't go down because someone forgot to renew the credit card, but you will go down because of a faulty RAM chip, or an air-conditioner going out, or a ISP getting a route wrong, or a backhoe going through a cable. None of those failures are likely to take out Google as they will just move you to new working hardware in a working location - both of which they have in abundance along with well tested procedures to do the move. In fact it will probably happen automatically without you having to raise a finger.

      Comments like this betray something that appears common in the industry - people have absolutely no idea what engineer tradeoffs using the cloud involves. For example rented cloud computing resources are far more expensive per CPU cycle (perhaps an order of magnitude so) then a machine lying on the bedroom floor at home. Yet you will hear people say they use the cloud because it's cheaper. Then later you will hear them complain bitterly about how their cloud bills are breaking them.

      I don't know why they are surprised. It's not like the cloud provider has some magic cookie jar they can pull unlimited CPU cycles from. They, like you, had to buy the servers, pay for the power, pay for building it's in, pay the highly redundant internet connection they provide, and pay for people who keep the things running, pay for the accountants, tax, and all those other business overheads. They also have to make a profit. Anybody thinks that could do this and still sell you those CPU cycles for cheaper than you can make them yourself must be smoking something. The flip side of course is they are really good at doing it, you don't have to concern yourself with replacing dead disk drives, swapping out power supplies you have (hopefully) top notch staff at your disposal you didn't have to train, and don't have to pay.

      Regardless, the cloud is cheaper in the sense that it's cheaper to stay in a $500/night hotel room for a night than it is to buy an apartment for the overnight stay, but those CPU cycles aren't "cheap". But they are cheap in the sense that can't buy 1/2 an apartment and nor can you buy 1/2 a server, but you can rent a room and buy a VM that shares a machine with 1000 other VM's.

      The moral of this story isn't "he should be running his own systems". It's that if you want really high uptimes you have to design a system that wouldn't die because of a single point of failure, be that CPU, internet, power, or administrative control (ie, what happened here - someone has the power to flip the switch). But doing that is currently hard. As in you need the top software engineers on the planet type hard. And the sad bit is you can't just "buy it from Google (or some other cloud provider)" as he apparently thinks he should be able to, because that, as he had discovered, opens you up to a single point of failure when that provider flips the switch. Which makes the problem is so hard an MBA can't solve it by solve it by waving a money wand, he has to build and maintain a team of top flight software engineers. So we aren't talking just hard, we are talking really, really, really hard.

    47. Re:Sorry, but... by aaarrrgggh · · Score: 4, Informative

      I had a $2million/18 month defense contract paid by credit card via a random accounting person 2000 miles away. I would say you are not quite up with the times... $10k/month can be peanuts.

    48. Re:Sorry, but... by aaarrrgggh · · Score: 1

      Especially compartmentalization of functions. Have one server instance per location communicating to a series of redundant master instances.

    49. Re:Sorry, but... by Spazmania · · Score: 1

      You can't get high availability with any single vendor. Including yourself.

      --
      Moderating "-1, Disagree" is simple censorship. Have the guts to post your opinion.
    50. Re:Sorry, but... by Anonymous Coward · · Score: 1

      Sorry but that is plain BS. I don't know why any drivel like this would be informative because I can get enterprise level redundancy without needing an enterprise contract from anyone with just a simple credit card. The only thing you have shown is how overblown the marketing is for these services. Today you can cluster across almost all continents and ensure that you never have downtime with just a simple site. If you don't know how to accomplish this then you should get out of IT.

    51. Re:Sorry, but... by Harlequin80 · · Score: 1

      Yeah...... No..

      If I wanted 99.99999% reliability I sure as hell wouldn't be trying to do it myself. I would be going for straight up hosted solutions with Azure or AWS. I would use their location based hot redundant failover system and I would have warm failovers sitting with which ever provider wasn't my first choice.

      Geographically redundant, multi carrier connectivity, in first class data centers with redundant hardware on site ready to go with secure facility access is insanely expensive.

    52. Re:Sorry, but... by chmod+a+x+mojo · · Score: 1

      Maybe, but the bigger question is: why didn't this trigger SLA re-compensation. Why the hell is this company seemingly trusting critical enterprise computing to a place the doesn't have a guarantee for five 9's of uptime and penalties for failing that? That's the question I want answered.

      --
      To err is human; effective mayhem requires the root password!
    53. Re:Sorry, but... by lgw · · Score: 1

      As if servers doing down can't happen if you host it yourself. Also, this isn't a problem limited to Google. We've been getting a lot of emails from Amazon lately with the subject " [Retirement Notification] Amazon EC2 Instance scheduled for retirement."

      Their servers weren't down. The account was marked as fraudulent. This is a very different kind of problem, and one you don't have if you self-host.

      On the other hand, if the server was actually down in a cloud service, you can just immediately provision a new one and you're up and running. Much better that waiting for Dell to Fed-Ex you one, if it comes to that.

      Sound like their real problem was they didn't pay for support. At least with AWS or Azure, you can get a real human to talk to when shit goes wrong, you just have to pay for the privilege. Not sure Google even has the concept of customer service, but their competitors do.

      --
      Socialism: a lie told by totalitarians and believed by fools.
    54. Re: Sorry, but... by phantomfive · · Score: 1

      Real estate gets to be be the more complicated situation. If your ambitions for your business are modest, then you may not derive value over having the ability to place in datacenters all around the world. If you are a pure 'software/information' play with multi-national ambitions, then it's probably cheaper to get reach through a cloud provider. If however you deal with physical goods, then you get sufficiently embroiled in all sorts of complexities in dealing with geographies anyway, that cloud provider might not be sparing you from much.

      I think most websites hosted on AWS don't use more than one region anyway, because it's really kind of a pain. I think most don't even use more than one availability zone, and that won't protect you when a region goes down (and it will).

      --
      "First they came for the slanderers and i said nothing."
    55. Re:Sorry, but... by BronsCon · · Score: 2

      Are you a Google Cloud account manager? No? Well, then...

      --
      APK quotes people (including myself) without context and should not be trusted. Just thought you should know.
    56. Re:Sorry, but... by Sarten-X · · Score: 1

      This is the correct solution, and I'm rather disappointed in /. that it's so far down the thread. There is no substitute for a proper risk analysis, which in this case would point to having redundancy.

      Every time someone says "I moved all my infrastructure to (Google|AWS|OpenStack|Azure|closet servers)", it's invariably followed (perhaps years later) by "we lost just one little thing and our whole enterprise was crippled".

      If your whole enterprise hangs on one credit card, one person, or one service, you have a significant risk that needs to be addressed.

      --
      You do not have a moral or legal right to do absolutely anything you want.
    57. Re: Sorry, but... by Anonymous Coward · · Score: 1

      Thatâ(TM)s such small potatoes you shouldnâ(TM)t be in any cloud. I think the cost it would take to replace one bad IT guy wouldnâ(TM)t pay for the product conversion.

      All of these tiny shops with loud mouths donâ(TM)t get it. If you can do the job by hand then run it on a desktop in your basement.

      I would hang myself if my platform was so tiny as to have solutions which could be solved by me logging into a single host.

      How do you manage 20,000 servers? Now make sure that answer scales to around a 100k and you have a fraction of the east coast.

      The numbers everyone whines about here are so small as to be a joke.

      Now sadly, the salary doesnâ(TM)t scale linearly with the managed assets.

    58. Re:Sorry, but... by djinn6 · · Score: 2

      Cloud or no cloud, you need redundant systems for mission-critical software. This means at least one production system and 2 backups ready to handle the load.

      If you're running your own, then you should have hardware in 3 separate locations. If you're running in the cloud, then you better have 3 separate providers.

    59. Re:Sorry, but... by q_e_t · · Score: 2

      As if servers doing down can't happen if you host it yourself.

      But then you're in control, instead of having to rely on some amorphous, anonymous monster that only allows communication via automated email.

      If you only have a single site, then you are still at the mercy of things like a cooling systems or power failure. If you have multiple sites, then to have full redundancy of operation, you need sufficient resources at each site to take up the entire slack from a single site, and sufficient network resources to ensure consistency of databases. This is all possible, but it takes sufficient effort and planning. This can get complex if your business is also expanding rapidly, as the time taken to obtain extra resources in your own data centres can be considerable. You could go for third-party data centres, but that loses you some control, and is still not trivial to spin up quickly - i.e. you are still talking weeks to procure, sign a contract, move in, commission, and test new resources.

    60. Re:Sorry, but... by q_e_t · · Score: 1

      If millions of dollars are on the line, you should be running your own systems. Seriously. I'm not an IT expert, data infrastructure guy or anything. I'm just a dumb nerd, and I know that. Never trust your data to a third party when millions are at stake -- let alone critical infrastructure reliability.

      A better option, if you have a rapidly expanding business, might be to look at either signing assurances on the behaviour of your host (only works if you are a big enough player), or look at multiple clouds. The latter is complex in terms of ensuring deployment across multiple clouds, interoperability, and database consistency, and failover and ramp up of resources in a new location that allows you to run without interruption. In reality, you are going to have some temporary reduction in service as you ramp up on one of the alternative providers. You need to have good monitoring.

    61. Re:Sorry, but... by hey+hey+hey · · Score: 1

      In the rare cases where enterprise services are paid for with a credit card, any and all verification for the $10k+/mo in charges is done when the contract is signed in person.

      I can't speak to Google's requirements, but we signed up with AWS, and paid with a Corporate Credit Card (I think we were around $6K/month) quite happily, and only changed when we got bought, and the new overlords had their own AWS account we are now a part of. We never had to go to any Amazon site to sign anything.

    62. Re:Sorry, but... by malkir · · Score: 1

      Running your own systems is overkill these days. You can serve, highly redundantly, for less money utilizing AWS/GCP/Azure... but you gotta know how to do it. These guys don't.

    63. Re:Sorry, but... by BronsCon · · Score: 1

      Yes, and I'm sure the new overlords' AWS account is an enterprise account whereas the one you paid with a CC was not. That enterprise relationship, given the bulk purchasing it implies, also probably affords them much better rates than you were getting with your puny $6k/mo.

      --
      APK quotes people (including myself) without context and should not be trusted. Just thought you should know.
    64. Re: Sorry, but... by BronsCon · · Score: 2

      And let Microsoft decide there's something they don't like about what you're doing and shut down all your instances in all their POIs. Hmm....

      Yeah, if you're a small shop and your web presence isn't your whole business, go for it. If being taken offline means you're out of business until services are restored, perhaps you should take it a little more seriously.

      I currently host with a single provider; but, then, I'm just hosting my personal shit, so it's no big deal if it's down for a minute here and there. That provider has also gone to bat for me against other providers filing takedown notices. Don't believe me? it's cool, I was in disbelief myself; turns out the support rep who processed the notice had been bitten by that other provider *cough*GoDaddy*cough* the same way I was and, rather than knock my instance (and all the sites on it) offline, notified me of the notice and attached a counter-notice form to the email.

      I guess it comes down to choosing the right provider and getting in contact with the right people there so you name is known -- in a positive light. Even then, as I said above, if being taken offline means you're out of business until services are restored, perhaps you should take it a little more seriously.

      --
      APK quotes people (including myself) without context and should not be trusted. Just thought you should know.
    65. Re:Sorry, but... by Nutria · · Score: 1

      Obviously, the solution is to control your growth to something manageable. Otherwise, you inflate like Sun and then the balloon pops.

      --
      "I don't know, therefore Aliens" Wafflebox1
    66. Re:Sorry, but... by Askmum · · Score: 1

      The subject of this is wrong.

      "Why you should not use #randomservice that is mission critical without having a proper contract in place."

    67. Re:Sorry, but... by Greyfox · · Score: 1

      There is really a lot of reliability that can be engineered into a system if you want to justify the costs. You can multi-home your internet connection so a single provider can't cut you off from the internet, you can set up redundant systems designed to fail-over to backups. You can replicate your databases. You can put processes in place so that upgrades are installed and tested on redundant servers so that problematic upgrades can be rolled back without a customer noticing. You can hire knowledgeable people to manage those systems, with clearly defined processes and escalation procedures. It'd be fairly expensive and no one actually does all, or even much of any of that, BUT YOU COULD!

      --

      I'm trying to teach myself to set people on fire with my mind... Is it hot in here?

    68. Re:Sorry, but... by thegarbz · · Score: 1

      If millions of dollars are on the line, you should be running your own systems.

      Not quite. If millions of dollars are on the line you should be getting specialists to run your systems with very strict performance contracts and penalties in place.

      Seriously cloud based providers are the backbone of much of the Fortune 500. But the key difference is the enterprise agreements that will have people on call 24/7/365 to resolve even the tiniest problem without automatically cutting people off. A lot of people look at consumer grade cloud providers, fly by night companies, and the Google which is renound for customer service in no good ways at all, and apply that the entire concept/industry.

      Don't be like one of those people.

    69. Re:Sorry, but... by thegarbz · · Score: 1

      But if they are your own, you have many more levels of control than if they're in the "cloud"

      No you have more personal control. Your cloud provider definitely has more levels of control to deal with outages.

      Seriously though, this isn't about hardware. The major cloud providers can show that they provide uptime services that far exceed what even a well funded IT department are capable of. However the big key missing here is the contract management.

      Mission critical systems in the cloud isn't a bad idea. The redundancy both in hardware, network connections, and geography are hard to beat. However who the fuck puts a mission critical system on a service without a performance contract that provides 24/7 disaster response, or worse on a service where the provider will arbitrarily cut you off.

    70. Re:Sorry, but... by thegarbz · · Score: 1

      More importantly, with millions on the line why was a credit card even part of the discussion. Doesn't sound like an enterprise grade system to me.

    71. Re:Sorry, but... by thegarbz · · Score: 1

      This isn't about hardware. This is about someone hosting a million dollar system on a service that you pay for with an end user credit card.

    72. Re: Sorry, but... by jabuzz · · Score: 1

      Further for lots of firms even if it is a regular power cut you are mainly foobar anyway till the power comes back. So imagine you are a law firm and the power goes out. If the file server has shut down and the email is out who gives a fly f%^k anyway because everyone is going to be sitting their twiddling their thumbs till the power comes back anyway. They might have a laptop, but guess what the WiFi is down because the power is down.

      Happened to me last year at work. I had to grab a laptop, put my mobile into hotspot mode, took a peak at my HPC system to make sure the UPS and generator had kicked in and nothing had gone down, then sat back waiting for the power to return.

    73. Re:Sorry, but... by geekmux · · Score: 1

      As if servers doing down can't happen if you host it yourself.

      But then you're in control, instead of having to rely on some amorphous, anonymous monster that only allows communication via automated email.

      Control is an illusion. I know this because I've seen many IT people "in control" of their own shit and yet fail to fucking back it up properly and ultimately lose all control.

      Speaking of people with no fucking backup plan who lose all control, I read recently about someone who was running a multi-million dollar operation out of Google Cloud who failed to justify the importance of uptime and put redundant solutions into the overall design...

    74. Re:Sorry, but... by Anonymous Coward · · Score: 1

      But then you're in control

      Exactly. Downtime is expensive? Have spare machines. Possibly spare sites. Have employees on watch that can step in quickly to deal with anything from a machine-room fire to the newest zero-day exploit making the rounds. Use several different platforms, in case one of them is suddenly dealt a killing blow.

      Like to outsource to "big players" instead of having your own data centers? Negotiate contracts with uptime guarantees then! So if they don't set up with spare machines, sites & all that - if they get an outage longer than you agree too, they pay part of your losses until they're back up. They all have such deals - if you're willing to pay for them. Guarantees like that costs more though.

      Or you could go for cheap. Run of the mill Google cloud that is available for everyone. But cheap comes with "no guarantees". Maybe even "we turned things off for whatever reason". Perhaps they turned your stuff off, to allocate cpu-time for someone paying for guaranteed uptime.

    75. Re:Sorry, but... by dcw3 · · Score: 1

      Any company dumb enough not to have one deserves to go poof.

      --
      Just another day in Paradise
    76. Re:Sorry, but... by dcw3 · · Score: 1

      You can't get high availability with any single vendor. Including yourself.

      Define "high availability", and then I'll tell you why you're still wrong.

      --
      Just another day in Paradise
    77. Re:Sorry, but... by Nutria · · Score: 1

      Control is an illusion. I know this because I've seen many IT people "in control" of their own shit and yet fail to fucking back it up properly and ultimately lose all control.

      You're in control of your own fate. If then you fuck things up, it's your fault.

      --
      "I don't know, therefore Aliens" Wafflebox1
    78. Re:Sorry, but... by Cytotoxic · · Score: 2

      It is funny to read the "puny $6k /mo" denigration. When you are not exposed to these enterprise level vendors, you can really get taken aback by the scale of the thing.

      I had my first experience with this in negotiating package shipping. I was dealing with UPS and FedEx, trying to get contracts in place and looking for better pricing. In Telecommunications I had really good success, with lots of players competing for my business. We were spending $50k per year (and growing) with FedEx, so I expected a little attention from our UPS rep when I started looking for competing quotes. But they were treating us like we weren't all that important.

      When I told the story to my wife, she was really surprised. She dealt with UPS all the time and they were very responsive. Then she dropped the bomb.... "We have 3 reps from UPS on site in our warehouse"....

      Uh, what? How much do you guys spend on UPS? "Oh, it depends.... three to five million dollars a month...".

      HA! So that's what a big account looks like. Well.... no wonder they weren't knocking my door down.

      Enterprise data is the same. If you "maintain your servers" all by yourself... you really don't understand enterprise. And even enterprise is peanuts next to the really big boys.

      If you get the chance, everyone should go visit a real, hardened, world-class data center. It is amazing. Even fairly large corporate data centers are quaint in comparison. I got a tour of the NAP of the Americas in Miami as my first experience with this level of infrastructure. They spend more on power redundancy than most large corporate data centers will spend in total. They have these huge flywheels to handle the intermediate load while their enormous diesel generators crank up, instead of batteries. They have fully redundant, 2x(N+1) cooling and air handling systems. Each floor is independent and redundant. Security is beyond the pale. These are the locations that hold things like the root servers to the internet DNS system. They are not your "but we have a generator and a raised floor" data center. They are built to "even if the city is destroyed" level. Very cool to see.

    79. Re: Sorry, but... by Cytotoxic · · Score: 1

      It depends on how much you are losing.

      Law firms are notorious for being behind the times on tech (and most other business needs).

      Once you get to be a midsized company, most companies will have generators to keep the doors open in an extended outage.... at least if you are located in an area where this is a reasonable possibility.

      I was with a company in hurricane territory, so I not only developed fully redundant data centers across 3 locations (one being a colo provider), we also had a huge generator in place to run most of the company in extended power outages. It worked great when we were out of power for 2 weeks after a hurricane. I also had contracts in place for emergency office space for critical employees if the building were to be destroyed.

      But our costs were over a quarter million a day if we were down (not including payroll). And if we were down for more than 4 days we risked violating contracts that would have placed tens or hundreds of millions at risk. So the cost of setting up a $120k generator system wasn't that big of a problem.

      It all depends on what your costs of down time are. For BankAmerica or AT&T, there can be no downtime, so they pay through the nose for super-hardened systems. For a 2 man law firm, a battery backup that shuts the system down while they go hit the links waiting for the power to come back on might be sufficient.

    80. Re:Sorry, but... by coofercat · · Score: 4, Interesting

      I signed a previous client up for Google Cloud services. I did so using the company credit card. I was in touch with Google pre-sales people throughout, and spoke to some account management as well. In other words, I did it correctly, with their oversight throughout.

      What I'm saying is... you *can* pay for 'enterprise' services from Google with a credit card. Why anyone would want to is another matter, but to get invoice billing does take a few additional steps, which my client actually failed because of a change of business address at about the same time as we were doing the application. It all got resolved, as my client was in generally good standing, but I guess if your company is a bit flaky you might not pass such checks.

      By the looks of things though, this particular company was using a consumer account, which is of course the source of the problem. If it also happened more than once, you'd think they'd have learned by now.

    81. Re:Sorry, but... by Cytotoxic · · Score: 1

      At least with AWS or Azure, you can get a real human to talk to when shit goes wrong, you just have to pay for the privilege. Not sure Google even has the concept of customer service, but their competitors do.

      I think this is the salient point.

      If you are dealing with an enterprise service, you have a dedicated customer support rep and team. And you pay for this.

      Getting the "google chat was not available" problem means that you didn't have an enterprise level contract.

    82. Re: Sorry, but... by reanjr · · Score: 1

      Stop projecting.

    83. Re: Sorry, but... by reanjr · · Score: 1

      Just because your spell checker doesn't have a word, doesn't mean it's not a valid word.

    84. Re: Sorry, but... by reanjr · · Score: 1

      Building infrastructure properly will already tick all the regulatory boxes. But building infrastructure properly is expensive.

    85. Re: Sorry, but... by Cytotoxic · · Score: 1

      No, he's just using an example that people might be familiar with.

      There are many niche packages out there that used to be sold as software that you'd have a VAR set up at your site. Then they started offering a hosted service. Then they started offering "cloud based" service.

      And now they've started discontinuing the "you set up your own server in your datacenter" version.

      I'm sure most of them would still negotiate with you and set it up for a price, but the business model is definitely moving to hosted services. Support is just so much cheaper if you don't have to deal with hundreds of different individual deployments with their own hardware and software idiosyncrasies.

    86. Re:Sorry, but... by coofercat · · Score: 1

      ...or you have a provider that you've paid for a "storage SLA' with. When you point out that your storage falls below the SLA, they tell you that no, the *storage* was within SLA, just your access to it which wasn't, and the access to it is not covered by your SLA.

      (I has this problem with an EMC-backed VM hypervisor at a provider we'll call 'crapface'. The storage was running slow, and after 4 months of convincing, the provider conceded there was a problem with the 'director' CPU which was running at 100%. When I pointed a the SLA, they welched out of it because the underlying storage was all well within thresholds).

    87. Re:Sorry, but... by shaitand · · Score: 1

      They certainly can go down but if it is critical infrastructure you've layered redundancy upon redundancy upon redundant datacenters. Cascading failures do still sometimes happen but are rare. Usually if you go down these days on self-hosted model it is because you've switched from avoiding home grown like the plague and using software that has cooked for 5+yrs to home grown apps with continuous integration wherein you are always running on blazing edge alpha code.

      The cool thing about automation systems, they give you the power to effect a change across your entire farm with a push of a button, which means they give you the power to break your entire farm with the push of the button.

    88. Re:Sorry, but... by shaitand · · Score: 3, Interesting

      "One of the strong points of cloud computing is the infrastructure to shift load to accommodate failing hardware. To reproduce that capability with your own hardware & infrastructure requires a tremendous amount of planning and capital investment: in power, servers, and network. It's almost never a simple matter of "fix the server immediately"."

      Of course you need to pay for the required infrastructure. That is what "host yourself" means. Either you are paying the higher up front cost of building it out for yourself or you are paying the much higher costs to have google/amazon/digitalocean/etc do it for you. As for shifting load to accomodate failing hardware, it doesn't work as well as pitched (which is why these services go down all the time and have such high latency) but it works better and faster on a private cloud implementation. Which is where you've built a cloud stack on your own gear. Just be sure you actually have redundant controllers across the whole thing and didn't just build redundant controllers on a virtualized layer sitting on top of a not redundant system.

    89. Re:Sorry, but... by Anonymous Coward · · Score: 1

      Which you can mimic in your own environment to an extent. What I've seen as a great use of cloud services is just for real time operations. No longer term data storage. Cloud services like google and amazon have built in redundancy to hopefully make availability really high, but even that kind of service is something you should have some sort of backup if staying up is that important. You want as few single point of failure scenarios as you can.

    90. Re: Sorry, but... by shaitand · · Score: 1

      Not as expensive as paying someone else to build infrastructure properly. If your cloud solution costs anything less than dramatically more than building your own infrastructure they aren't building it properly.

    91. Re:Sorry, but... by arglebargle_xiv · · Score: 1

      Never trust your data to a third party when millions are at stake

      That's at least seventh in the list of classic blunders, the second of which is "Never go in against a Sicilian when death is on the line".

    92. Re:Sorry, but... by Junta · · Score: 1

      That is a very interesting point as well. Internally whatever 'formal' agreements may be there, the business reality has the opportunity to overrule the 'letter-of-the-law', which is a very valuable facet of having your requirements fulfilled even in the face of unexpected circumstances.

      Meanwhile in any outsourcing arrangement, they don't really care about your business success, they care whether they fall within the narrow wording of the contract. If that contract has a loophole that's easier than fixing the problem, loophole it is.

      --
      XML is like violence. If it doesn't solve the problem, use more.
    93. Re:Sorry, but... by shaitand · · Score: 1

      "Your cloud provider definitely has more levels of control to deal with outages."

      Highly doubtful, your cloud provider has heavy heavy automation which lends itself nicely to cascading automated multi-level failures and an extremely complicated solution built on top of multiple programming languages, dozens of frameworks, etc that were all known by two or three guys... in the world... and those guys left over the last 3 months due to not getting raises, other staff being laid off and their work dumped on them, and cuts to benefits like work from home flexibility and healthcare. That is if those guys really understood all those pieces they were deploying in the first place.

      Oh but it is all so automated and dumbed down that it's going to start decaying in small bits and fraying at the edges in ways that your heavily silo'd org isn't going to recognize are a problem until it's too late. Cheers.

    94. Re: Sorry, but... by shaitand · · Score: 2

      Of course cloud computing is more expensive than running your own dedicated servers... they have to buy the same equipment you do to run the service, then charge you enough to pay for it plus make a profit.

    95. Re:Sorry, but... by Junta · · Score: 1

      My point was not about hardware, it is that the purported savings of going to a cloud vendor are about acquiescing to that vendor's decisions that are likely less than what you had. Sometimes they are right and you overreacted and paid too much for something too crazy, sometimes they are cutting corners because the business arrangement doesn't require them to take care of things that you formerly had covered, but took for granted, or assumed it 'came along' or otherwise didn't think you needed to worry about it, because no one else seems to talk about worrying about it.

      Sure the fact that a business of some scope is tied to a specific personal credit card seems problematic, but then again if it is a small business, this isn't too strange.

      --
      XML is like violence. If it doesn't solve the problem, use more.
    96. Re:Sorry, but... by DaveV1.0 · · Score: 1

      As if servers doing down can't happen if you host it yourself.

      While that is true with a single site/server, a geographically disperse active-active system minimizes the risk.

      The important thing in this is that the company paying for using these cloud systems has zero control and zero warning. They are completely at the mercy of Google and it's algorithms.

      --
      There is no "-1 offended" or "-1 you don't agree with me" mod options for a reason.
    97. Re: Sorry, but... by Reverend+Green · · Score: 1

      You don't need an "enterprise contact" to get support from AWS. You just have to pay a monthly fee. Works just fine with a card paid account - no salesmen, lawyers, or "enterprise" crap required.

      Even if you're not paying for support, you can still get AWS customer service with a live person on the phone. They can't/won't help you with tech problems. But this was not a technical issue, it was an account issue.

    98. Re:Sorry, but... by shaitand · · Score: 1

      "Then you shouldn't blindly be advocating "roll your own"."

      No but he is right, mostly he is right because "cloud" solutions are comprised of about 40+ layers of roll your own always run ultra bleeding edge code continuous integration solutions on top of frameworks that are the same bleeding edge crap. When you move your solid in house solution to one it improves your stability in the same way bundling a bunch of bad mortgages together to "diversify" improves the stability of your bad investments... it doesn't.

    99. Re:Sorry, but... by shaitand · · Score: 1

      "Running your own system doesn't fix anything. Yes, you won't go down because someone forgot to renew the credit card, but you will go down because of a faulty RAM chip, or an air-conditioner going out, or a ISP getting a route wrong, or a backhoe going through a cable. None of those failures are likely to take out Google as they will just move you to new working hardware in a working location - both of which they have in abundance along with well tested procedures to do the move. In fact it will probably happen automatically without you having to raise a finger."

      There is no failover capability a cloud provider can offer that you can't build in-house, and have quality control while doing it.

    100. Re:Sorry, but... by Goldsmith · · Score: 1

      When you accept credit cards for payment, you pay the credit card company a few % of the transaction. The exact amount depends on your contract. You (or someone) likely paid at least $50k for the convenience of taking that credit card order. What a waste.

    101. Re: Sorry, but... by Reverend+Green · · Score: 1

      The credit card is a red herring. The real issue is what appears to be Google's cavalier attitude toward cancelling their customer's hosting without warning or even notification, and their complete lack of reachable customer service representatives.

      The aggrieved party here did not need or expect an enterprise SLA. Rather they merely expected a minimum level of professionalism from their hosting provider. Alas, it seems Big Brother Google was too busy snooping on the world to provide basic service to their paying customers.

    102. Re: Sorry, but... by drinkypoo · · Score: 1

      Of course cloud computing is more expensive than running your own dedicated servers

      Doesn't that depend on your utilization patterns? If you mostly utilize a little bit of hardware but occasionally use a whole bunch for short periods, can't you still save money using cloud computing?

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    103. Re: Sorry, but... by BronsCon · · Score: 1

      No, he's just using an example that people might be familiar with.

      And I was just pointing out that his example was horrible and didn't really prove his point.

      --
      APK quotes people (including myself) without context and should not be trusted. Just thought you should know.
    104. Re:Sorry, but... by BronsCon · · Score: 1

      This is what people don't understand, because they've never seen it. To most folks who think $6k/mo is enterprise, a datacenter is a couple racks in a broom closed with water dripping from the ceiling and maybe a UPS that probably isn't even name brand and only has the one "critical" server plugged into it. I don't know where these people think "the cloud" resides, but the existence of a place like that is just unfathomable to them.

      That, of course, is why most people shouldn't manage servers; and the ones who can't also won't get the cloud right and shouldn't be doing that, either. They should leave it to people like us who know what things look like behind the scenes and how things are likely to break, so we can architect solutions to avoid those problems.

      --
      APK quotes people (including myself) without context and should not be trusted. Just thought you should know.
    105. Re: Sorry, but... by thegarbz · · Score: 1

      The credit card is a red herring. The real issue is what appears to be Google's cavalier attitude toward cancelling their customer's hosting

      The credit card thing is not a red herring. Any cloud provider giving a service worthy of $1m / day if it goes doesn't receive payment via credit card. The credit card bit here is evidence that someone bought something very cheap and nasty. No customer service is more evidence of that.

      The aggrieved party here did not need or expect an enterprise SLA

      I didn't say enterprise SLA. SLA come in all shapes and sizes, and cover all sorts of things. Even small companies should have a SLA in the contract to cover basic things like customer service escalation and phone numbers.

      Alas, it seems Big Brother Google was too busy snooping on the world to provide basic service to their paying customers.

      Just to play on your words here, "basic service" is exactly what is being provided here. You want something more, pay more.

    106. Re:Sorry, but... by thegarbz · · Score: 1

      Sure the fact that a business of some scope is tied to a specific personal credit card seems problematic

      It's not problematic, it's evidence of a business that isn't buying something worthy of the alleged $millions worth of loss. Likewise that they don't seem to have a SLA in place that at least offers 24h customer support contact numbers.

    107. Re: Sorry, but... by shaitand · · Score: 1

      There are usage patterns where it can work this way for small entities and a few niche needs like scientific computing on a temporary basis but in practice is almost always ends up costing more because this usage pattern is extremely uncommon in the real world. An enterprise scale organization is going to have many departments and solutions across it and there are quite a few that pretty much every enterprise is going to have along with the load requirements that go with them.

      At a large organization you can take advantage of the same thing the cloud host is, virtual resources, to more efficiently use your resources. At a blush virtualization actually costs resources in abstraction layers but a large organization needs a highly redundant infrastructure you will have a lot of "warm" resources sitting idle, you should always make sure that your typical day-to-day demands don't exceed 50% utilization which means you have 50% sitting idle to absorb peaks like this and that is just production. You also need a staging environment where you can test successful deployment and stress test that should be a complete mirror of your prod environment, these resources can be used for other things outside of these test windows. Not mention your actual dev environments where there is any significant time savings to be had from being able to spin up fresh instances on the fly. By the time you've built all these and built out the hardware you need to make sure none of them go above 50% utilization, you'll have plenty of warm idle resources for a project with an odd load pattern somewhere. If you have a niche requirement that is blowing that out of the water you look to custom hardware not third party virtualized general purpose hardware to fill the gap.

    108. Re:Sorry, but... by omnichad · · Score: 1

      So they should pay protection money? It's not like it was just unplanned downtime. Google actively shut them down. Sounds like a shakedown if you want them to pay in order for that not to happen.

    109. Re:Sorry, but... by BronsCon · · Score: 1

      No, we've been talking about Google this whole time. What we have is an AC who can't follow context.

      --
      APK quotes people (including myself) without context and should not be trusted. Just thought you should know.
    110. Re:Sorry, but... by Gr8Apes · · Score: 1

      I laugh at the $6K a month pricing as enterprise. I personally ran a data center, a small one, representing about 25K people in a smallish dept. Even that had roughly 200 racks of equipment in it with dual generator backups. Not quite the level Cytotoxic states, but we stayed up most of the time. I also was partially responsible for a nice smallish datacenter for a startup I worked for in years past. To give you an idea of what a datacenter for a commercial going concern handles and why millions are spent on them, we ran more than $2M/min in transactions during our peak 2 or 3 hours a day. IIRC, the equipment in that datacenter was roughly $10M with a minimum goal of $3M/yr being cycled out during upgrade cycles. At a different employer, just 1 machine ran roughly $1M, and we had 20 of them in our local datacenter. And those aren't even the biggest servers you could get, even then. People these days mostly have no clue what a real datacenter is or involves, and it's sad they think their single box or a handful of VMs on Amazon make for "enterprise" deployments. They're fine for POC or startup, but sooner rather than later you're going to want to bring that in house.

      --
      The cesspool just got a check and balance.
    111. Re:Sorry, but... by roc97007 · · Score: 1

      As if servers doing down can't happen if you host it yourself.

      But then you're in control, instead of having to rely on some amorphous, anonymous monster that only allows communication via automated email.

      Exactly if/when mistakes are made, your organization made them, you have better transparency on what actually happened, and you have a better chance of affecting the root cause. The cloud is... a cloud. Automatic processes can arbitrarily shut your business down (which, sounds like, happened in this case) and you can't get hold of a human to remedy.

      Mind you, there's lots of ways to fail, if you host or cloud, but at least, when you host, they're your own failures.

      --
      Oliver's law of assumed responsibility: If you're seen fixing it, you will be blamed for breaking it.
    112. Re:Sorry, but... by apoc.famine · · Score: 1

      Yeah, but think of the airline miles!

      --
      Velociraptor = Distiraptor / Timeraptor
    113. Re:Sorry, but... by Bryansix · · Score: 1

      If you're application is that big and that important and having downtime is unacceptable, you operate an active-active architecture in two geographically separated data centers. Then the only thing that is going to cause an outage is if you have poor change control on configuring the routing rules.

    114. Re:Sorry, but... by aaarrrgggh · · Score: 1

      Yes. Unfortunately it is mandatory with certain companies and pushed very hard with others. The actual cost was nearly double your estimate. Complete and total BS, especially when our LOC interest rate is only around 3% annually-- going from NET-90 to NET-15 saves us ~1% interest but costs us ~2.5% in fees minimum.

    115. Re:Sorry, but... by Nethead · · Score: 1

      To reproduce that capability with your own hardware & infrastructure requires a tremendous amount of planning and capital investment: in power, servers, and network.

      No, that's what VMWare vCenter is for. It's not a tremendous amount of work, etc., we do it every month or so.

      --
      -- I have a private email server in my basement.
    116. Re:Sorry, but... by Nethead · · Score: 1

      Greyfox, old man, the dreams we use to have, I tell you; the dreams we use to have.

      --
      -- I have a private email server in my basement.
    117. Re:Sorry, but... by WorBlux · · Score: 1

      Or at least have a redundant fail-over system. And if you do trust a third party, trust one that has good customer service and up-time gaurantee's.

    118. Re:Sorry, but... by proibido · · Score: 1

      That's true but the costs are not even close and I'm a in-house advocate all the way. Every choice (cloud vs in-house) has pros and cons. For the OP that choice should've been made based on budget, risk assessment or internal resources. If it was budget, well, that's the price to pay for a cheaper infrastructure, at least on short to mid term.

    119. Re:Sorry, but... by shaitand · · Score: 1

      "That's true but the costs are not even close and I'm a in-house advocate all the way."

      That's true, cloud solutions are much more expensive than in-house solutions with the same capabilities.

    120. Re:Sorry, but... by LordWabbit2 · · Score: 1

      you need sufficient resources at each site to take up the entire slack from a single site

      Depends on the workload. I worked for a large bank, and their failover mainframe could not handle the full load from the production mainframe. So they would only transfer critical systems to the failover when it was needed, ATM's etc. If they were in failover mode you could not check your home loan balance for instance, things that were not considered "mission critical". But then IBM mainframes hardly ever have to be IPL'ed (rebooted, for you heathens) I think the entire time I was working there they only failed over once (generators would not start during a power outage) and IPL'ed once during an update of something (upgrading DB2 I think, but it was a long time away in a land far far away). The failover was lots of fun though, the entire company basically all got in their cars and drove across the city to the failover site, had upper cheese standing behind you (that was not fun) and when I needed permission to do something all I had to do was turn around and say "I need to do xyz, can I go ahead?" The outage did make the financial news though, and caused our currency to dip a bit, but I still had a lot of fun. Linux fan boys like to say Linux is more stable than windows, and I suppose it is, but if you want seriously stable, run a IBM operating system, just sucks having to write in COBOL 85.

      --
      There are three kinds of falsehood: the first is a 'fib,' the second is a downright lie, and the third is statistics.
    121. Re:Sorry, but... by LordWabbit2 · · Score: 1

      And charge an arm and a leg (with me sometimes a kidney as well) to do it. The people complaining here have no understanding of "critical" systems. To me it's simple. Did anyone die? Did you make the news? Did the currency take a dive? Anything else is people whining about promising 99.9% up time without building the infrastructure to deliver it and farming off that responsibility to "The Cloud". You don't farm off stuff to the cloud when you are promising up time, if you don't have the resources or skills to do it yourself then you should not be operating in that space.

      --
      There are three kinds of falsehood: the first is a 'fib,' the second is a downright lie, and the third is statistics.
    122. Re:Sorry, but... by LordWabbit2 · · Score: 1

      And no amount of backup cloud providers is going to be able to help when a backhoe goes through the internet connection. Having the servers onsite will.

      --
      There are three kinds of falsehood: the first is a 'fib,' the second is a downright lie, and the third is statistics.
    123. Re:Sorry, but... by sverdlichenko · · Score: 1

      But this is not like Amazon kills your entire server pool. It's just single server requiring migration or, in case of EBS root, just a restart.

    124. Re: Sorry, but... by Reverend+Green · · Score: 1

      '"basic service" is exactly what is being provided here.'

      That's too generous an assessment. Pretty much every other hosting provider out there - from local small businesses to giants like AWS - offers superior customer service.

      "Substandard service" seems like a more accurate description.

    125. Re:Sorry, but... by q_e_t · · Score: 1

      True. Most of the time you want to keep customer expectations up and avoid a slow down, though.

    126. Re:Sorry, but... by q_e_t · · Score: 1

      Ideally you would, but an opportunity for faster growth may present itself, and the options might be to not take it, and risk losing market share and later vanishing. But the other danger is botching expansion and collapsing.

    127. Re:Sorry, but... by Spazmania · · Score: 1

      Ha! I suppose that's fair. I should have said: a sole source vendor is a single point of failure in any high availability system.

      --
      Moderating "-1, Disagree" is simple censorship. Have the guts to post your opinion.
    128. Re:Sorry, but... by dcw3 · · Score: 1

      Okay, I'll buy that, while I disagree with the original statement. I've worked for the same company for 36 years, and we've continually provided high availability to our customers. Sure, we could go bankrupt, but would it be of value to our customers to mitigate that risk by utilizing another source? Not likely worth their money IMO.

      --
      Just another day in Paradise
  2. Wait by Anonymous Coward · · Score: 5, Insightful

    The first time this happened,...

    Why was there a second time?

    1. Re:Wait by stephanruby · · Score: 1

      Obviously, it wasn't deemed that important enough to put a contingency plan in place after the first time it happened.

  3. Why you should self host by Anonymous Coward · · Score: 1

    Why you shouldn't use the cloud period

    1. Re:Why you should self host by Anonymous Coward · · Score: 1

      You shouldn't use the internet. Your ISP can knock your 'self hosting' server offline PDQ!

    2. Re:Why you should self host by Junta · · Score: 1

      They can't knock out the connection between a local monitoring server and the local equipment.

      In this case, they might have lost *a* power plant remotely, but could still reach out to the local personnel and *they* would still see monitoring.

      Also, the same ISP can knock out your ability to manage your cloud hosted infrastructure, so the cloud adds a party to screw you over without removing any.

      --
      XML is like violence. If it doesn't solve the problem, use more.
    3. Re:Why you should self host by shaitand · · Score: 1

      No, they can't because you have multiple redundant peerings for your AS... right?

  4. Donâ(TM)t trust an ad company. by Anonymous Coward · · Score: 3, Insightful

    Over 90 percent of Google income is adverts. You would be absolutely insane to trust them with your business or educational institution data.
    Iâ(TM)m not saying MS or Amazon is great but at least their revenue model is not based exclusively or largely on data mining of users.

    1. Re:Donâ(TM)t trust an ad company. by Actually,+I+do+RTFA · · Score: 1

      And Amazon and Microsoft are working hard at replacing^W supplimenting their "just charge money" business plans with advertising money.

      --
      Your ad here. Ask me how!
  5. Can Confirm by Anonymous Coward · · Score: 1

    This happened to me. They had some kind of p2p malware going on at the data center, they saw that one of my servers use a p2p service (cryptocurrency) and they literally banned my entire project causing all servers in all regions to go offline. It took them DAYS to get everything back online with only a "sorry for the inconvenience" email. They costed me money and spent trust with my users. I had lots of redundancy, just never expected my project to get shut down.

    I still use them, but now I spread my services across other cloud providers as well.

  6. Get used to it by Anonymous Coward · · Score: 1

    If someone else owns your infrastructure, then they but need to flip the switch and your infrastructure vanishes in a puff of, well, cloud.

    This is the essence of "cloud". This is the future, everyone tells us.

    1. Re:Get used to it by AHuxley · · Score: 1

      Some sort of chart on a web site that compares what brands offer and the costs?
      The cloud products, services. A computer thats at another location. Another location that allows a company to bring their own computer and set it up.

      --
      Domestic spying is now "Benign Information Gathering"
  7. Fake news by Bobrick · · Score: 1

    That's impossible, AI wouldn't let this happen. /s

  8. No mission critical activities on the internet by Cassini2 · · Score: 2

    You need to design the systems such that they have a fall-back and can continue to operate without an internet connection.

    Really ...

    What are you going to do the next time a major blackout occurs, and the grid wants you to restart your turbines?

  9. Amazon's cloud s no better by Wycliffe · · Score: 5, Interesting

    Our company tried to use Amazon a few years ago and ran into the same issues. Although google and amazon allow you to
    spin up a single instance, they are really designed for companies that have hundred if not thousands of servers. Amazon
    assumes that you have dozens of fault tolerant servers and if one goes down you just replace it with another one. This works
    great for companies like Netflix but Amazon is a disaster for a company that isn't fully fault tolerant and has critical servers
    that can't go down. Liquidweb, Rackspace, Linode, and even Digitalocean are more reliable when it comes to wanting to
    keep a single server up and running with minimal downtime. Now if you need to keep thousands of servers up and don't care
    if any one server goes down then Amazon works fine.

    1. Re:Amazon's cloud s no better by fahrbot-bot · · Score: 5, Insightful

      ... but Amazon is a disaster for a company that isn't fully fault tolerant and has critical servers that can't go down.

      If your company has "critical servers that can't go down" (wherever they are) and you're not fully fault-tolerant, you're the disaster, not Amazon.

      Not here to pick a fight, just sayin'. (one finger pointing at someone else is also three fingers pointing at yourself)

      --
      It must have been something you assimilated. . . .
    2. Re: Amazon's cloud s no better by Anonymous Coward · · Score: 1

      That is the opposite of the story. The story says Google took every server down.

      I think you missed the point of the person you replied to.

      Google can't take "every" server down, they can only take down servers Google runs.

      The person you replied to is saying if it is really so important, you need redundant systems in many data centers on many networks and run by many different hosting companies.

      He or she said that because that is the solution. A solution to the problem of only using a single hosting provider, aka Google.

      Had the story submitter known this fact, it wouldn't have been possible for Google to take down "every server", all they could have done was taken down all of the servers run by Google, leaving all the other servers hosted at other companies up and running just fine.

      By complaining about how much money they lose when down, it is implied they want the opposite of being down, and this solution leaving all servers up except the ones Google hosts would provide exactly that.

    3. Re:Amazon's cloud s no better by fleabay · · Score: 1

      What do you mean "even Digitalocean"?

    4. Re: Amazon's cloud s no better by Wycliffe · · Score: 1

      What do you mean "even Digitalocean"?

      Digitalocean is usually considered a low cost provider. I'm not sure they even have 24/7 phone number you can call. They are also completely self service as far as I know as is google/amazon unless you have a multi thousand dollar contract. Liquidweb and rackspace on the other hand are considered full service and you can always reach a technician if you have a problem.

    5. Re: Amazon's cloud s no better by Wycliffe · · Score: 1

      Heaps of fail in your statement, I fear for your customers. No wonder you think cloud is so bad.

      Servers that canâ(TM)t go down are so 80s and 90s ways of thinking. Servers/compute are throw away and should be treated as such. You are not supposed to ever touch a servers config in cloud computing, change the base image spin up another and terminate the old.

      But this is ignoring the fact that it Should be load balanced across multiples azs anyways, across multiple regions if mission critical with high cost of downtime.

      There are millions of small businesses that run their websites on a single server or a single VM. Very few if any are using base images or multiple AZs. Most are likely using a third party web designer who sets them up on a $5-$20/month instance and hands them the keys. Neither they nor their web designer knows the first thing about base images or availability zones. My point is that for these businesses even though google and amazon are cheap, they are not a good fit for small businesses. Small businesses are much better off using one of the many companies designed for you to have a single box. These companies still obviously need some form of backup and probably shouldn't rely totally on the backup provided by their cloud provider but they do just fine without fault tolerance.

    6. Re:Amazon's cloud s no better by viperidaenz · · Score: 1

      I'll reword that with the same meaning:

      Ill pick a fight. :) AWS should have their VM's fully DRS's so they never go down. I run VMware and with over provisioning hardware we sometimes have VM's go down.

    7. Re:Amazon's cloud s no better by whitelabrat · · Score: 1

      This is truth. To be honest this is cloud computing in general. You have to architect for everything to fail in every way. To expect any cloud computing environment to be completely reliable is naive and ill prepared for the reality.

    8. Re:Amazon's cloud s no better by coofercat · · Score: 1

      I agree, but it's possible that you may have some legacy code somewhere that maybe costs a lot to replace with better. For that, you want a decently reliable VM to run it on - not necessarily "up 100% of the time", but not manually taken down without some notice. That doesn't seem like a crazy requirement of a cloud vendor.

      I can't say with any qualification that Amazon probably is not a good place to run such a server. It's great for 'destroy it and let the auto-scaling fire up a new one', but I wouldn't trust it for high uptime servers. No particular 'red flags' to justify that feeling, but also nothing to 'give me the warm and fuzzies' about it either.

      A previous client switched to Google - I asked them about 'high uptime VMs', and they talked about how a hardware failure would mean the VM would be migrated to new hardware. They did of course suggest we built in some redundancy, but didn't dissuade me in any way. To date, there was one server which rebooted mysteriously during the weekend. Support regurgitated the need for redundancy, but also told us how quickly the server became available again.

    9. Re:Amazon's cloud s no better by nasch · · Score: 1

      Our company has one main web server on AWS, and I don't think it's ever gone down due to Amazon's fault. If it were important enough, I could set up a load balancer or hot backups in different AZs or something, but for our needs a backup VM that is turned off is good enough.

  10. Disaster Recovery by tk77 · · Score: 4, Informative

    If an extended system outage can cause "millions of dollars in lost revenue" then you should have a DR plan. Don't put all your eggs in one basket. Have copies of everything at another site (EC2, Azure, Colo, etc) that you can turn on and switch to in this event. If millions of dollars are on the line, then it shouldn't be unreasonable to have such a plan and infrastructure established.

    1. Re:Disaster Recovery by Anonymous Coward · · Score: 1

      If an extended system outage can cause "millions of dollars in lost revenue" then you should have a DR plan. Don't put all your eggs in one basket. Have copies of everything at another site (EC2, Azure, Colo, etc) that you can turn on and switch to in this event.

      Isn't that the point of having your shit hosted with a provider like Amazon or Azure. You order up the level of redundancy you need for the situation at hand within the provider itself. Most of these systems offer locality controls where data can be physically replicated across the planet if need be. Otherwise you are being ripped off for nothing when you could host the same shit yourself and bury backup disks in a deep hole in the forest for much less.

      Then of course when something goes wrong with that it isn't enough people will be like ah ha! You actually should have used multiple providers.

      Then when you use multiple providers and there is a widespread Internet outage due to some BGP snafu or fiber apocalypse someone else will be like ah ha! You need your own leased lines.

      Then when those break there will be someone else going...ah ha! You have to string them yourself and bury them deep where trackhoes won't get them.

      Then when there is a freak history setting blackout you will find someone saying ah ha and explain some increasingly implausible way in which you could have prepared yourself for that disaster.

      It's a never ending stream of hindsight that is never good enough and never acknowledges basic reality that nothing can ever be guaranteed and sometimes the best course of action is to ACCEPT RISK instead of burrowing further into an unwinnable abyss. A basic reality no armchair hindsight purveyors will ever admit to or even consider.

      I can't even enter 1 + 1 into a calculator and be guaranteed the resulting outcome will be 2. It's a low probability event yet the chance of occurrence is nonzero.

      I happen to agree with TFA as it relates to relying on Google specifically as a company for anything as general matter. Yet for the standard to be you need multiple providers if you want redundancy... well that's basically bullshit. It's in nobody's best interests to accept that.

    2. Re:Disaster Recovery by gweihir · · Score: 3, Insightful

      They did not have a DR plan _after_ this happened before. They did apparently not even buy enterprise grade cloud services, you know, those with an SLA. The problem is not the cloud here, but the utter morons that decided to use it in that way.

      --
      Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
    3. Re:Disaster Recovery by tk77 · · Score: 3, Insightful

      No.

      Relying on a single company no matter what their internal redundancy is, is not having a good DR plan. Especially when the amount of revenue involved his so high.

      As others have pointed out, the other cloud providers do the same thing. You can have your stuff spread all across the country/world with a single provider and if a glitch in their system says you shouldn't have service, it will all be turned off (as is presented in the TFA and from others suggesting at least Amazon has done the same thing).

    4. Re:Disaster Recovery by BeanThere · · Score: 2

      Then of course when something goes wrong with that it isn't enough people will be like ah ha! You actually should have used multiple providers

      Exactly; one of the core selling points of "cloud" services is that they're supposed to offer increased fault tolerance (e.g. a virtual server can be restored in a different data center if one data center explodes) ... so now these knee-jerk Google defenders are telling us we need 'multiple cloud providers - let's call this new service a "cloudcloud" - so when your cloud service is shut down you can continue running your backup clouds from the cloudcloud ... what next, we need a cloudcloudcloud for your cloudclouds?

      The core issue here isn't that a server went down - of course everyone knows servers can go down - it's that they were deliberately shut down, with no warning, with unreasonable recourse, after the user paid for a service that Google failed to provide through incompetence. It should be a an SLA issue, you should be able to sue your service provider for something like this, but you can bet Google's lawyers have covered everything in the fine print.

      This is like, you order a burger at a restaurant, and instead of a burger, you get a plate with shit on it. And then the people here would say, 'well you should have expected that you might get shit on the plate you dumb-ass, why didn't you order five burgers from five different places!?' Uh, because no reasonable person expects that.

    5. Re:Disaster Recovery by tk77 · · Score: 1

      This is like, you order a burger at a restaurant, and instead of a burger, you get a plate with shit on it. And then the people here would say, 'well you should have expected that you might get shit on the plate you dumb-ass, why didn't you order five burgers from five different places!?' Uh, because no reasonable person expects that.

      Another straw man argument.

      I am in no way a Google defender, but it does sound like your a single cloud service defender. Why would you trust your entire business to a single provider? Regardless of whether they can host your VM on a thousand different locations, its still the same provider. One mistake on their end bring your entire business down.

      For a small company that can't afford the multi hosting or staffing to manage it, I can see running on a single provider. But if the amount of revenue at stake is in the order that the TFA suggests ("millions" over a short period of outage time), then I would think you'd want to be as redundant as possible. At least have two locations run by different providers.

      I'm all for blaming Google for a lot of things, but not if its due to my own poor choice in IT practices.

    6. Re:Disaster Recovery by Anonymous Coward · · Score: 1

      Relying on a single company no matter what their internal redundancy is, is not having a good DR plan. Especially when the amount of revenue involved his so high.

      Says the person offering zero objective supporting evidence.

      As others have pointed out, the other cloud providers do the same thing. You can have your stuff spread all across the country/world with a single provider and if a glitch in their system says you shouldn't have service, it will all be turned off (as is presented in the TFA and from others suggesting at least Amazon has done the same thing).

      I may get hit by a meteoroid sitting at my desk at work. To protect myself the best thing for me to do is immediately quit and only seek work in underground bunkers from now on.

      Unsubstantiated appeals to FUD with no supporting data is not a useful methodology for characterizing and managing risk.

      Without characterizing likelihood of occurrence and considering available strategies to prevent or mitigate a problem you are lost and flying blind. Your view to your destination is as clear as my view of the sun from my underground bunker.

    7. Re:Disaster Recovery by viperidaenz · · Score: 1

      You make sure the service levels of the service you're paying for meet your requirements.
      If the contract says they can immediately suspend all of your services if their system detect unusual behaviour with your account, you should have something in place to mitigate that risk.

      How is that not obvious to someone in charge of the operations of millions of dollars worth of equipment?

    8. Re:Disaster Recovery by Darkling-MHCN · · Score: 1

      Easier said than done. Cloud Services provided by companies such as Google, Amazon and Micosoft's often come with proprietary API and sdk to access cheaper, more virtualised services, that provide redundancy and automatically scale on demand. For most people that's the whole point of "The Cloud" is the ability to create cheap proof of concept applications and then be able to spin up a global enterprise scale service out of that with a flick of a switch. The concept seems simple and is pretty sexy and seductive, but it tends to lock you into the vendor. If you're just creating VMs and hosting them in cloud space instead of hosting them locally that's another story, but many would say you're missing the whole point of "The Cloud"... especially if they work for Google, Amazon or Microsoft.

    9. Re:Disaster Recovery by N1AK · · Score: 1

      This is like, you order a burger at a restaurant, and instead of a burger, you get a plate with shit on it. And then the people here would say, 'well you should have expected that you might get shit on the plate you dumb-ass, why didn't you order five burgers from five different places!?' Uh, because no reasonable person expects that.

      No reasonable person has millions of dollars riding on having a proper burger delivered, if they did then you can bet your ass that anyone competent would have a backup plan.

    10. Re:Disaster Recovery by tk77 · · Score: 1

      That's part of the reason why at my company we opt'd to stick with a more traditional setup. Our own hardware sitting in a cage co-located at a local data center with multiple internet routes. We have our own named sales rep who gives us a call if there are any problems (strange activity, hitting an overage on bandwidth, blacklisting issues, complaints, etc..), or sometimes to just to say hi and see how things are going. We replicate our systems off-site in case of a problem.

      We do have up-keep and maintenance costs on the hardware/software but after running the numbers and risks on running everything on a "cloud" provider, it came out better to keep going this way. We have thought about using EC2 potentially as additional capacity, but in the end we find it easier to just order up another server or two and add it to our vSphere cluster.

  11. It makes sense why Google is like this by MikeRT · · Score: 2, Interesting

    YouTube users, GMail users, etc. have all complained about similar issues with blackbox, zero accountability. On click, boom, you're done.

    IANAL, but this is my theory...

    We know that Google is controlled by some highly political people. People who want to be able to disconnect you, deplatform you, etc. at the drop of a dime. The more they make their services a customer service blackbox, the easier it is to get away with acting in bad faith.

    By bad faith I mean specifically in contractual bad faith. All of the XKCD-citing hipsters miss a very important nuance of the law regarding "deplatforming assholes:" contracts are judged by the "good faith" conduct of both parties and evaluated by reasonable behavior standards.

    They do things like tie your account to all of the services, including purchases, and after a few vague "bad behavior incidents" nuke it. Often taking real assets with them because of how those accounts are tied. I don't think, for instance, Microsoft would fair well if they cost someone $2k of XBox Live marketplace purchases because they cussed out a few butthurt players a few times (Microsoft claims it has the authority to do this). Google is the same way on a larger scale.

    The more people that are involved, the more people who can be hauled into court, forced to testify, etc. You can demand they answer why they thought a reasonable person would act that way. You can point to flesh and blood people who are the focal point for a real user suffering real economic harm due to one or a few people's biases.

    And then win damages.

    IMO that is why you see these companies aggressively moving in this direction. It's about not facing as much accountability for acting like dicks.

    1. Re:It makes sense why Google is like this by Anonymous Coward · · Score: 2, Insightful

      All of the XKCD-citing hipsters

      This is a good time to remind people that XKCD wrote that comic to justify the forceful expulsion from the entire Internet of ordinary people who disapproved of the corruption of a gang of liars who happened to be connected to multiple foreign intelligence agencies, including state sponsors of al-Qaeda. And it turns out they own all of the press too, given how nobody has told this story correctly.

      All of that "Gamergate harassment?" There was one tweet. ONE. It was sent by Slade Villena to Randi Harper. The rest was either sent by their own people or random jokers. None of it was ever traced back to the Gamergaters despite having the Home Office, QCRI, DHS, FBI, Google, and Microsoft on the case trying to find somebody who was guilty so that they could justify their funding. When they couldn't find anyone, they simply concluded that Gamergate was a harassment campaign because everyone said it was and they pocketed their paychecks.

      When Trump says that the MSM is fake news, he is not lying. When he calls them the "enemy media," remember that I mentioned state sponsors of al-Qaeda. He is not lying.

    2. Re:It makes sense why Google is like this by AmiMoJo · · Score: 3, Informative

      A simpler and more likely explanation is that you paid $0 for the service and got your money's worth.

      There are also lots of obvious examples that disprove your conspiracy theory. If it was politically motivated then what kind of politics wants to silence both the far right and progressives and non-political trans make-up videos and help the copyright industry abuse people?

      --
      const int one = 65536; (Silvermoon, Texture.cs)
      SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
    3. Re:It makes sense why Google is like this by vtcodger · · Score: 2

      Let me get this straight. You're asserting that Donald J Trump and XKCD are sponsoring al-Qaeda?

      --
      You can't see ANYTHING from a car, You've got to get out of the goddamned contraption and walk...Edward Abbey
    4. Re:It makes sense why Google is like this by Kohath · · Score: 1

      I don't think it's based on politics. I think all the behavior you are seeing from Google is merely extreme arrogance.

      The political stuff is a manifestation of that extreme arrogance. Think about it: when you're so much more important and simply better than everyone else, judging any outlook that differs from yours as evil (or stupid, or otherwise contemptible) becomes second nature.

    5. Re:It makes sense why Google is like this by Kjella · · Score: 1

      I think you're trying to turn a simple truth into a complex conspiracy theory: The profit is in users that make no fuss. Every time somebody clicks "report video" on YouTube it costs Google money. Every harassment report on Xbox costs Microsoft money. Dealing with borderline content or behavior isn't about giving you a fair trial, it's about minimizing expenses. What that means in practice is lots of automation, lowest-bid contracting, broad rules and zero tolerance policies. Did you break the rules? Yes. Is that legal grounds to kick you off the service? Yes. Then bye-bye. Reactions don't have to be "reasonable", take this example from the GPLv2:

      4. You may not copy, modify, sublicense, or distribute the Program except as expressly provided under this License. Any attempt otherwise to copy, modify, sublicense or distribute the Program is void, and will automatically terminate your rights under this License.

      It doesn't matter if you did it accidentally, one mistake and it's instant death penalty. That's got nothing to do with bad faith, as long as Google/Microsoft isn't misleading you about what the terms are and wants you to use the service by their terms they're acting in good faith. If anything, the claim would probably have to be that the terms are unconscionable but that's a really high bar to pass and particularly for a free service.

      --
      Live today, because you never know what tomorrow brings
    6. Re:It makes sense why Google is like this by serviscope_minor · · Score: 4, Insightful

      This is a good time to remind people that XKCD wrote that comic to justify the forceful expulsion from the entire Internet of ordinary people

      No it's a good time to remind people you've gone way off the deep end, mate.

      It does however prove that just aobut anything in favour of gamergate no matter how batshit insane will get modded up here. Like this bit:

      of al-Qaeda.

      Aside: isn't that sort of old news even for the crazies? Aren't ISIS responsible for chemtrails now or are they merely a false flag perpertrated by the deep state t ostop us knowing the truth about how a chemtrail spraying plane actually did 9/11?

      All of that "Gamergate harassment?" There was one tweet.

      Gamergate was one tweet: +3 Insightful. I think that might be a new low for slashdot moderation.

      --
      SJW n. One who posts facts.
    7. Re:It makes sense why Google is like this by serviscope_minor · · Score: 1

      We know that Google is controlled by some highly political people.

      I can see where this is going, and it's going somewhere pretty silly. Google's actions are simly no that coherent. They block people you like, also people you hate and plenty of random shit for no apparent reason that you're ambivalent about.

      All of the XKCD-citing

      And yet you don't in fact have the right to spew venom in my living room or indeed on a forum I run.

      hipsters

      I know you think it's an insult, but they don't. Hipsters have better food fewer gears (just one!) and seem to enjoy life a great deal by choosing to not have the same priorities as you.

      This makes a lot of people angry because they seem to be winning at life (i.e. being happy) by not putting in the soul crushing grind to make tons of money.

      More power to them, I say. And I shall continue to get excellent coffee while enjoying the fine selection of beards on display.

      --
      SJW n. One who posts facts.
    8. Re:It makes sense why Google is like this by squiggleslash · · Score: 1

      +3 Insightful. I think that might be a new low for slashdot moderation.

      It's +5 right now. Reminder, you can at least reduce the amount of terrible moderation - we seem to be getting it in droves right now - by visiting this page and voting opposite bad moderation.

      --
      You are not alone. This is not normal. None of this is normal.
  12. s/Google// by Todd+Knarr · · Score: 2

    Seriously. When someone else owns and operates your infrastructure, things like this are going to happen. When that someone earns their revenue from something other than the bill from them you pay every month, it's going to happen a lot more often because they'll be acting based on what's good for their business, not what's good for yours. This is life on any cloud platform. This was life with mainframe service bureaus back when they were the cloud platform of choice.

    You have to make a call based on what the trade-offs are. Make sure you know what those trade-offs are going to be, bearing in mind that any contract you have is probably going to say the provider's only responsible for refunding your month's payment no matter what the cost to you of their mistake was. It's that that you're balancing against the cost of running your own hardware, not the monthly bill.

  13. Clouds are opaque for a reason by eastjesus · · Score: 1

    When you entrust your business to an outside cloud service you are entrusting people, organizations, policies, and procedures that you don't and usually can't know with the keys to the success of your business. They can be very useful and cost effective in situations but I would never trust an outside organization for mission critical services.

  14. What should they have done? by viperidaenz · · Score: 1

    Turned off their automated systems, and who ever caused the flag to be raised with your Google Payments Account gets in and takes over your entire system, maxes out your CFO's credit card?

  15. Points and laughs, HA HA! by bigmacx · · Score: 1

    Put all your infrastructure under the physical control of some other entity well beyond your reach and then discover they can summarily turn it off and refuse to respond - Duh!

    The Cloud Stikes Back!

  16. Third parties by sjbe · · Score: 1

    Never trust your data to a third party when millions are at stake -- let alone critical infrastructure reliability.

    While that is reasonable advice, sometimes that isn't an option. Sometimes the only reasonable way to do things is through a third party. Furthermore sometimes the third parties can do a better job than I could do myself, even accounting for their flaws.

    1. Re:Third parties by Chris+Mattern · · Score: 2

      As other people have pointed out, the magic letters here are "SLA". You must have a contract stating what the vendor's responsibilities are and be able to enforce that contract. Otherwise, you don't have a business, you just have a hobby.

    2. Re:Third parties by Anonymous Coward · · Score: 1

      Why would you advocate "Lamba/EC3" cloud trash when you know -- definitively, absolutely know -- that "Google, and MS are all shooting for vendor lockin" ? How is using any "technology" a "super super cool way to build out your infrastructure" -- when its entire purpose is to herd the largest possible market share into a small, tight, unmaneuverable corner, and then keep them there, whether it benefits them or not? Hint: this is not an actual benefit. The so-called "ease of integration" and outsourcing of responsibility are hallmarks of laziness. Naturally, these turn-key systems appeal to the most incompetent.

      Being strategically lured by marketing propaganda, into a position you cannot change - is not actually desirable

      hth

    3. Re:Third parties by DarkOx · · Score: 1

      Because the technology is Good. That is step 0. Sure it might not be ready to be the solution for everyone. Once the lock issues of servless are ironed out and they will be - just look at what happened with virtualization it will be a good move.

      --
      Repeal the 17th Amendment TODAY! Also Please Read http://www.gnu.org/philosophy/right-to-read.html
  17. Re:Why by DontBeAMoran · · Score: 1

    Are you saying he should have used Google Wind instead?

    --
    #DeleteFacebook
  18. Redundancy by darkain · · Score: 2

    You should not have ANY one single point of failure.

    Only 1 card holder? Single point of failure.

    More importantly: Only 1 cloud provider? Single point of failure.

    If you're running that level of cash, and still insist on outsourcing infrastructure, then fucking distribute it. Mirror the infrastructure between AWS, GCloud, and Azure. Even these companies themselves know this. Look up Amazon's DNS providers. Hint, its not JUST AWS, but they outside their own shit too *JUST IN CASE* their servers go offline.

    1. Re:Redundancy by WaffleMonster · · Score: 1

      You should not have ANY one single point of failure.

      Agreed.

      Only 1 card holder? Single point of failure.
      More importantly: Only 1 cloud provider? Single point of failure.

      Hosting everything on a single planet is also a single point of failure. Drawing a box and labeling something a single point of failure is so much fun.

      If you're running that level of cash, and still insist on outsourcing infrastructure, then fucking distribute it. Mirror the infrastructure between AWS, GCloud, and Azure.

      Isn't this why people go to overpriced hosted solutions by major providers because they have access to redundancy and DR services they wouldn't be able to pull off themselves? Hey if I check this box all my data gets replicated to some place else hundreds or thousands of miles away.

      Is there data available that supports the assumption straddling multiple providers leads to better reliability outcomes or is it just an assumption?

      What about costs involved not just in terms of additional dollars owed to hosting providers but also cost of management, increased system complexity and possibility of failure as a result? I've personally witnessed misconfigured backup systems cause disasters.

      Even these companies themselves know this. Look up Amazon's DNS providers. Hint, its not JUST AWS, but they outside their own shit too *JUST IN CASE* their servers go offline.

      Someone has to watch the watchers. It's all part of making sure the service works without a single point of failure even when armchair engineers are able to draw a box and label something a single point of failure.

  19. Your failure to plan... by kenh · · Score: 4, Insightful

    I have an email asking me to fill in a form and upload a picture of the credit card and a government issued photo id of the card holder. Great, let's wake up the CFO who happens to be the card holder. What if the card holder is on leave and is unreachable for three days?

    Uh, I don't know - take a picture of each and save them on your phone, in case you need them?

    You report everything was back up within 20 minutes once you submitted the requested information - that seems pretty good to me.

    Now, about your decision to only run one instance of your mission critical application suite on exactly one cloud service...

    The story here is you consider it someone else's fault for your failure to plan/prepare for an outage.

    --
    Ken
  20. Not a Google Cloud problem. by Gravis+Zero · · Score: 1

    This is not a problem with Google Cloud, this is a problem with all "cloud" platforms. It's really simple, they can be held liable so they put acquit ass-covering in the contract so that they can shut you down on a whim. If this doesn't work for you then you should not any "cloud" platform.

    This is just an example of reality catching up to all the idiots who said "put it in the cloud!" while ignoring all the risks. Play with fire and you'll eventually get burned.

    --
    Anons need not reply. Questions end with a question mark.
    1. Re: Not a Google Cloud problem. by Wycliffe · · Score: 2

      This is not a problem with Google Cloud, this is a problem with all "cloud" platforms. It's really simple, they can be held liable so they put acquit ass-covering in the contract so that they can shut you down on a whim. If this doesn't work for you then you should not any "cloud" platform.

      Not true. There are plenty of full service providers like liquidweb and rackspace that won't pull the plug on your server. I've had servers act up and I immediately get a phone call. In severe cases they might even disconnect the network until they can contact you and resolve the problem but they aren't going to destroy your data or even disable your computer without first contacting you. Even in cases of spam they will work with you and try to fix the problem and unless you really are a spammer they won't just boot you off their system the first time there is a problem.

    2. Re: Not a Google Cloud problem. by Gravis+Zero · · Score: 1

      Whether they pull the plug immediately is hardly the issue. The fact is that they can do so legally and they may choose to while leaving you without recourse. Just because they were nice to you doesn't mean they aren't dicking over other people. This is how the business world works.

      --
      Anons need not reply. Questions end with a question mark.
  21. What exactly happened? by thecombatwombat · · Score: 2

    Maybe I just didn't read enough, but it seems like he doesn't say anywhere exactly what happened. He implies it was a billing issue. That's all. Without knowing exactly what went on, it's very hard to care. I imagine it's something like "well the credit card details changed, oh, and we were 107 days overdue."

    Also, millions of dollars are on the line for short downtime and you're billing to a credit card?

    1. Re:What exactly happened? by robinsc · · Score: 1

      Basically Googles automated fraud system suspended his account and threatened to delete everything in case he didn't regularize it within 3 days. the millions of dollars on the line was his account including backups etc. My experience in many cases corporate credit cards are simpler and prefered by the cloud providers since they prefer prepayment to invoicing. And no hyperscale public cloud provider will give you any useful SLA. just look at their published terms and conditions. they don't guarantee anything.

      --
      Linkedin http://in.linkedin.com/in/robinsaikatchatterjee
  22. New scam incoming! by Calydor · · Score: 4, Insightful

    I have an email asking me to fill in a form and upload a picture of the credit card and a government issued photo id of the card holder. Great, let's wake up the CFO who happens to be the card holder. What if the card holder is on leave and is unreachable for three days? We would have lost everything -- years of work -- millions of dollars in lost revenue.

    Somewhere in Russia, India and Nigeria, several callcenters full of scammers came all at once.

    --
    -=This sig has nothing to do with my comment. Move along now=-
  23. Design for Business-Level Redundancy by Anonymous Coward · · Score: 1

    This is why, when I design a cloud service, I design for redundancy at the business level too, not just at the host level. I deploy to and run on multiple different cloud vendors at the same time. This way, all my eggs aren't in one basket if one has a billing hiccup or one goes out of service.

    1. Re: Design for Business-Level Redundancy by Anonymous Coward · · Score: 1

      This!

      People (both techs and especially bad management and bean counters) always want to blame the tech, or place improper emphasis on redundancy at the tech level, but the root of this is a bad/poor business decisions. It happens all the time with peoples bullshit sob stories like the one in the summary.

      Fix the business rules first, then a proper tech strategy can be implemented that includes actual redundancy, along with a contingency plan, and a disaster recovery plan, etc.

  24. Re:Sorry, but...cheap advice. by HornWumpus · · Score: 1

    Good thing I don't work for Amazon. Cause I just found some synergy. Amazon video would be about the get much better, at zero cost to them.

    --
    John McAfee 'It was like that time I hired that Bangkok prostitute; to do my taxes, while I fucked my accountant'
  25. Read your contract and SLA by davidwr · · Score: 2

    I hate to blame the messenger, but either you didn't buy the right service level agreement or Google broke the contract.

    If it's the first case, blame yourself and learn a lesson. You get what yo pay for. If Google doesn't offer the level of service you need, go elsewhere. If they do, either pay up or go elsewhere.

    In the second case, you are rightfully upset but you should be talking to lawyers before talking to Slashdot.

    --
    Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
  26. Yeah, big warning signs on the user side here by Phil+Urich · · Score: 5, Insightful

    The first time this happened,...

    Why was there a second time?

    So many of the problems here (ex. paying with a credit card and one that has only a single person's name on it? Having no fallback that can be spun up elsewhere?) are foolish if this has never happened before, and utterly, mind-bogglingly idiotic if this in fact has already happened before. It's one thing to be blind of something you should know could be a problem, it's quite another to be blind and wholly unprepared for a problem you've personally experienced! Something seems fundamentally wrong at this company.

    Also, if your entire business can die because it takes an unexpected few days off, then perhaps your business is running a bit too raggedly and doesn't have enough meat on the bones . . .

    --
    I remember sigs. Oh, a simpler time!
    1. Re:Yeah, big warning signs on the user side here by gweihir · · Score: 4, Insightful

      There probably is just one problem here: Utterly incompetent and greedy management that made this demented decision.

      --
      Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
  27. Cheap service, cheap results by fyngyrz · · Score: 5, Insightful

    The company thought they could get away with paying less for server infrastructure. They can. But they get less. This is one of the "less" things they get.

    If you value your data, host it yourself, preferably in multiple locations. If you want to go cheap, then you can expect to lose things.

    Like your data, or access to it, or availability of it.

    It's not such a smart thing to cheap out on the important stuff.

    Of course, convincing the bean counters of future risk inherent in what appears to them to be current savings... good luck with that.

    Well, best to get rid of your bean counters. :)

    Here's a maxim of mine I like to drop on the table during discussions like these:

    If you can't afford to do it well, you almost certainly shouldn't be doing it at all.

    --
    I've fallen off your lawn, and I can't get up.
    1. Re:Cheap service, cheap results by Anonymous Coward · · Score: 1, Insightful

      Sorry, all of that is horseshit. Google is the one making the mistakes here, not the people who took them at their word that they offered robust professional services.

    2. Re: Cheap service, cheap results by Anonymous Coward · · Score: 4, Insightful

      Then take the contract you have with them to court and get your effing money back. What? No contract? You deserve what you get.

    3. Re: Cheap service, cheap results by cyber-vandal · · Score: 1

      Thankfully that's a cheap and quick process and Google are sure to pay up right away

    4. Re:Cheap service, cheap results by gweihir · · Score: 4, Insightful

      You get what you pay for. They thought they could get much more than they paid for. 100% their fault. At the very least, they failed to evaluate what they actually got. And, since this is apparently the second time this happens, they also seem to be unable to learn.

      --
      Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
    5. Re:Cheap service, cheap results by CaptainDork · · Score: 5, Interesting

      This.

      I'm retired now, but my firm had a plan to replace me with the cloud.

      We were a law firm.

      During my last two weeks, an elderly couple drove in from about 70 miles away to sign some family law papers and they were waiting in the conference room when a partner got hold of me and told me, "The cloud's down again .

      I called the support number and they said they were aware of the problem and that they were working on it.

      After a lot of pressure, I called again and told them to fail-safe over to the mirror that they had bragged about.

      They said the outage got the mirror, as well.

      I informed the partner and she started screaming at me. She yelled, "WHAT IS PLAN B?"

      I said, "Ma'am, plan B is plan A."

      It was quite a shit storm.

      I had argued against the cloud, and I documented their rejection of my recommendations and they signed off on it.

      They spent a a butt load of money bringing all that shit home, and I worked there another 5 years.

      I had full hands-on control of the shit I built. All of it. I was totally responsible and could either fix, or get fixed, anything running in my house.

      --
      It little behooves the best of us to comment on the rest of us.
    6. Re:Cheap service, cheap results by BronsCon · · Score: 5, Interesting

      I have (neigh, had) a client making the same choice right now. All of their infrastructure, under my hand, I could fix as it was fully under my control. We'll see where they are in 5 years.

      I'll happily go back to them when they ask me to clean up the shitstorm their current CTO doesn't see coming. As CTO and for double what I was getting as a contractor, of course.

      Under my lead, they went from everything running on a single unstable server and going down every other day to everything running on a distributed cluster of servers and not a single outage in sight. This, in the matter of under a year, on a site that is their entire business and sees over 500k uniques and serves over 20 million pages per month.

      Their new solution costs them more per month and limits them to 1.5 million pages served per month on the best package they actually list a price for, with no uptime SLA. In short, they'll be paying 15x as much for infrastructure for the capacity the current system has, which more than covers what they're "saving" by not paying me.

      It is what it is, but they thought I was only looking out for my own ass when I pointed all of this out. Well, I was looking out for me but, as they were providing me steady full-time work, a huge part of that was looking out for them in-kind.

      --
      APK quotes people (including myself) without context and should not be trusted. Just thought you should know.
    7. Re:Cheap service, cheap results by pnutjam · · Score: 4, Insightful

      That's what I tell people about the cloud. It's great when it works, but if you have problems you are almost certainly just a small fish. The best you can hope for his your money back, but your services will still be down and your left scrambling to build elsewhere.

    8. Re:Cheap service, cheap results by Anonymous Coward · · Score: 1

      Cheap service, cheap results

      in my experience "cloud" services are not actually that cheep, unless you need large amounts of storage and/or processing power for short periods of time.

      If you just keep storing whatever data you have or produce (so your storage needs supposedly are steady or increase at a constant pace) and have a pretty much continuous need for processing power (with picks and lows, of course), the cost of doing it yourself will be similar to that of clouds services and the advantages in terms of owning your data, being in control, and "in house" understanding of what you do will be immense.

    9. Re:Cheap service, cheap results by nick_davison · · Score: 1

      And then, after you moved them back in house, you retired?

      It all comes down to where the business wants their critical vulnerability.

      They moved it from a smart guy on site to what was hopefully a decently staffed cloud service.

      They were then at the mercy of the business having outages, their hardware having outages, the network connection having outages, their phone support having outages. All of these things led to a moderate chance - that came true - of a failure lasting for several hours at an embarrassing time.

      So they moved it back in house to a really smart guy who knew how to maintain the hell out of very well optimized system and was willing to log in no matter whether he was on vacation, sick or whatever.

      Which moved them to a single point of utterly massive failure, not a couple of hours of outage but weeks, if not semi permanent, if he got hit by a bus, into a car wreck, had a stroke, etc.

      Given I'm yet to find someone who's taken over a proprietary system, without a great hand off, that hasn't cursed out the original guy... no matter how well that first guy thought he'd documented everything... recovery after that single point of failure is now hell vs a stressful few hours - if anything happens to you (or the guy you passed it on to after you retired) vs the more frequent issues with the cloud chain.

    10. Re: Cheap service, cheap results by Tough+Love · · Score: 2, Informative

      Whoosh. The GP's point was the Google never contractually obligates itself to customers, because it can get away with that, flouting the norms of commerce. They can get away with this because their customers are stupid.

      --
      When all you have is a hammer, every problem starts to look like a thumb.
    11. Re:Cheap service, cheap results by Anonymous Coward · · Score: 1

      The company thought they could get away with paying less for server infrastructure. They can. But they get less. This is one of the "less" things they get.

      You never pay less, it just becomes Op-ex vs Cap-ex.

    12. Re:Cheap service, cheap results by CaptainDork · · Score: 1

      So I've been retired 3 years now and They're out of business?

      Who the fuck have I been drinking coffee with every Friday afternoon?

      --
      It little behooves the best of us to comment on the rest of us.
    13. Re:Cheap service, cheap results by aquabat · · Score: 5, Funny

      I have (neigh, had) a client making the same choice right now.

      +1 for having the horse sense to move on from that client, after you told them to hold their horses and they didn't listen. I guess you can lead a horse to water, but you can't make him drink. If they're going to get on their high horse and be as stubborn as a mule, then they're going to end up trying to close the barn door after the horse has bolted. When comparing in-house vs cloud, in-house really is a horse of a different colour. Trying to run a tech business on the cloud is just horsing around.

      --
      A republic cannot succeed till it contains a certain body of men imbued with the principles of justice and honour.
    14. Re:Cheap service, cheap results by BronsCon · · Score: 1

      I see what you did there... *clop*clop*clop*

      --
      APK quotes people (including myself) without context and should not be trusted. Just thought you should know.
    15. Re:Cheap service, cheap results by Mad+Merlin · · Score: 2

      Cloud hosting is seldom cheaper than renting space in a DC and throwing your own servers in. The most common exception is if you're really small scale (ie, you would only use one or two physical servers). Most of the time, cloud hosting is quite a bit more expensive.

    16. Re:Cheap service, cheap results by Mad+Merlin · · Score: 2

      Guess what, having someone else manage the hardware for you is only a teeny tiny portion of the workload involved in managing IT infrastructure. Everything you build on top of said hardware still falls on your plate to maintain, cloud or not.

    17. Re: Cheap service, cheap results by houghi · · Score: 1

      Just because the single point od failure did not fail does not mean it can't.
      As long as you talk about a person and not a team, it is a single point of failure.

      --
      Don't fight for your country, if your country does not fight for you.
    18. Re:Cheap service, cheap results by N1AK · · Score: 1

      There are multiple scenarios where cloud hosting is considerably cheaper than co-location. The most obvious and extreme example would be when you have highly variable computational load. For example you could have 3 geographically disparate sets of 6 servers to handle peak load, however that could mean that you're getting
      As much as the cloud isn't the solution to all the worlds problems it is also not the wrong solution to all problems either.

    19. Re:Cheap service, cheap results by Hognoxious · · Score: 2

      Stop being such an ass.

      --
      Confucius say, "Find worm in apple - bad. Find half a worm - worse."
    20. Re:Cheap service, cheap results by pr0nbot · · Score: 3, Funny

      Let's rein in the puns, ok? Feels like we're flogging a dead horse.

    21. Re: Cheap service, cheap results by mellon · · Score: 1

      This is actually not true—the Google contract is pretty explicit and it's not all one-sided. Google makes some pretty good promises. However, rapid response customer service isn't one of them. If I were running this particular business, I'd have redundancy across multiple cloud providers. But that's just me.

    22. Re: Cheap service, cheap results by tysonedwards · · Score: 1

      How did they calculate the âoesuspicious activityâ quotient that would take their entire subscription (not just a single server) offline? Further, should a provider be able to take your entire subscription offline, having multiple instances wonâ(TM)t work unless they were independently managed using separate subscriptions with replication and clustering services added to user land instead of leveraging vendor tools.

      --
      Thirty four characters live here.
    23. Re:Cheap service, cheap results by dcw3 · · Score: 1

      "...took them at their word..."

      Seriously, if you did that, your business deserves to fail.

      --
      Just another day in Paradise
    24. Re: Cheap service, cheap results by Cytotoxic · · Score: 3, Interesting

      Yeah, he is right about that "single point of failure".

      There are a huge number of companies that fall into that hole between "too small to have our own IT department" and "big enough to have a large IT department with formal procedures and personnel redundancies".

      I was that "one guy who does everything" for a long time. It is easily the most efficient way to run a company. Even when we were a small department with a half-dozen really sharp guys, it was still a high-wire act, dependent on having A players who knew the business and systems well.

      It was only after we hit maybe 15 or so IT employees that we really had full redundancy in personnel and reasonably well documented procedures. And even that was pretty small... we didn't have dedicated QA employees, for example.

      It all depends on what your needs are and what your risk tolerance is. For most growing small companies, the reward of having the "one really smart guy" as a key point of failure is worth the risk. Later, when it comes time to sell out to an investor or go public, that risk/reward equation flips and you have to figure out how to eliminate that red flag on the audit.

      As many here have said, going "cloud" isn't a panacea to fix that problem.

    25. Re:Cheap service, cheap results by Anonymous Coward · · Score: 1

      The victim blaming is real with you. Google is the only one to blame here. A long weekend timespan before they just up and delete all of your data is unacceptable. The inability to get a human is unacceptable. Quit defending a bad company for bad practices.

    26. Re:Cheap service, cheap results by shaitand · · Score: 1

      You mean the people who ignored their "overpaid" in-house engineering staff so they could chase the latest buzzwords? Sorry, no, it is on them.

    27. Re: Cheap service, cheap results by shaitand · · Score: 1

      Yeah but the same thing happens, someone cheaps out. They'll run that redundant infrastructure as active/active but then as people cheap out they start treating that as a pool of resources and get in the situation where service is compromised if one is down.

    28. Re:Cheap service, cheap results by sacrilicious · · Score: 1, Interesting

      You get what you pay for. They thought they could get much more than they paid for. 100% their fault.

      For my education, what would you point out as the flaws in the following analogy:

      • I consider buying a house or moving into a condo. I go with the condo, happy that I don't have to maintain a yard.
      • One night a fire starts on the first level of the building. I'm trapped in the fifth floor, and die. I couldn't get out because the heat melted the window shut.
      • The building manager tells my relatives that it's 100% my fault for not having chosen to buy a house instead of moving into a condo, that I got what I paid for.

      While it is "foreseeable" that I might have had different windows on a house I own, or that I'd be able to get out of the fire regardless of having the same windows because there are ground-floor doors in a house, is it really useful to talk about it being "100% my fault"? Should I, as a condo buyer, have had some kind of vague dread that makes me fundamentally at fault for whatever differences there are in the situation, and prevents me from talking about ways the building manager could improve things?

      --
      - First they ignore you, then they laugh at you, then ???, then profit.
    29. Re:Cheap service, cheap results by shaitand · · Score: 1

      Actually you pay more and get much less.

    30. Re: Cheap service, cheap results by wwphx · · Score: 1

      Court? Betcha it's binding arbitration, SUCKAS!

      --
      When you sympathize with stupidity, you start thinking like an idiot.
    31. Re: Cheap service, cheap results by CaptainDork · · Score: 1

      What a dumb fucking thing to say.

      You, or me?

      --
      It little behooves the best of us to comment on the rest of us.
    32. Re:Cheap service, cheap results by HornWumpus · · Score: 1

      Hippocracy!

      --
      John McAfee 'It was like that time I hired that Bangkok prostitute; to do my taxes, while I fucked my accountant'
    33. Re:Cheap service, cheap results by Mad+Merlin · · Score: 1

      Yes, extremely elastic loads are also a good example of where cloud makes sense. In my experience, they're not that common though. Even if you look at something extreme like a Black Friday sale for an online retailer, they're probably not going to see more than, say, 4x the load of a normal day (numbers estimated, few retailers would share them). Also, find me an online retailer that doesn't see Black Friday coming from a mile away and I'll find you an online retailer that's going out of business soon.

      What is more common is people deluding themselves about how elastic their workload is going to be, like the startup that claims their business is going to grow 50000% next week. Congrats if it does, but it probably won't.

      If you want to brute force some encryption, then sure, spin up a billion cloud servers for a day or two. Few businesses find this a worthwhile endeavor though... except for the NSA, who already has several entire DCs of their own doing this around the clock (making it not particularly elastic anymore).

    34. Re: Cheap service, cheap results by TimMD909 · · Score: 1

      You spell color weird... I'm not sure which accent to use in my head while I am reading it. I want to go with a Canadian accent as I've been watching a lot of Letterkenny, but it could be one of the million UK accents too...

    35. Re:Cheap service, cheap results by roc97007 · · Score: 1

      Sorry, all of that is horseshit. Google is the one making the mistakes here, not the people who took them at their word that they offered robust professional services.

      Ok. We can say that, but your business is still down. Saying Google is the one who screwed up is cold comfort when your wheels come off. The objective, in my mind, is not assigning blame after the event, but making some reasonable assurance that it doesn't happen in the first place.

      This is generally true for outsourcing of any vital component and/or personnel. There are salespeople who's job it is to sell you the service. It's your job to decide whether you should buy it. There are companies (who will remain nameless because I'm not posting anonymously) who will gleefully sell you a really lousy service in a contract that's very difficult to get out of. They're not going to say up front "oh and this will cause difficulty with your customers and damage your reputation".

      I think the issue is that the accountability of the service provider (if any) is not equal to the amount of damage to the company caused by an arbitrary outage. I've seen this over and over -- a service and/or personnel provider does something you didn't ask for, or does something you *did* ask for incorrectly, which causes a major extended outage. When you take this up to them you get "--shrug--".

      And yes, the penalty is and should be to fire the people involved. But firing a service provider who's providing a major part of your infrastructure is a difficult thing. Not just contractually, but tactically. Moving to another provider, or insourcing, is a complicated, difficult, expensive, risky task, even if you could reasonably get out of the contract. The service provider *knows* that, and they know how to play the game -- offer you better reliability at a higher price, for instance, and you end up with just as much risk to your business at higher and higher costs for the sake of bullet points to show in overheads to board members.

      --
      Oliver's law of assumed responsibility: If you're seen fixing it, you will be blamed for breaking it.
    36. Re: Cheap service, cheap results by roc97007 · · Score: 1

      Then take the contract you have with them to court and get your effing money back. What? No contract? You deserve what you get.

      Good point. Let's say you do have a contract. Let's say that the vendor is clearly in default. Let's say you tell the vendor they're in default and you want your money back including very real damages to your company. Tell them you will sue if necessary to make this happen.

      Real response in a real case like the above: (This is an exact quote.) "Feel free. We have more lawyers than you have employees."

      (And yes, if you have no contract you deserve what you get. But even having a contract doesn't necessarily help.)

      --
      Oliver's law of assumed responsibility: If you're seen fixing it, you will be blamed for breaking it.
    37. Re:Cheap service, cheap results by gweihir · · Score: 1

      You forget that this is the ElCheapo product. I have no love for Google, but here they are not in the wrong.

      --
      Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
    38. Re:Cheap service, cheap results by gweihir · · Score: 2

      Simple: What you actually bought is a straw-hut and what you were primarily using it for is excessive open-fire cooking...

      --
      Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
    39. Re:Cheap service, cheap results by Mike+Van+Pelt · · Score: 1

      I see what you did there... *clop*clop*clop*

      But where did he get the coconuts? Coconuts are a tropical tree, and I don't think a swallow of whatever continental origin can carry one.

    40. Re:Cheap service, cheap results by CaptainDork · · Score: 1

      Using "the cloud" is fine. If you actually use it as it was meant to be used. With redundancy.

      From my Post:

      After a lot of pressure, I called again and told them to fail-safe over to the mirror that they had bragged about.

      They said the outage got the mirror, as well.

      I think you spend your time following posts and commenting negatively for the lulz.

      --
      It little behooves the best of us to comment on the rest of us.
    41. Re:Cheap service, cheap results by tattood · · Score: 1

      But where did he get the coconuts? Coconuts are a tropical tree, and I don't think a swallow of whatever continental origin can carry one.

      Suppose two swallows carried it together?

      --
      WTB [sig], PST!!!
    42. Re: Cheap service, cheap results by CaptainDork · · Score: 1

      I apologize for my shortcomings.

      --
      It little behooves the best of us to comment on the rest of us.
    43. Re: Cheap service, cheap results by CaptainDork · · Score: 1

      After a lot of pressure, I called again and told them to fail-safe over to the mirror that they had bragged about.

      Read this again and this time, it's OK if you move your lips.

      You are incorrect that I don't know what redundancy is.

      You are correct that I didn't use redundancy.

      It wasn't my cloud.

      You're baiting.

      I'm biting.

      --
      It little behooves the best of us to comment on the rest of us.
    44. Re: Cheap service, cheap results by Tough+Love · · Score: 1

      This is actually not true—the Google contract is pretty explicit and it's not all one-sided.

      Show me who has successfully sued Google for violating its terms of service and I might believe you. As compared to the countless Google customers who have had their service cut off with no compensation whatsoever and usually not even an apology. Not one sided? Give me a break.

      --
      When all you have is a hammer, every problem starts to look like a thumb.
    45. Re: Cheap service, cheap results by CaptainDork · · Score: 1

      Seriously: Moving your lips as you read is OK.

      I had argued against the cloud, and I documented their rejection of my recommendations and they signed off on it.

      --
      It little behooves the best of us to comment on the rest of us.
    46. Re:Cheap service, cheap results by CaptainDork · · Score: 1

      You are correct in all respects, but I see the same thing happening recently where people buy AWS and just load shit up

      Massive US Military Social Media Spying Archive Left Wide Open In AWS S3 Buckets

      .

      --
      It little behooves the best of us to comment on the rest of us.
  28. Welcome to the age of digital serfdom. by technosaurus · · Score: 1

    Sharecroppers, company stores, vassals, etc... all have digital counterparts these days. Instead of a single entity though, it is spread out among several corporate entities and perpetuated by all levels of government. It's only going to get worse unless/until the people revolt. Problem is most of them don't even realize it. Quite clever way of creating highly productive slaves who think they are free.

  29. 365 vs. Apps Story by M0j0_j0j0 · · Score: 1, Informative

    As a consultant a typical war I get into with customers is to pick the cloud setup email, drive etc... I always recommend Microsoft instead of Google products, and I have to always remind people that, altough the strong brand name, Google is an advertising company not an Enterprise partner, and I have countless stories of google pulling the plug on services because of "reasons" whatever, also, they have no respect for the customer when they drop a product, they just send an email with a month notice, and that is a good one, and then they pull the plug, never ever use google products for enterprise, ever. True story!

    1. Re:365 vs. Apps Story by swillden · · Score: 1

      Sure, Google drops free products with little warning. Do you have any examples of them dropping a paid product that way?

      --
      Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.
    2. Re:365 vs. Apps Story by Mad+Merlin · · Score: 1

      Yes, they did this to Google Site Search. There was a paid tier that no longer exists as of April 2018. If you used features that were exclusive to the paid tier (for example, no ads), you're SOL now.

    3. Re:365 vs. Apps Story by swillden · · Score: 1

      Interesting. I had to look it up. Not a terribly compelling example, though, since sites using the paid service were given more than a year's notice, and they still had the option of using the ad-based service if they didn't want to invest in something else.

      If that's the best example there is, then I'd say it supports my argument that Google is pretty careful not to pull the rug without warning from paying customers. It's unreasonable to expect that a business will never shut down a product, what you can and should expect is that the business will honor its contracts and provide adequate notice for transition.

      --
      Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.
    4. Re:365 vs. Apps Story by Mad+Merlin · · Score: 1

      It's one example that I was personally(-ish) bitten by. The free tier was a non-starter as it doesn't offer the XML based API that GSS used to offer. Luckily we already had a replacement for GSS ready to go before the announcement, but I'm sure not everybody was so well prepared.

  30. one colo? by Anonymous Coward · · Score: 2

    Who do you work for? I'm divesting. Hell, as a 6 man startup in the 90's, we knew better to have only one server farm in one colo. Granted, our failover was to the developmental farm on a T-1 in our office, but it was at least *some* failover. h, and the colo texted when there was a problem, real or imaginary.

    That was 6 drunk amateurs 2 decades ago.

  31. Also... by GerryGilmore · · Score: 2

    ...(as mentioned in other comments):
    1) Don't trust another company with your critical IT infrastructure!
    2) Have redundant facilities with different ISPs. 3) Have tested backup/standby power systems.

    Yes, it is expensive, but - how much would it cost you to be down a week? A month? There is no free ride.

    1. Re:Also... by thegarbz · · Score: 1

      1) Don't trust another company with your critical IT infrastructure!

      Why not? They can provide orders of mangitdue better: "redundant facilities with different ISPs" , and "backup/standby power systems" than you ever could.

      Yes, it is expensive

      And that's the key here. This wasn't a case of critical infrastructure in the cloud. This was a case of critical infrastructure in a very cheap consumer grade cloud with no SLA. Seriously much of the critical infrastructure of some of the largest companies are hosted by cloud providers now. The difference is that instead of you being unable to call them, they will be actively calling you.

      I once was asked why my company pays $5000/m for a shitty little DSL connection. Well when that connection went down and we had technicians from the ISP deployed within 20minutes from a company who's consumer service will happily tell you "someone is coming next week on thursday between 12:00 and 17:00 and then doesn't show up at all, THAT's why you pay for a certain kind of service.

      Clouds are no different. Pay peanuts, get monkeys.

    2. Re:Also... by LordWabbit2 · · Score: 1

      Well where I live (in a third world country) losing internet connectivity is a lot more common, Microsoft have started building data centers here, but AFAIK they are still working on it. One ships anchor dropped in the wrong spot halved our countries internet speed, going full cloud is not an option for anything critical. Power outages used to be a big problem, it's all well and dandy if you're generators kick in and your routers etc. are running, just to be cut off the internet when the nearest exchange goes down because they have not maintained their generators. Don't get me wrong, we use the cloud a LOT in my current company, but it's an international company and I think a lot of the cloud mindset is creeping in from there when, we are not in a position to use it reliably here.

      --
      There are three kinds of falsehood: the first is a 'fib,' the second is a downright lie, and the third is statistics.
    3. Re:Also... by thegarbz · · Score: 1

      going full cloud is not an option for anything critical

      That's a narrow view of the economics and data flows for businesses. Based on what you just said I would say for many businesses going full cloud would be the ONLY option.

      But yes if you're idea of cloud includes only having your files and services that you need not on your computer then I fully agree with your assessment.

  32. The first time this happened.... by raspberry-togruta · · Score: 1

    Running a large project on the cloud makes a lot of sense, for a number of reasons. However, every cloud provider is vulnerable to down time... just for starters. Add to that, he had once pulled off line by GCP. Why is this not spread across multiple providers? This is 2018, the headline "Single Point of Failure Can Lead to Downtime" is not news.

  33. In summary by sunking2 · · Score: 1

    This company sucks at running a business.

  34. I have never understood the facination of... by Anonymous Coward · · Score: 1

    cloud services as they are marketed by cloud providers. Suicidal for a business if you ask me.

    Basically you are paying another party to have control of your business, being responsible to keep your business up and running. You are putting your business, the source of your livelihood and livelihood of all the company's employees on systems you have NO CONTROL OVER. Basically putting your business in the hands of other people, who quite frankly do not give a damn about your business other than it pays its bill every month. Have any kind of problem, and you are out of business until the problem is rectified. As a software engineer, I advise client against relying upon the cloud for critical business functions. It makes no sense to put applications, or code bases on devices you have NO CONTROL OVER!!!!

    That being said, I do see cloud services being used as part of a businesses disaster preparedness plan...as a backup system...but nothing more. But giving a cloud provider full control over your business? Plain stupid if you ask me.

  35. If millions are on the line by guruevi · · Score: 1

    You should think about divesting from just-google. The cloud is costing you more already, get yourself a number of real servers with real hosting providers dispersed geographically. Running something solely on Google or Amazon clouds is technically identical to hosting everything on a single server.

    --
    Custom electronics and digital signage for your business: www.evcircuits.com
    1. Re:If millions are on the line by malkir · · Score: 1

      > Running something solely on Google or Amazon clouds is technically identical to hosting everything on a single server. That's not even close to equivalent. Nobody who understands hosting sticks shit on a single server, on any platform.

  36. About time to get lawyers involved by mysidia · · Score: 1

    Bring up the SLA and at LEAST request outage credits....

    1. Re:About time to get lawyers involved by gweihir · · Score: 1

      It pretty much looks like they had a cheap personal account (on which you should never, ever run anything critical) hence no SLA. So now they complain publicly about what is essential their own stupidity.

      --
      Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
  37. Wrong question: Why use Google for ANYTHING? by shubus · · Score: 1

    Throughout all of Google's platforms you can expect this kind of treatment. JUST SAY NO TO GOOGLE.

  38. Re:My guess by vtcodger · · Score: 1

    Fortunately, they will only be a threat 30% of the time (40% if they are offshore). If you're smarter than, for example, Google, you can probably develop a strategy to disable them while they are sleeping.

    --
    You can't see ANYTHING from a car, You've got to get out of the goddamned contraption and walk...Edward Abbey
  39. This stands out to me... by bferrell · · Score: 1

    > Great, let's wake up the CFO who happens to be the card holder. What if the card holder is on leave and is
    > unreachable for three days?

    Unreachable for three days? What if the CFO is dead? Oops!

    In some circles, this is called "the campus bus problem"... The guy who knows everything walks out in front of the campus bus. Now what.

    This also used to be seen on University computer systems... Some grad student has written processes in use all over the place, running from the home directory... The graduates/moves on. The account is deleted; chaos ensues.

    Universities learned and moved on.

    If you're anything but a whiny kid, you own your mistake, learn from it and move on. Don't blame someone else. The fact they're whining in the quasi pay walled "The Medium" says even more (I for one am fairly tired on getting nagged by them).

  40. Welcome to 2018 by kackle · · Score: 1

    Remember when we complained about foreign tech. support? Well welcome to 2018, where there is none, even if you're paying. This "disconnect" seems to be how life goes in this so-called Information Age. Everything is SO optimized for profit today, that getting assistance for anything is getting closer and closer to being non-existent. We were sucked into it when we were given free web browsers, etc., that came with no user's manual. I understood, since the stuff was (beer) free. Fine. But these days, I run into trouble and get no help, even if I'm paying. So the digital wall of fine print that the OP ran into doesn't surprise me at all.

  41. Re: Sorry, but...cheap advice. by Junta · · Score: 2

    s3 storage is *massively* expensive at scale, compared to in house. Even among cloud providers, there are competitors that are 75% less.

    --
    XML is like violence. If it doesn't solve the problem, use more.
  42. Microsoft does the same. by lpq · · Score: 1

    Had an identity I used on MS for support and forums for over 15 years. Tried to log in about a month ago and was told it had been temporarily suspended due to violations of their TOS. I tried to find out what
    happened but they refused to tell me anything. They told me the only way I could get my account back was to have them send a text code to my phone. Only phone I have that accepts text is my google number which MS
    WON'T ACCEPT. They don't have a system like amazon or google where they can call you @ a home number to have a robot announce a code over the phone. They demand your text#. If you don't have one, you don't get your account back.

    Tried following up with their support -- twice -- both times was told I violated something in their TOS, and could I review to see what it might have been (WAY too vage). and to use their text-msg system to recover my
    access (which I'd told both of the service reps didn't work -- and MS wouldn't take my voice number.

    Completely lame.

  43. Re:Sorry, but...cheap advice. by Junta · · Score: 2

    One, note that just because they are a big name, it does not mean all their decisions are bullet proof guaranteed the best. Dropbox has the exact opposite story to tell.

    For another, Netflix has a rather special position. They are *the* go-to reference customer for AWS. Amazon with almost every other breath references just how *awesome* Netflix is doing with AWS. As such, they assuredly have special status, Amazon is not going to just screw with Netflix because the second Netflix so much as whispers a negative AWS experience, their biggest reference customer has gone bad. If there is *any* company on the face of the Earth that can get away with single-sourcing from a cloud vendor, it's Netflix.

    For the 98% of customers who are not highly prized marquee customers.. Well your experience will deviate from stories about Netflix.

    --
    XML is like violence. If it doesn't solve the problem, use more.
  44. Multicloud to the rescue by jurtax · · Score: 1

    As a cloud migration consultant, I see a lot of companies going dual cloud with a sort of DR model in a second cloud provider to avid tat kind of scenario. Yes, it creates quite an overhead but it could be worth it.

  45. NEVER put all your eggs in ONE basket by p51d007 · · Score: 1

    I keep stuff on my laptop, pc, cloud, secure HDD's, safe deposit box. On a business scale, you should be keeping stuff backed up across multiple platforms.

  46. "The Cloud" is just someone else's computer. by jcr · · Score: 2

    Mission-critical functions should be kept in-house. Never farm out anything that can kill your business if your vendor fails to do their job.

    -jcr

    --
    The only title of honor that a tyrant can grant is "Enemy of the State."
    1. Re:"The Cloud" is just someone else's computer. by KC0A · · Score: 1

      Sure, do it yourself. If you can recruit, hire, and compensate the people (>=2) needed to design, install, and manage a server architecture with no single points of failure, and you are able to maintain geographically separated server rooms (>=3) so you aren't vulnerable to natural or man-made disasters (like a fiber cut).

      While you are at it, you probably should build your own electrical generating capacity and emergency water supply.

    2. Re:"The Cloud" is just someone else's computer. by malkir · · Score: 1

      Your vendor provides a service, and cloud providers provide what are largely ephemeral services. Mission critical infrastructure should be build with redundancy in mind... it sounds like not only did they not build things redundantly, but they did a major boo-boo if their *entire account* was shut down. Pay your bills, offsite backups, and terraform or something and this would have never have happened.

    3. Re:"The Cloud" is just someone else's computer. by thegarbz · · Score: 1

      Mission-critical functions should be kept in-house. Never farm out anything that can kill your business if your vendor fails to do their job.

      -jcr

      Don't be so narrow minded. There are many people out there that are far more capable at hosting mission critical functions than even some of the largest companies.

      More accurately: Mission critical functions should be covered by service level agreements based on the criticality of the function. "No phone to call" "Customer service chat is off" aren't words associated with cloud vendors. They are associated with someone renting cheap server space.

    4. Re:"The Cloud" is just someone else's computer. by Whiney+Mac+Fanboy · · Score: 1

      Mission-critical functions should be kept in-house. Never farm out anything that can kill your business if your vendor fails to do their job.

      [emp mine]

      Like legal advice? Networking infrastructure? Roads to your business? Power generation? Clean water delivery? Insurance? Property protection? Defense of borders? Being an absolutist pedantic asshole?

      --
      There are shills on slashdot. Apparently, I'm one of them.
  47. Humans don't understand it either by RogueWarrior65 · · Score: 1

    "We apologize for the inconvenience." bullcrap. I had the same exchange with the morons at Sears appliance service. They kept transferring me to god-awful call centers in Bangalore or some other place where their accents are so thick that talking to someone in the Deep South would be easier. After talking to six different people, I realized that they are all working off the same set of rules: Ask for the same damn information over and over, Try to sympathize, Pretend your computer is frozen, Blow off the customer.

    These bastards had the balls to try to sell me a whole-home warranty. Why the eff would I buy that when I can't get you bastards to come fix my dishwasher? No wonder your company is going down in flames and good effing riddance.

  48. Re: Sorry, but...cheap advice. by Desler · · Score: 1

    And your belief matters how?

  49. Lets be honest here! by oldgraybeard · · Score: 1

    Has anyone ever talked to anyone @ Google? I in all the years since I first heard "Google". I have never been able to chat with a live person on anything ever.

    To be fair, I have chatted with the
    Azure/365 folks (took time, many calls (2 months), but did get "their" Information Protection/Crypto issues worked out, once they stopped pointing the finger at me),
    Amazon (not so much, selling mostly, interface & whole experience sucks), AWS, never pulled the trigger, but did get through (pricing is mind numbing and complex),
    other smaller data centers, GoDaddy (Good/Bad), (their interface just keeps getting worst), others, etc pretty good.

    Just my 2 cents ;)

  50. Is it really true that you can't phone Google? by rainer_d · · Score: 1

    Or is it just a question of how large your account is?

    We're just a mid-sized MSP, but there's always a way to get someone on the phone, 24x7. The on-call number customers call is the security-service that does our physical security, they forward the call to the on-call engineer (after the customer is verified using a "password"). The intermediate step is to ensure people don't call the on-call engineer for sysadmin-tasks that could be done during business hours.

    Every customer can get the on-call number, provided they cough-up the money. For most, it's not worth it because servers are quite stable these days.

    The problem is of course that Google is so big and has so many customers that it's not possible to "know" every single customer anymore.

    --
    Windows 2000 - from the guys who brought us edlin
  51. Hard lessons by nehumanuscrede · · Score: 1

    Remember folks:

    The simple definition of " Cloud " is infrastructure you neither own nor control.

    By offloading this responsibility to a third party ( Google in this case ) you simply add an additional point
    of failure in the chain.

    With any substantial amount of money on the line, the better way to do things is to have your own servers
    ( preferably two locations, one primary and one backup ) so if one site goes down, it's more of an annoyance
    than a Class A Catastrophe.

    Most companies, however, have to get burned before they understand that there is a limit to the number of
    corners you can cut.

    1. Re:Hard lessons by Tough+Love · · Score: 1

      You can forgive smaller startups for going with what looks like the easier route. But when the business is well on its way, that is when to get out of the cloud, in the interest of minimizing risk and expense.This is also the point in the lifecycle where the business is most likely to believe that their judgment is infallible, simply because they are successful so far. Basically, it was this attitude that doomed Yahoo, do you want to be the next Yahoo?

      --
      When all you have is a hammer, every problem starts to look like a thumb.
  52. In a word by Tough+Love · · Score: 1

    In a word: arrogance. We see this way too much with Google, the new evil.

    --
    When all you have is a hammer, every problem starts to look like a thumb.
  53. Don't use Google Cloud's... by viperidaenz · · Score: 2, Insightful

    ... cheapest service they offer, the one that doesn't include 24/7 phone support - let alone a guaranteed SLA, to host your multi-million dollar wind/solar plant, where any service outage will cost you millions in service penalties.

    1. Re:Don't use Google Cloud's... by malkir · · Score: 1

      Seriously, this sounds mismanaged from top to bottom. Saying 'dont use Google Cloud' is like saying dont store files on computers because computers break. These guys just completely failed at devops.

  54. Exactly!! Ding, Ding, Ding! by King_TJ · · Score: 4, Insightful

    I may be one of the "old timers" who I'm told is thinking about things in an "old school" way when I say this. But I've *always* warned people that "The Cloud" just means you're giving somebody else the responsibility of handling your data and the systems it runs on.

    That makes sense sometimes. I'm not "anti cloud". But for anything really critically important to a business, I feel you should have it running locally and THEN consider cloud options as hot-failover sites, backup sites, etc. With cloud hosting, the whole thing is off limits to you as soon as your Internet circuit goes down, for one thing. With it running locally, you can still use it just fine anywhere on your LAN.

    But additionally, if the provider hosting your stuff goes bankrupt or merges with someone else, or just plain decides it's not profitable enough without some pricing changes -- where does that leave you? Technically, they can just disappear with your whole software and data configuration overnight. Or they can put trained apes in charge of maintaining things so it suddenly has huge security holes. Who knows?

    When you run things yourself, YOU are where the buck stops if things go wrong. If you're good at what you do, that should be more of a comforting thing than a scary thing. I've seen too many shops trying to cut corners on the I.T. hiring budget by bringing in less experienced people who really can't properly run the systems they're supposed to be caring for. The cloud for them is a crutch ... a way to get things done that are beyond their abilities. But that's not an ideal situation for a business to put itself in.
     

    1. Re: Exactly!! Ding, Ding, Ding! by reanjr · · Score: 2

      Here's the thing about hosting on AWS. When your shit goes down, everyone's shit goes down. Github is down, Slack is down, Pinterest, Reddit, etc.

      In the end, none of our customers blame us because we can simply point to all the other well known sites that are down at the same time.

      It is what it is.

  55. Re:Exactly!! Ding, Ding, Ding! by CaptainDork · · Score: 1

    I agree.

    Get this: At the initial meeting, I asked if there were any speed issues and the vendor said, "No, you'll operate much faster than you do now."

    The fucking latency was shit.

    And get this: The firm logged in using RDP.

    It was actually just one big duplication of our production servers (I had a dual system) loaded up to the "cloud."

    --
    It little behooves the best of us to comment on the rest of us.
  56. Millions? by p4nther2004 · · Score: 1

    child's-play. Try Billions.

    Last project involved cloud-computers where we processed a BILLION USD every year (credit card payments)

    And you're ALWAYS trusting a 3rd party. You own IT guys or cloud. Doesn't matter.

    What DOES matter is backups and fail-overs. (These guys NEEDED a proper failover)

  57. Site relibility engineer's opinion by buss_error · · Score: 1

    For the past 15+ years, I have worked with systems that must not go down. No - really, 1 second of down time in a year is intolerable for some applications. (Not always every system I work with, but at least some fall into this category.)

    If the original poster of this story failed to keep in mind when designing the infrastructure that a computer that won't ever go down isn't available, then he/she failed the first principal of fault flexible systems. That principal is to come to terms with the level of acceptable fault vs. the available budget. 9's cost money. How many do you want to buy? A uptime SLA for 99.99% is still three and a half days down over 1 year. 99.999% SLA will cost more. A lot more.

    From the OP " millions of dollars in lost revenue." - this is perhaps the first time I've heard that statement that I actually believe it. That still doesn't mean "you have chosen wisely".

    So, let us do a thought experiment in the absence of any data. I'm going to get things wrong here for this particular situation because I don't know details and will be making assumptions.

    Data Storage: Sounds like "We don't need backups because we have RAID" issue. Sorry, study availability zones in S3 on Amazon. You can stripe your RAIDs across a local zone, stripe it to other data centers geographically separated. On Linode, you can rsync between data centers. On Rackspace, you can use object storage in multiple data centers (Linode and Rackspace this will require software to accomplish as they do not offer a API to fire off copies unlike S3 does.) This ensures that the totality of data is not stored in one basket. That still doesn't back it up - and you need to do that too. Data may need to be restored due to human as well as machine error. One situation I remember was engineering a database system with 4, 8, 16, 24, 36, and 48 hour delayed slaves for super fast point in time recovery - with hot transaction backups that could be selectively applied. Just to add to the server count, it was also Master|Slave^6/Master|Slave^6 - 14 DB servers and 7 ingress data clusters of 5 each. (I'd say if you are in the US, there is a 100% chance you've done something that was processed by this system in the last year if you drive a car.)

    Data Acquisition: This should use something like Atom Hopper/Rabbit MQ/Service Mix (oh god, please not Service Mix) in a clustered environment across data centers. Use DNS round Robbin for a cheap way out to find your ingress servers but requires some consideration of DNS timeouts. Datapoints are posted as messages, and servers claim the work entry from the Atom Hopper queue. This prevents the situation where clients cannot report their data, and also protects the processing servers from having a regional failure or failing to process a data message.

    Also, systems should be configured so that Ansible, Salt, Git, or Puppet can automatically build systems in the cloud when stressed or even a total data center failure. Strict deployment and versioning should be enforced so that no server can't be replaced at the drop of the hat somewhere else due to "one off" changes.

    Treat servers like cattle, not pets.

    Last - KNOW YOUR VENDOR. This is absolutely critical. Here is something I suggest you try for your self - call the support line and time how long it takes to get a human on the phone. Remember that some vendors cost more than others, and there's a good reason for that. Getting, keeping, and paying really smart people to help you out with zero notice costs money. A lot of money. And a vendor that will be willing to let you skip tier I - III support (after you've proven you're not an complete idiot) is even more rare. I know of only one company that will do that, and it's the same one that will win the call and get a human test. But they cost twice to three times what others do - because they give you what the others will not.

    What I hear in the OP's post is that they suffered a catastrophic failure and rate limits and fraud protection kicked in. I do understa

    --
    Necessity is the plea for every infringement of human freedom. It is the argument of tyrants; it is the creed of slaves.
  58. Sorry, but this is your fault. by malkir · · Score: 2

    Sorry, but this is your fault. We put multi million and billion dollar clients on AWS and GCP and have never, ever had an issue like this. 1) You're hosting in the cloud and not actually understanding what a cloud provider is or does. 2) You don't have a plan that is reasonable for a multi-million dollar organization, one that includes some level of support or SLA. 3) You're not building with DR in mind. Data should be backed up somewhere safe. Your infrastructure should be 'infrastructure as code' which can be spawned up essentially at a moments notice. 4) Your admins dont have an escalation path that doesnt involve waking up the CFO. 5) Someone did something wrong, either you didnt pay the bills, or your usage was completely fucked so badly they shut your account down? I bet money that it was you didn't pay your bills. Seriously... no offense, but you guys need to look at your business and spend a bit more time/money/effort on building this stuff up in a way that doesn't just fall over...

  59. You must suck at AWS by mveloso · · Score: 1

    Sorry, you must be really bad at AWS.

    Amazon's hardware isn't HA, your solution is.

    I've run small to huge workloads on amazon, and saying that it's designed for companies that have hundreds/thousands of servers is totally wrong. In fact, it's not really that well designed for tons of servers because they don't really have a lot of built-in automation to handle hundreds, if not thousands, of servers.

    Just because you use a cloud provider doesn't mean that it's turnkey. You still have to know everything. The difference is it costs less and generally it's easier.

    1. Re:You must suck at AWS by mrfaithful · · Score: 1

      Amazon's hardware isn't HA, your solution is.

      So you're saying it's like RAID, you take a stack of unreliable hard drives and through the power of redundancy you make a reliable data set that's more than the sum of its parts. Except in this case Amazon is just providing you the hard drives and you have to do the RAID on your own? It seems every time I hear about cloud computing the problem domain it actually solves seems to get smaller and smaller.

    2. Re:You must suck at AWS by Wycliffe · · Score: 1

      Sorry, you must be really bad at AWS.

      Amazon's hardware isn't HA, your solution is.

      Just because you use a cloud provider doesn't mean that it's turnkey. You still have to know everything. The difference is it costs less and generally it's easier.

      Which is my point exactly. For a small shop without a dedicated IT team that can implement a HA setup, Amazon is a bad choice. Amazon is designed to sit under a HA solution. There are lots of providers that provide more stable single servers and have a phone number you can call for free if something is not working. They aren't HA solutions but most small businesses aren't willing to pay for the expertise needed to have a true HA setup.

  60. Cloud is awesome ... with a failover in place. by Qbertino · · Score: 1

    Cloud is awesome. Scale indefinitely, work out the kinks, test your product. All on a shoestring/pay-as-you-go budget. Very nice.

    Yet Only cloud is shite in production. Provider X (in this case Google) has you by the balls. You do *not* want that. Tried and true fallback and failover with your own Docker setup and a stack of rented blades in a rack including nightly to-my-desk backups are an absolut must for any critical infrastructure. No matter how cool Kubernetes and Spanner are handling your stuff today, you at least want to be able to save your data tomorrow when things turn south.

    BTW, all this is Captain Obvious speaking.
    This isn't news, this is basic web stuff 1-oh'-1 that every third-grade webshop running critical WordPresses knows. It's really the exact opposite of rocket science.

    The kid who set up this disaster deserves a smacking.
    Lesson learned I guess.

    --
    We suffer more in our imagination than in reality. - Seneca
  61. If you look at the first comment in TFA by Chrisq · · Score: 5, Informative
    If you look at the first comment in TFA it turns out that the poster was using a consumer account for a million-dollar mission critical application:

    I highly recommend establishing an enterprise relationship with Google Cloud. It seems you are running a mission critical application on a consumer account and this issue could have been avoided. Reach out to the support team and let them know you want to discuss enterprise options to ensure you have done everything possible to ensure your account is never impacted like this in the future. Ping me if you have any trouble getting through. - Mike Kahn Customer Engineer, Google Cloud. All views and opinions are my own. @mkahn5

    This sounds like asking fro trouble to me!

  62. Actually you have SLAs by overnight_failure · · Score: 1

    SLAs are what ensure companies can't do this to you. If you don't have an SLA with the cloud provider then you should probably run across multiple clouds and/or in-house infra.

  63. You shouldn't just use one service by johnsie · · Score: 1

    I worked at a Hydroelectric project and used GCM to alert people when the values of PLCs hit certain thresholds (low voltage, high temps etc). There is no way I would just use Google for this though. These alarms are important, you should never just depend on one service. We also had a directly connect pro-face (with alarms) screen in the control room as well as various live graphs, android apps and widgets and email alerts. So if Google Cloud Messaging went down there were still other ways to detect a problem.

  64. Nope.... by MerlTurkin · · Score: 1

    I never use any online cloud. Stupid idea IMHO.

  65. Contracts by sjbe · · Score: 2

    As other people have pointed out, the magic letters here are "SLA". You must have a contract stating what the vendor's responsibilities are and be able to enforce that contract.

    A contract is only as valuable as your ability to ensure it is enforceable. When you are dealing with a company the size of Google they can hire some flesh eating lawyers and have the bank account to keep you busy until you die and so if you plan to bring a lawsuit you'd better be prepared for shock and awe. Just having a contract isn't enough by itself.

    You are right that a service level agreement is a very good idea but it isn't going to matter if it is cheaper for them to screw you anyway.

    Otherwise, you don't have a business, you just have a hobby.

    That's a nice sound bite but it's complete BS. When you are a small business or a startup you generally simply don't have the resources to fight a company the size of Google. You can have whatever agreements you want but if they decide to screw you there isn't much you can do about it. I've started several companies where we had to depend more than is ideal on a single large vendor and it's freaking terrifying if/when you don't have alternatives - contract or no.

    1. Re:Contracts by drinkypoo · · Score: 1

      Otherwise, you don't have a business, you just have a hobby.

      That's a nice sound bite but it's complete BS. When you are a small business or a startup you generally simply don't have the resources to fight a company the size of Google.

      I don't think that makes it BS. It just makes it spectacularly hard to have a credible business in the current market landscape, where there are a few big players wielding disproportionate power.

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
  66. Comment removed by account_deleted · · Score: 1

    Comment removed based on user account deletion

  67. Google is a machine, no human beings are involved by kbdd · · Score: 1

    As much as I appreciate Google's services as an individual, I do not use them for any business-critical need for the exact reasons listed in your article. The service may be generally reliable, but what is not acceptable is the casual way they handle exceptions. It clearly defines the pecking order. They are the masters, you are the peon, the paying peon but still a peon.

  68. Dont wake the CFO by trybywrench · · Score: 1

    If millions are on the line and you don't wake the CFO, he/she/it is going to be very angry. Chances are, if millions are on the line, the CFO will be waking you up even if the problem is technical.

    --
    I came to the datacenter drunk with a fake ID, don't you want to be just like me?
  69. not a failure by sdinfoserv · · Score: 2

    This is not about a server failure, router failure, logic bomb, or hacking incident - this is a hosting company deciding to flip a switch and simply turn off your entire infrastructure because an automated process determined "something" in your entire ecosystem was abnormal......
    Frack that. never. not 1 dollar.

  70. Re:Exactly!! Ding, Ding, Ding! by shaitand · · Score: 1

    Absolutely agree. The benefits of "the cloud" which is one of a string of terms which initially had zero meaning and people kept trying to figure it out until they'd invented something... can be had in a private cloud infrastructure, anything else is just leveraged hosting which should definitely be a no-no for enterprise.

  71. If it was so important... by jbmartin6 · · Score: 1

    If the site was so important, why didn't they plan for an outage? Sounds like they just assumed it would always be up. Why didn't they ask Google about the conditions and possible sources of outages, and plan accordingly? My employer has done multiple cloud deployments, and this is always part of the planning. What do we do if cloud provider goes down or experiences and error? What are their operational norms around infrastructure changes and the like? What is our downtime tolerance? Is the cost worth the risks (i.e. not that important) or do we invest in multiple sites? Perhaps not the whole story is here, but it sure sounds like they didn't do much due diligence before throwing all in on "cloud"

    --
    This posting is provided 'AS IS' without warranty of any kind, implied or otherwise.
  72. Run locally and use could for disaster/peak load by Proudrooster · · Score: 1

    Here is my bit of wisdom. Get everything running locally in a cloud container framework like Heroku or whichever on you prefer. When and if you get to the point where you need disaster recovery or more scalability you can shift everything to the cloud until you recovery or at peak periods load balance to the cloud.

    The cloud is not magic. It just means that other systems administrators are running your servers and they do screw up quite often.

    If you are really serious about running in the cloud and having high reliability, let me introduce chaos monkey
    https://github.com/Netflix/cha...

    This is what Netflix uses to make sure that they can keep running with Amazon availability zones, instances, or whatever they call them disappear. I believe that the day that Amazon accidentally took all of it's storage offline and killed half their cloud, that Netflix survived as was able to keep going.

    Read this for amusement: https://aws.amazon.com/message...

    Ooops, we deleted AWS....

  73. Re:Sorry, but...cheap advice. by shaitand · · Score: 1

    Netflix gets less and less stable daily... were you under the impression that large enterprises are making good decisions lately? As someone who works on enterprise infrastructure I can give you a hint, they aren't.

  74. Re:Exactly!! Ding, Ding, Ding! by drinkypoo · · Score: 1

    The benefits of "the cloud" which is one of a string of terms which initially had zero meaning and people kept trying to figure it out until they'd invented something

    False. "Cloud computing" originally meant that you were leasing instances on someone else's servers, and you could spin up more instances any time you were willing to pay for them. That's still a useful meaning, unfortunately people now have "private clouds" which we just used to call "clusters", so the phrase actually means less than it did originally.

    --
    "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
  75. You put such mission critical stuff ... by cascadingstylesheet · · Score: 1

    You put such mission critical stuff on "Google Cloud"?

    Why??

  76. Re: Exactly!! Ding, Ding, Ding! by CaptainDork · · Score: 1

    Part of the migration should be not.

    --
    It little behooves the best of us to comment on the rest of us.
  77. Good - Fast - Cheap: Pick two. by rocket+rancher · · Score: 1

    There is a three-part mantra that project managers learn at the enterprise level:

    1. If you want it good and fast, it will not be cheap.
    2. If you want it good and cheap, it will not be fast.
    3. if you want it fast and cheap, it will not be good.

    This project fell to the third part.

  78. Re: Exactly!! Ding, Ding, Ding! by HornWumpus · · Score: 1

    You work in an industry where an excuse is good enough. Which is nice for you.

    --
    John McAfee 'It was like that time I hired that Bangkok prostitute; to do my taxes, while I fucked my accountant'
  79. Your on-prem req is a rounding error by tepples · · Score: 1

    If a vendor is selling to market that needs an on-prem requirement, it's a non-starter to not offer such a feature.

    Many vendors, after doing their market research, have concluded that on-premises requirements such as yours are a rounding error. The benefit of satisfying them does not exceed the opportunity cost of monopoly rents that can be extracted by not satisfying them.

  80. Cloud....its all hot air. by da_Den_man · · Score: 1

    Sooner or later folks are going to have to realize that "Cloud" is just market-speak for another box that you don't have access to (until you pay the $9.95 access fee...). I have run my "own cloud" since 1999. Sits right across from me, and anytime it goes down I have direct and full immediate access. And this is on stuff only I consider important (apparently..)

    --
    You keep going until you die..."Me".
  81. In other words.. by LaughingRadish · · Score: 1

    In other words, one must also consider malice from the hosting provider when shopping for a hosting provider.

  82. Rained on by ebvwfbw · · Score: 1

    The cloud rained on him. If it's a bad enough storm, you lose everything.

  83. Re:Sorry, but...cheap advice. by omnichad · · Score: 1

    They are *the* go-to reference customer for AWS.

    I'd reserve that spot for Amazon themselves, but Netflix is a good third-party reference. Kind of scary to let your direct competitor be the host of your content, though.

  84. Basic risk mitigation 101 by Assmasher · · Score: 1

    If you're not going to be hybrid (which gives you other opportunities) then you simply DNS load balance between regions AND cloud providers. Easiest way? Containerize.

    The complexity comes into play around your databases, but there are a myriad of well known solutions to all of these problems.

    There's no other rational way to provide solid service that to spread your risk.

    Setup DNS Failover (or load balance if you prefer)
    Setup in multiple AWS regions.
    Setup in multiple GCE regions.
    Optionally setup for Azure
    If you're really paranoid, have an on premise instance somewhere local to you (or a metapod in house or something similar.)

    Containerization makes all of this vastly simpler than in the past.

    As many others have mentioned - don't trust anyone to be up all the time, trust that at least one of them will be up all the time.

    --
    Loading...
  85. Cloud services have value by DRJR · · Score: 1

    There seems to be a lot of comments saying why would you ever use someone else's infrastructure. The last few companies I worked for have needed to and they were big companies. You pay services to be reliable. They'd lose business if they weren't.

    Building infrastructure is not cheap. There's a reason only a few companies build things like electrical grids and telecommunications networks. Businesses do business with other businesses. Unless your server needs are small and simple or you have a lot of money to burn, there is value of large-scale distributed server centers. Even if your server needs are small, you should have a guaranteed better uptime than doing it locally-- they have redundant machines with redundant storage and redundant power supplies with generator backups. Also, having your server and data offsite and backing them up means you don't lose everything if your brick and mortar business catches on fire.

    Check your provider out if you have concerns and make sure you have a disaster plan. Also, redundant cloud services is always an option.

  86. Re: Exactly!! Ding, Ding, Ding! by reanjr · · Score: 1

    SMBs run their businesses on SAAS services. So, essentially anyone serving SMBs is in the same boat.