Slashdot Mirror


Akamai DNS Outage Messes up Net

katre writes "Checking all my favorite sites this morning, I saw that about half a dozen seem to be offline. Trying to figure out why, I found an interesting article on the front page at http://isc.incidents.org/. Seems that the problems at Akamai are screwing over Yahoo, Google, Microsoft, Fedex, Xerox, Apple, and others. Whatever happened to my decentralized net with no single point of failure?"

33 of 522 comments (clear)

  1. I'm definitely not a technical guru... by Dagny+Taggert · · Score: 5, Interesting

    but I believe the centralized concept of the 'net is something that is coming to an end, much to our loss. I'm pretty bothered by the fragility of this system. How many of you can't work without web access?

    --
    Don't be a looter...and yes, I know that it's spelled with an "A" instead of an "E".
    1. Re:I'm definitely not a technical guru... by Elecore · · Score: 3, Interesting

      I do. In fact, I'm trying to move as much of my work onto the web as possible. First it was email, then my calendar and contacts (Horde). Now, I'm starting to run finances online so I can access it from anywhere. A few years ago there was a lot of hype about moving all software to webbased software and I think this is still the eventual outcome of the internet. It makes sense to everybody. No costs in making CDs or packaging, ability to work on anything from anywhere, and easy ability to upgrade software. For example, say the latest version of word was access from apps://MSWord. When a bug is found, MS patches the program and everybody is now using the fixed version. Admitantly, this gives more control to the manufacturer. If they want to add/remove a feature, you have no choice in upgrading. Hmmm... ok, so I'm inconclusive. Personally, I like the idea of a full PC of software and data available to me regardless of where I am. So yes, I rely on the internet to work and if we move towards this way, problems like you are describing will eventually be solved. Supply and demand. Thoughts?

    2. Re:I'm definitely not a technical guru... by Malc · · Score: 4, Interesting

      I'm not being condescending, but unless you're very junior or just a small pawn in a very large company, even server mainenance positions require some offline work. There's always some planning that needs to be done for tasks during today, the coming week, or even long term. Phone calls can be made, documentation updated, etc. It really depends on how long the outage is for and how quickly you're able or willing to switch gears and tasks.

      I realise that some jobs are much more inpractical when there is downtime, but not everybody even here on /. requires 100% connectivity. I myself live over 4,000 km from work (I haven't even met them face-to-face for more than four years) and I would be pretty pissed off and delayed on one of those days I have to do all my work via Terminal Services on machines at the main office or colocation facility... but let's be honest, a day's outage wouldn't mean I couldn't do any productive work. I once had a 10 day outage when the local telco switched my DSL line from interleaved channelisation to fast-path, but as I was in the middle of a long stretch of software development, I really didn't need a lot of internet. Batches of dialup and patience all-around sufficed ;)

    3. Re:I'm definitely not a technical guru... by jdray · · Score: 4, Interesting

      I work for an electric utility and, with the current state of web-based scheduling of electricity in the U.S. (a mandatory requirement by regulatory agencies), loss of proper traffic routing on the Internet can have difficult-to-overcome effects.

      --
      The Spoon
      Updated 6/28/2011
    4. Re:I'm definitely not a technical guru... by Buran · · Score: 2, Interesting

      A lot of what I do can be done offline, although I use the Web a lot to download scientific journal articles (I work in a med school research lab). However, while you may say "go down to the library and photocopy the articles you need", that only works in some cases. Photocopies are far more expensive for color pages (so we photocopy in greyscale only) and the print versions of many journals are no longer subscribed to because the storage space savings are substantial, there's no books to have to inventory and track (and rebind if they get old or repair them if they are damaged, and people do do that -- someone stole the original Watson & Crick DNA article out of our '53 Nature archival copy, and so the replacement was a crappy Xerox version that has horrible quality and is worn out... grr. Discovered that when I was asked to get a copy of the article.)

      I find dead-tree editions easier to work with, anyway.

    5. Re:I'm definitely not a technical guru... by Ancipitis · · Score: 2, Interesting

      This isn't quite how it works. The point of Akamai's service is to resolve to different servers based on your (network) proximity to them. If you use some web-based DNS resolver, you are going to get a server closest to that resolver -- not closest to you. And who knows when that particular Akamai server is going to be taken down? Happens all the time -- and their DNS servers dynamically map you somewhere else.

      For most sites using Akamai, there is no "real" address -- you ALWAYS are going through an Akamai server. It is simply a matter of which one. The Akamai server is the one that connects to the company's server (if it needs to).

  2. points of failure by rlthomps-1 · · Score: 4, Interesting

    DNS dying on you? Just throw it on the pile of other connection problems

    I think everyone has several "single" points of failure -- my cable modem dies at least twice a month and my wireless router conks out at least twice a day ;)

    1. Re:points of failure by rlthomps-1 · · Score: 2, Interesting

      well, if my cable modem doesn't work, I really don't give a damn about akamai, do i? So to me, its just as important.

    2. Re:points of failure by Speare · · Score: 2, Interesting

      Try keeping your cablemodem a bit cooler than your average home. I keep my cablemodem cooled by a 120mm fan 24/7 and it never goes down. The day my fan's bearings failed, the cablemodem dropped signal again.

      --
      [ .sig file not found ]
  3. releted to linux kernel DoS exploit? by Anonymous Coward · · Score: 4, Interesting

    Do we know if this at all related to the Linux kernel 2.4.2x/2.6 DoS exploit discovered yesterday?

  4. Preformance vs reliability by Kardnal · · Score: 3, Interesting

    When Akamai's system was first announced, most people thought this was a great idea. It made sure that the sites that used this technology would always have the bandwidth they needed, when they needed it. Like with everything else in life, there's always a trade-off between preformance and reliability...

    --
    ------------------
    "Never Attribute to malice what is adequately explained by stupidity..."
  5. Lack of notification by sphealey · · Score: 5, Interesting

    What ticks me off about this incidents (and I suspect that there have been several in the last 6 months) is that there is absolutely no notification given, either during or after the event. During this outage, some news outlets were still reachable (including Slashdot), and a simple notification would have saved hours (* 10s of thousands of network dudes worldwide) of time and much grief from the big bosses who couldn't reach Yahoo Finance, I mean critical business web sites.

    Are these guys so convinced of their omnipotence and indispensibility that they don't feel the need to communcate with the world about what is going on?

    sPh

    1. Re:Lack of notification by pandrijeczko · · Score: 2, Interesting
      a simple notification would have saved hours (* 10s of thousands of network dudes worldwide) of time and much grief from the big bosses.

      Erm, ever heard of traceroute, maybe followed by a quick packet sniff?

      Can't see why it would take more than a few minutes to prove this sort of problem as being outside of your responsibility and network.

      --
      Gentoo Linux - another day, another USE flag.
  6. Re:Terrorist attacks, anyone? by ZHaDoom · · Score: 2, Interesting

    Akamai is a distrubuted server platform they are all over a hard target, but they are prone to software updates and virus. =)

    --
    War isn't about who's right. It's about who's left.
  7. Akamai's DNS black magic by frankie · · Score: 4, Interesting
    Akamai uses (some would say ABuses) DNS in ways the rest of us (even global megacorps) wouldn't dare. Half of Akamai's magic is their 10000+ carefully-scattered servers, but the other half is their routing. Those servers are listed differently depending on where you ask from.

    It's not like a092156fg.akamai.net is in Seattle and k1039665.akamai.net is in Saskatoon. Instead, all of *.akamai.net goes to whatever cluster is "closest" to the requesting IP (based on BGP, Colonel's Secret Recipe, etc)

    So if Akamai's DNS gets screwed up, I would expect major weirdness. And as more sites join EdgeSuite (where you host your entire domain on Akamai's servers & DNS) the effect must magnify.

    Of course, I could be completely wrong. I'm not a routing god, just a guy who thinks Akamai is a cool hack.

  8. I wonder by rabtech · · Score: 2, Interesting

    I wonder why these companies wholly switched their nameservers over? Why not have #1 and #2 be Akami, and #3 & #4 be your own nameservers? Preferably on different coasts or in different countries.

    This would seem an obvious solution. You are allowed to have many nameservers you know...

    --
    Natural != (nontoxic || beneficial)
  9. Re:Root servers not decentralized? by Syberghost · · Score: 4, Interesting

    The fact is that there are 13 of them, in widely scattered locations across the globe, and it's not decentralized?

    Damn man, what exactly would you consider "decentralized" then?


    Akamai has 13, in widely scattered locations, as well. That in itself doesn't make them sufficiently decentralized.

    The reason the root servers don't have this problem is that they don't all run the same software (anymore) and aren't all administrated by the same people.

    I'm making an assumption here, of course, but I will not be a bit surprised if it turns out that Akamai loaded something that hit all their routers at once.

  10. Dogpile by RAMMS+EIN · · Score: 2, Interesting

    So I wasn't the only one who couldn't get to Google the Great. Fortunately, Dogpile still worked. I used that meta search engine until Google started getting big and beating all the others in turning up relevant search results.

    I wonder if Google will now turn to fully manage all their assets themselves...

    --
    Please correct me if I got my facts wrong.
  11. Re:decentralized net? by Nasarius · · Score: 2, Interesting
    IIRC, Helldesk people bitch about this - people calling up and saying 'I can't get to www.mytimewastingbullshitpage.com, is the net down?' Not realizing that just becuase one or two or thirty sites are down, the net is still up....

    Now now. I'm sure most of these people don't actually mean "is the Internet down"; they really mean "is something wrong on your end?", they just lack the technical experience and vocabulary to really understand things.

    When a number of sites stop working, it can be for several reasons. The last time it happened on my ISP, part of their backbone was down.

    --
    LOAD "SIG",8,1
  12. Reminds me of a story by Venner · · Score: 5, Interesting

    Not too long after 9/11, I was surfing the net and needed to look up something at the Library of Congress for one of my classes. It wouldn't connect. At first I thought we'd just lost DNS (not so uncommon an occurance at my university in those days), but found I could still connect to slashdot.org and some other sites.

    Being a geek, I thought up a list of about 30 sites to ping, scattered across the US. (.govs and .edus mostly.) The ones that replied, I plotted on a US map based on their DNS LOC. (A project I wrote for a previous class.)

    I freaked out a bit when the mid-atlantic seaboard came up missing. I crossed my fingers hoping that it was just some idiot who'd accidently cut one of the main fibers (which it what it ended up being) and not that Washington DC was now a big hole in the ground.

    --
    A preposition is a terrible thing to end a sentence with.
  13. Interesting... by bonch · · Score: 2, Interesting

    I remember when people were bashing Microsoft for using Akamai caching to avoid Windows Update getting hit by the first RPC worm (the one that was patched two months beforehand), since Akamai used Linux and it was somehow amusing that Microsoft chose that caching service.

    If Akamai was running on Windows servers, I guarantee it would have been mentioned in both the headline and in the article summary today. But instead it's just mysterious "DNS issues." It's kind of like how when that Windows source code was stolen, Slashdot reported on it yet neglected to mention that the code was stolen from a hacked Linux computer at a company called Mainsoft.

    Just little slants in reporting I can't help but notice.

    1. Re:Interesting... by SirTalon42 · · Score: 1, Interesting

      It wasn't on a Linux computer. At ONE POINT in time it was on a Linux computer. The file list is proof of that.

  14. Wild Whois results by dankstick · · Score: 2, Interesting

    Take a look at what internic.net gave me on some of these domains....

    Microsoft.com
    ----
    MICROSOFT.COM.SUX.BUT.PYROF REAK.ORG.RULEZ.AND.DIOX YTECH.NET.DELETED.GANDI.NET
    MICROSOFT.COM.SMELLS. SIMPLECODES.COM
    MICROSOFT.COM.SHOULD.GIVE.UP.BECA USE.LINUXISGOD.CO M
    MICROSOFT.COM.RAWKZ.MUH.WERLD.MENTALFLOSS.CA
    M ICROSOFT.COM.OHMYGODITBURNS.COM
    MICROSOFT.COM.LOV ES.JU1C3.COM
    MICROSOFT.COM.LIVES.AT.SHAUNEWING.CO M
    MICROSOFT.COM.IS.NOT.AS.COOL.AS.SIMPLECODES.COM
    MICROSOFT.COM.IS.IN.BED.WITH.CURTYV.COM
    MICROSO FT.COM.IS.GOD.BECOUSE.UNIXSUCKS.COM
    MICROSOFT.COM .IS.A.STEAMING.HEAP.OF.FUCKING-BULLSH IT.NET
    MICROSOFT.COM.HAS.TEH.GAY.OMFGLOL.COM
    MIC ROSOFT.COM.HAS.ITS.OWN.CRACKLAB.COM
    MICROSOFT.COM .HAS.A.PRESENT.COMING.FROM.HUGHESMISS ILES.COM
    MICROSOFT.COM.FLINGS.POO.AT.MONKEYCORE.C OM
    MICROSOFT.COM.FILLS.ME.WITH.BELLIGERENCE.NET
    MICROSOFT.COM.CAN.GO.FUCK.ITSELF.AT.SECZY.COM
    MIC ROSOFT.COM.ARE.GODDAMN.PIGFUCKERS.NET
    MICROSOFT.C OM.AND.MINDSUCK.BOTH.SUCK.HUGE.ONES.AT. EXEGETE.NET
    MICROSOFT.COM

    Yahoo.com
    ---
    YAHOO.COM.WANADOODOO.COM
    YAHOO. COM.TWIXTEARS.COM
    YAHOO.COM.TW
    YAHOO.COM.SUPERCB CENTER.COM
    YAHOO.COM.SG
    YAHOO.COM.PURRFURRED.COM
    YAHOO.COM.OPTIONSCORNER.COM
    YAHOO.COM.IS.N0T.AS .1337.AS.SEARCH.GULLI.COM
    YAHOO.COM.DALLARIVA.COM
    YAHOO.COM.BR
    YAHOO.COM.BERKELEYNATURALBEAUTIES. COM
    YAHOO.COM.AU
    YAHOO.COM

    Altavista.com
    ---
    ALTAVISTA.COM.IS.N0T.AS.1337 .AS.SEARCH.GULLI.COM
    ALTAVISTA.COM

    Apple.com
    ---
    GOOGLE.COM.SUCKS.FIND.CRACKZ.WIT H.SEARCH.GULLI.COM
    GOOGLE.COM.HAS.LESS.FREE.PORN.IN.ITS.SEARCH.ENGI NE .THAN.SECZY.COM
    GOOGLE.COM

  15. Re:Root servers not decentralized? by Omnifarious · · Score: 4, Interesting

    The root nameservers are not under decentralized political control, which still makes them a single point of failure, albeit a different kind of failure.

  16. Hmm . . . by npsimons · · Score: 2, Interesting

    "Checking all my favorite sites this morning, I saw that about half a dozen seem to be offline. Trying to figure out why, I found an interesting article on the front page at http://isc.incidents.org/. Seems that the problems at Akamai are screwing over Yahoo, Google, Microsoft, Fedex, Xerox, Apple, and others. Whatever happened to my decentralized net with no single point of failure?"

    Hmmm, corporate whore much? Slashdot, Debian and my own two sites seem to be working just fine. Maybe the sites you choose to visit just don't get the 'net and it's decentralized nature.
  17. Living w/out instant net access, therapeutic by greendot · · Score: 2, Interesting

    For 10 years I was a net junkie. If I didn't get my email, news, laugh, or enough time on my fav mmorpg then I was twitchy and grouchy.

    Then, two years ago my wife and I decided to take a year off and go tour SE Asia, mainly Viet Nam.

    Yes, they have Internet there but it is mainly in Internet cafes, which are hot, crowded, and quite slow. There are dialups but once you've lived on broadband for such a long time the dialup becomes something you use only when you have to. And so that was what happened. Internet became something that was used when needed. I still checked my email regularly but instead of every hour it was every 2 or 3 days, same with Slashdot. :)

    I had a few personal (programming) projects I was working on which fit nicely onto the laptop, along with a good 20gig of mp3s. I was amazed at how fast I detached from the net. My productivity shot thru the roof, namely because my concentration was focused.

    Even here in the states I have yet to reach that state of Zen again primarily because, even though I try, I know the net is right there. The little net thoughts nag at you.

    But, back to the topic. You would be amazed at how much technical work you can accomplish without the net being there.

    Would I give up what I have now and go back? You bet. Would I miss it? Nope. Broadband is used for P2P or games. That's all I use broadband for anyway. :) But while I'm here in the states, I *need* to be connected. I think because everybody else is.

    On a global scope, 99% of all the really cool groundbreaking stuff in the last 100 years, computer or not, was done detached from the net.

  18. it's like your rights, you can sign them away by swschrad · · Score: 2, Interesting

    and folks often do... witness the onerous "personal contracts" you have to sign to get into the music business, where you are essentiall a creative wage slave and don't own your stuff. non-compete and discoveries-belong clauses in your work contract also sign your rights away to The Man. similarly, if you register your DNS information independently and run your own servers, your ISP and its uplines do the same, and so on including all the sites you visit, you theoretically should not be captive to any of the commercial DNS services.

    as I understand it, akamai is a distributed content hosting/caching service that also does DNS server services. they put a blade in your local ISP under contract, and popular pages from their customers serve off the local akamai server cache. they handle the DNS for those sites as I understand. if their blade caches get fed evil data, you get evil data, and www.fartblossom.org may disappear.

    you can kill DNS by screwing up your own router, too. lots of ways to kill a distributed service that requires everybody to cooperate on a common set of standards and parameters.

    --
    if this is supposed to be a new economy, how come they still want my old fashioned money?
  19. Created SPoF by Todd+Knarr · · Score: 4, Interesting

    The problem is that those sites created their own single point of failure by all using Akamai for DNS. When Akamai DNS fails, sites that depend on it for their own DNS fail.

    It used to be nearly impossible for this to happen. The original rules for DNS were that you had to have at least 2 nameservers for your domain, preferrably 3 or more, and they couldn't be on the same physical networks. With that rule having a single network go down rarely made any domain unresolvable (backbone networks whose outages could render dozens or hundreds of other networks unreachable being the exception). Maybe we should put the old nameserver-diversity rules back into place.

  20. Re:Uh by bonch · · Score: 2, Interesting

    I guess seeing things like "PWD=/usr/ms/win2k_sp1/private/security/msv_sspi" isn't enough to convince this troll. So do a Google search, like I said. The code was taken from one of Mainsoft's hacked Linux machines. This was already reported in the past on other sites.

  21. Re:"DNS was not quite designed in such a way" by chef_raekwon · · Score: 2, Interesting

    im sure the admin made a mistake -- huge corporations do not put everything on a thin wire, and hope they stay up. there are multiple connections and multiple servers. if the stuff goes down, its usually because of a mis-deployment of some new code, or, mis-deployment of some zones

    whatever...i couldnt read the article either -- it wouldnt resolve. oh, the irony.

    --
    We're like rats, in some experiment! -- George Costanza
  22. No, not really... by sterno · · Score: 2, Interesting

    I've been on-line a lot today and didn't even know those sites were down. Didn't effect me in the least. The internet, by it's nature, will always be plagued by the occasional downtime of various services here and there. But in the end, the Internet keeps moving right along.

    Think about the worst thing that's ever happened to the Internet and how much that really impacted your daily activity. I don't know about you, but it's always been local connectivity failures that have caused me the most trouble. The occasional site being down really doesn't make a big difference.

    --
    This sig has been temporarily disconnected or is no longer in service
  23. Just got off the phone with Akamai... by LordJezo · · Score: 2, Interesting

    They are telling me that it was indeed an attack, but an attack aimed not only at them but other companies as well.

    I wonder what really happened and who else was attacked..

  24. Re:TROLL/KARMA WHORE ALERT by nick_marden · · Score: 2, Interesting

    The Register must be wrong about this. I used to work at Akamai, and I feel pretty damn sure that no one crashed those servers by getting *on* them to run the 20-line snippet of code that locks the kernel (assuming we're talking about the kernel lock exploit that was being widely discussed recently; it requires shell access).

    What is much more likely is that somebody found a way to DDOS the Akamai top-level name servers, or that configuration files containing incorrect/conflicting/nefarious information were pushed out to the top-levels.

    Knowing how many stages and checks there are in the Akamai deployment procedures, and how much monitoring there is of the network health, I would be astonished if someone managed to foobar the top-levels with a bad configuration. A co-wortker of mine did it once, a long time ago, so I guess it *could* happen, but it was one of those perfect-storm sorts of things. And even then, it just slowed things down a little - certainly not enough to make the news like this.