Slashdot Mirror


Akamai DNS Outage Messes up Net

katre writes "Checking all my favorite sites this morning, I saw that about half a dozen seem to be offline. Trying to figure out why, I found an interesting article on the front page at http://isc.incidents.org/. Seems that the problems at Akamai are screwing over Yahoo, Google, Microsoft, Fedex, Xerox, Apple, and others. Whatever happened to my decentralized net with no single point of failure?"

86 of 522 comments (clear)

  1. I'm definitely not a technical guru... by Dagny+Taggert · · Score: 5, Interesting

    but I believe the centralized concept of the 'net is something that is coming to an end, much to our loss. I'm pretty bothered by the fragility of this system. How many of you can't work without web access?

    --
    Don't be a looter...and yes, I know that it's spelled with an "A" instead of an "E".
    1. Re:I'm definitely not a technical guru... by Malc · · Score: 5, Insightful

      How many *think* they can't live without web access? Offline working can be surprisingly productive, and as it often forces more thinking and planning (e.g. in preparation for being back online, and just thinking through what would happen of you could be online) the results end up being better.

    2. Re:I'm definitely not a technical guru... by MindStalker · · Score: 5, Insightful

      You mean decentralized?
      Anyways butting both DNS records on the same point of failure breaks standards. These companies deserve to be hit hard (PR wise) for not building a roburst network.

    3. Re:I'm definitely not a technical guru... by jocknerd · · Score: 3, Insightful

      I actually would probably get work done without web access!

    4. Re:I'm definitely not a technical guru... by Elecore · · Score: 3, Interesting

      I do. In fact, I'm trying to move as much of my work onto the web as possible. First it was email, then my calendar and contacts (Horde). Now, I'm starting to run finances online so I can access it from anywhere. A few years ago there was a lot of hype about moving all software to webbased software and I think this is still the eventual outcome of the internet. It makes sense to everybody. No costs in making CDs or packaging, ability to work on anything from anywhere, and easy ability to upgrade software. For example, say the latest version of word was access from apps://MSWord. When a bug is found, MS patches the program and everybody is now using the fixed version. Admitantly, this gives more control to the manufacturer. If they want to add/remove a feature, you have no choice in upgrading. Hmmm... ok, so I'm inconclusive. Personally, I like the idea of a full PC of software and data available to me regardless of where I am. So yes, I rely on the internet to work and if we move towards this way, problems like you are describing will eventually be solved. Supply and demand. Thoughts?

    5. Re:I'm definitely not a technical guru... by Pizzop · · Score: 5, Informative

      It would be hard to do most of my work (Server Maint.) without the net. I might have to actually go to the servers instead of ssh. Wait, what am I talkin about, without the net I wouldn't HAVE a job.

    6. Re:I'm definitely not a technical guru... by fish_in_the_c · · Score: 5, Insightful

      you can still get to all those sites. You just have to REMEMBER the ip instead of depending on the computer to look it up for you ;). TCP/IP was designed to have not centeral point of failure and still does it's job well. DNS was not quite designed in such a way.

      --
      âoeTolerance applies only to persons, but never to truth. Intolerance applies only to truth, but never to persons.
    7. Re:I'm definitely not a technical guru... by AKnightCowboy · · Score: 5, Insightful
      How many *think* they can't live without web access?

      *Live* and *work* are too entirely different things. I could not get any of my work done with network access.

    8. Re:I'm definitely not a technical guru... by bluethundr · · Score: 4, Insightful

      ...how many *think* they can't live without web access? Offline working can be surprisingly productive, and as it often forces more thinking and planning (e.g. in preparation for being back online, and just thinking through what would happen of you could be online) the results end up being better.

      F'real. To think, they did all that even before the Altair was a twinkle in Ed Roberts' jockey shorts!

      --
      Quod scripsi, scripsi.
    9. Re:I'm definitely not a technical guru... by endx7 · · Score: 5, Funny

      Offline working can be surprisingly productive

      Because that means then you aren't on slashdot?

      er....brb, I should probably get back to work.

    10. Re:I'm definitely not a technical guru... by aonaran · · Score: 4, Informative

      Solution to akamai problems:
      go to <a href="http://www.dnsstuff.com/">your favorite DNS lookup page</a> and lookup the akamai hosted site. (getting the real address rather than the akamized version) Now open your hosts file and add that in.

      Now you will always get the non-akamized version of that site. Akamai problem solved.

      I keep google in my hosts just so I can be sure that DNS issues like this won't cut me off from my favorite search engine.

    11. Re:I'm definitely not a technical guru... by AKnightCowboy · · Score: 4, Funny
      I could not get any of my work done with network access.

      Errr, obviously I mean without network access. Although I'd spend less time on Slashdot so perhaps I can't get my work done with network access.

    12. Re:I'm definitely not a technical guru... by Malc · · Score: 4, Interesting

      I'm not being condescending, but unless you're very junior or just a small pawn in a very large company, even server mainenance positions require some offline work. There's always some planning that needs to be done for tasks during today, the coming week, or even long term. Phone calls can be made, documentation updated, etc. It really depends on how long the outage is for and how quickly you're able or willing to switch gears and tasks.

      I realise that some jobs are much more inpractical when there is downtime, but not everybody even here on /. requires 100% connectivity. I myself live over 4,000 km from work (I haven't even met them face-to-face for more than four years) and I would be pretty pissed off and delayed on one of those days I have to do all my work via Terminal Services on machines at the main office or colocation facility... but let's be honest, a day's outage wouldn't mean I couldn't do any productive work. I once had a 10 day outage when the local telco switched my DSL line from interleaved channelisation to fast-path, but as I was in the middle of a long stretch of software development, I really didn't need a lot of internet. Batches of dialup and patience all-around sufficed ;)

    13. Re:I'm definitely not a technical guru... by Shalda · · Score: 3, Funny

      Without web access, I have nothing to do but work.

    14. Re:I'm definitely not a technical guru... by jdray · · Score: 4, Interesting

      I work for an electric utility and, with the current state of web-based scheduling of electricity in the U.S. (a mandatory requirement by regulatory agencies), loss of proper traffic routing on the Internet can have difficult-to-overcome effects.

      --
      The Spoon
      Updated 6/28/2011
    15. Re:I'm definitely not a technical guru... by flink · · Score: 3, Funny

      I can survive without net access, but I wouldn't call it living.

  2. Well . . . by Maradine · · Score: 4, Insightful
    Whatever happened to my decentralized net with no single point of failure?


    Its still there, and you're using it. The only organizations affected by this are those who chose to use a service that acts as a single point of failure.

    --

    trustedworlds.net - gaming, security, and the gunk that lives in between

    1. Re:Well . . . by Anonymous Coward · · Score: 3, Insightful

      Yup, 13 nameservers that all do the exact same job located at different places around the world, with different access providers. All fully capable of doing their job without the others.

      Sure sounds like a single point of failure to me.
      </sarcasm>

    2. Re:Well . . . by Bob9113 · · Score: 4, Insightful

      Whatever happened to my decentralized net with no single point of failure?

      Its still there, and you're using it. The only organizations affected by this are those who chose to use a service that acts as a single point of failure.


      You said it brother (and beat me to the punch). This is a clear talking talking point for anyone who is attempting to justify avoiding a monoculture. When you brings up Microsoft, around which revolve a number of good examples of the dangers of monoculture, you risk the debate turning political and will almost certainly be discounted as a Linux/Apple/Unix zealot by at least some in the listening audience. It is very worthwhile to have other examples besides Microsoft and cotton when explaining the risks.

  3. Whatever happened to my decentralized net? by mattkime · · Score: 5, Funny

    Whatever happened to my decentralized net with no single point of failure?

    Its there. Get out your old Usenet reader. See, you still have your porn.

    --
    Know what I like about atheists? I've yet to meet one that believes God is on their side.
  4. points of failure by rlthomps-1 · · Score: 4, Interesting

    DNS dying on you? Just throw it on the pile of other connection problems

    I think everyone has several "single" points of failure -- my cable modem dies at least twice a month and my wireless router conks out at least twice a day ;)

    1. Re:points of failure by pairo · · Score: 3, Funny

      You're comparing your home connection with Akamai? :-)

  5. Clear your cache by Frennzy · · Score: 5, Informative

    Yahoo is already resolving through scd instead of akamai. I didn't check any of the others.

    If you clear your cache, you will probably get the new entries, unless your ISP hasn't caught onto the problem yet.

    1. Re:Clear your cache by strictnein · · Score: 4, Informative

      for the windows users out there:
      ipconfig /flushdns

    2. Re:Clear your cache by jeffasselin · · Score: 4, Informative

      For OS X users:

      lookupd -flushcache

      --
      If he explores all forms and substances Straight homeward to their symbol-essences; He shall not die.
    3. Re:Clear your cache by Anonymous Coward · · Score: 3, Funny

      For Linux users:

      sudo su -
      /etc/init.d/network stop eth0
      /etc/init.d/network start eth0
      /etc/init.d/iptables stop
      /etc/init.d/iptables start
      /etc/init.d/ntpd stop
      /etc/init.d/ntpd start

      yes, I think I'm funny... :-P

  6. ok by Anonymous Coward · · Score: 3, Funny

    vague explanation, just a link to the ISC's Incidents website and not the article, and now that site is inaccessible courtesy the slashdot effect. Nice job, now we cant even find out what's going on!

  7. Good morning, Mr. Gore. by Quarters · · Score: 5, Funny
    Whatever happened to my decentralized net with no single point of failure?

    How ya doin', Al?

    1. Re:Good morning, Mr. Gore. by Ralph+Wiggam · · Score: 3, Insightful

      Damn that was funny 4 years ago. Do you have any good "hanging chad" material?

      Al Gore was talking about creating *legislation* that helped foster the Internet.

      Why do Conservatives bitch to high hell when anything they say it taken out of context, but repeat dumb quotes by Liberals out of context for years and years?

      Maybe they should stop worrying so much about people who havn't had a political job in 4 years and worry about the people who do have important jobs now and are doing them so amazingly badly.

      -B

    2. Re: Good morning, Mr. Gore. by whereiswaldo · · Score: 4, Funny

      I've seen the same joke once a week for 4 damn years. I'm just sick of it.

      In Soviet Russia the world revolves around YOU!

      "He's sick of the jokes boys. Let's shut 'em down." -- Chief Wiggum

    3. Re:Good morning, Mr. Gore. by AtariDatacenter · · Score: 3, Funny

      So you could say that Mr. Gore was the fertility clinic of the Internet, yes?

  8. Ironically... by xbrownx · · Score: 5, Informative

    ...I can't even get to http://isc.incidents.org/

  9. Single point of failure by jelizondo · · Score: 5, Funny

    You could still access Slashdot, couldnt you?

    --
    Be very, very careful what you put into that head, because you will never, ever get it out. - Cardinal Wolsey
  10. Hmmm by Mz6 · · Score: 5, Funny
    "Seems that the problems at Akamai are screwing over Yahoo, Google, Microsoft, Fedex, Xerox, Apple, and..."

    ... http://isc.incidents.org.

    --
    Hmmm.
  11. Whatever happened to your decentralized net? by YetAnotherName · · Score: 4, Insightful

    The web happened my dear friend, and it was based on the predominant distributed computing model at the time: client/server. Even DNS, with its highly distributed spread of processing and data, has a set of (overloaded) root servers with the commensurate single points of failure. The solution? Peer-to-peer.

    Too bad even the term P2P raises so many red flags with certain Associations of America. :)

    1. Re:Whatever happened to your decentralized net? by Tenareth · · Score: 4, Informative

      Uhm, the root servers are not overloaded... this has nothing to do with the root servers, this has to do with Akamai having problems.

      They have a private cached network they sell access to. It's like taking a service road around crowded highways to get closer to the final destination.

      One of the companies I used to work for used Akamai, nice network... not so great customer service unless you are a really big customer.

      --
      This sig is the express property of someone.
  12. 2nd time in a month by ZHaDoom · · Score: 5, Informative

    This should cause some problems for akami, they had an outage may 24th. Once can be overlooked twice? these are some big companies they are going to be calling them. I bet there is some sweating techs in the cool noc right now

    --
    War isn't about who's right. It's about who's left.
  13. DNS issue... by Tuxedo+Jack · · Score: 3, Insightful

    You would think that the root DNS servers would be kept up to date with critical information. Just what happened, and how did Akamai get knocked around this? Did they screw with their DNS information and change their nameserver addresses or something?

    --

    Striking fear in the authors of godawful fanfiction, I am here, appearing in darkness, Tuxedo Jack!
    1. Re:DNS issue... by ObsessiveMathsFreak · · Score: 5, Funny

      Well I guess it's back to IP addresses for us!!!
      ....
      I'll be at 127.0.0.1 until this blows over.

      --
      May the Maths Be with you!
  14. releted to linux kernel DoS exploit? by Anonymous Coward · · Score: 4, Interesting

    Do we know if this at all related to the Linux kernel 2.4.2x/2.6 DoS exploit discovered yesterday?

    1. Re:releted to linux kernel DoS exploit? by mtenhagen · · Score: 5, Funny

      It probarbly is, I did send an email to a guy the akamai noc and told him to execute a lttile attached application on all the dns servers and he would receive free porn if he did.

      Iam now trying to send the porn but the mail server is unreachable.

      --
      200GB/2TB $7.95 Coupon: SAVE90DOLLAR
  15. my failure by pcmanjon · · Score: 3, Funny

    My primary point of failure is my router, the damn clip that keeps the cat6 cable plugged in the router always falls out.

    My central point of failure... :(

  16. Preformance vs reliability by Kardnal · · Score: 3, Interesting

    When Akamai's system was first announced, most people thought this was a great idea. It made sure that the sites that used this technology would always have the bandwidth they needed, when they needed it. Like with everything else in life, there's always a trade-off between preformance and reliability...

    --
    ------------------
    "Never Attribute to malice what is adequately explained by stupidity..."
    1. Re:Preformance vs reliability by br0ck · · Score: 3, Funny

      Like with everything else in life, there's always a trade-off between preformance and reliability...

      You really can have both!.. have you tried Viagra?

  17. Re:decentralized DNS is a pipe dream by RT+Alec · · Score: 3, Informative

    I don't think this had anything whatsoever to do with any of the root servers. This has to do with Akamai's DNS servers, and the companies (domains) that are using them.

  18. Lack of notification by sphealey · · Score: 5, Interesting

    What ticks me off about this incidents (and I suspect that there have been several in the last 6 months) is that there is absolutely no notification given, either during or after the event. During this outage, some news outlets were still reachable (including Slashdot), and a simple notification would have saved hours (* 10s of thousands of network dudes worldwide) of time and much grief from the big bosses who couldn't reach Yahoo Finance, I mean critical business web sites.

    Are these guys so convinced of their omnipotence and indispensibility that they don't feel the need to communcate with the world about what is going on?

    sPh

    1. Re:Lack of notification by Syberghost · · Score: 3, Funny

      What ticks me off about this incidents (and I suspect that there have been several in the last 6 months) is that there is absolutely no notification given, either during or after the event. During this outage, some news outlets were still reachable (including Slashdot), and a simple notification would have saved hours (* 10s of thousands of network dudes worldwide) of time and much grief from the big bosses who couldn't reach Yahoo Finance, I mean critical business web sites.

      Yeah, they should post a notice on their web page, saying their internet connection is down. Bastards.

  19. I'm surprised... by swasson · · Score: 5, Funny

    that the /.'ers aren't trying to take credit for slashdotting the entire WWW.

    --
    "Facts are meaningless. You could use facts to prove anything that's even remotely true!" -- Homer Simpson
  20. Well, it wasn't out for that long ... by hattig · · Score: 4, Informative
    Typically, the domain itself (e.g. 'google.com') still resolves, but popular hostnames, like 'www.google.com' will not resolve.


    Pwned by CNAME to Akamai?

    (You can't have CNAME records for the base domain, hence google.com would have had an A record instead, whilst www.google.com would have been a CNAME to akamai)
  21. can we figure out... by kaan · · Score: 3, Funny

    ... a way to blame the outage on Microsoft instead of (or in addition to) Akamai?

    (come on, it's funny. at least I didn't suggest blaming SCO...)

  22. Akamai is evil! by scovetta · · Score: 3, Insightful

    When I was in grad school at Cornell, my O/S professor went on a rant about the evils of Akamai. No one believed him. Now we know he was right.

    --
    Wer mit Ungeheuern kämpft, mag zusehn, dass er nicht dabei zum Ungeheuer wird. --Nietzsche
  23. Root servers not decentralized? by Otto · · Score: 5, Insightful

    It's not truely decentralized...
    The root nameservers are the most obvious example...


    The most obvious example? The fact is that there are 13 of them, in widely scattered locations across the globe, and it's not decentralized?

    Damn man, what exactly would you consider "decentralized" then?

    Root servers go down all the time. It's not particularly unusual. There's THIRTEEN of the things. Up to 8 have been down at once with no major effects on the network, IIRC.

    --
    - Give a man a fire and he's warm for a day, but set him on fire and he's warm for the rest of his life.
    1. Re:Root servers not decentralized? by Syberghost · · Score: 4, Interesting

      The fact is that there are 13 of them, in widely scattered locations across the globe, and it's not decentralized?

      Damn man, what exactly would you consider "decentralized" then?


      Akamai has 13, in widely scattered locations, as well. That in itself doesn't make them sufficiently decentralized.

      The reason the root servers don't have this problem is that they don't all run the same software (anymore) and aren't all administrated by the same people.

      I'm making an assumption here, of course, but I will not be a bit surprised if it turns out that Akamai loaded something that hit all their routers at once.

    2. Re:Root servers not decentralized? by Omnifarious · · Score: 4, Interesting

      The root nameservers are not under decentralized political control, which still makes them a single point of failure, albeit a different kind of failure.

    3. Re:Root servers not decentralized? by sys49152 · · Score: 4, Insightful

      I'm sorry, my friend, but thirteen servers does not mean decentralized it means replicated. The fact that they are geographically dispersed doesn't matter. Furthermore, the root servers just redirect to the authoritative server, so your "company.com" search goes to Verisign for resolution. What happens when Verisign, oh, I dunno, decides to send back the IP address of a cheesy search engine instead on an error code for domain names that don't exist. I tell you what happens, the Internet breaks.

      To be truly decentralized not only do we need more than 13 overloaded root servers, but no one entity should be authoritative. How that's done is left as an exercise to the reader.

    4. Re:Root servers not decentralized? by tyler_larson · · Score: 3, Insightful
      I'm sorry, my friend, but thirteen servers does not mean decentralized it means replicated. The fact that they are geographically dispersed doesn't matter.

      I'm sorry, my friend, but it most certainly does mean decentralized. Here's why:

      Decentralized means "having power or function dispersed from a central to local authorities". Each individual top-level nameserver operates entirely independantly of the others to the extent that it is capable of remaining completely operational in the absence of the others.

      DNS is actually the epitome of a decentralized service--as perfect an example as there comes. Assuming it is implemented as perscribed in the RFCs, there is no single point of failure (an incorrectly implemented DNS system is not the result of a poor design, it's the result of poor implementation--you can't blame DNS).

      There are 13 totally and completely independant top level servers. The only thing that ties them together (in a practical sense) is that they speak the same protocol and synchronize with eachother if possible. All top-level domains have at least two nameservers (generally much more), and all second level domains are required to have at least two authoratative nameservers as well. If any one of these servers in the whole chain fails at any time, the others will pick up the slack--it's part of the protocol.

      Implementing this service correctly such that no failure will take down your own domain is left as an exercise for you. It's your domain and your nameserver. You're responsible for insuring that it works. The "system" correctly assures that each one of your own nameservers will be queried until one responds. If you take all of your own nameservers offline, there's obviously nothing that the DNS system can do to help you. That's what Akamai's problem was. Don't blame DNS.

      --
      "With sufficient thrust, pigs fly just fine. However, this is not necessarily a good idea...."
      RFC 1925
  24. Lack of multiple points of failure by bastardadmin · · Score: 5, Insightful

    I can see the logic that went into this plan:
    "Well, Akamai has a few million DNS boxes, if we put everything there we'll be fine! That's not a single point of failure!"
    Yeah, about that... multiple vendors may have been a good idea in retrospect instead of just one monolithic provider.
    Time to re-examine the definition of Single Point of Failure.

  25. You know... by Mz6 · · Score: 5, Funny
    If we timed our stories right we could pull this all off as the /. effect...

    Let's see so far today.. We had a report on Yahoo... They're down. A report to a virus linked to Symantec.. they are up and down. We always link to Google, they are having problems... wooo. Now we just need another patent from Microsoft to bring them down... which by my records shouldn't be too long.

    --
    Hmmm.
  26. Need my Xerox fix! by goober · · Score: 5, Funny

    Checking all my favorite sites this morning...

    Microsoft, Xerox and FedEx are some of my favorite sites too! But due to the outage I'm stuck slumming it here on Slashdot...

  27. We fixed it quick by Apreche · · Score: 5, Funny

    Yeah, google didn't work and we didn't know what to do. We tested and determined the problem was akamai within a minute. So I used AIM to ask a friend who could still resolve google what the ip was. he passed it to me over aim using gaim encryption no less. We then created an alias for google on our dns server. google.ourdomain.com.

    We also developed a new DNS protocol in the process. ESEDOIM: Extremely slow encrypted DNS over instant messenger. Who wants to write an RFC?

    --
    The GeekNights podcast is going strong. Listen!
    1. Re:We fixed it quick by MyHair · · Score: 4, Funny

      Must file patent...clowns will eat me.

  28. Akamai by junctionvin · · Score: 3, Informative

    I run a small ISP and we happen to have 3 of their linux boxes on our network. I've never experienced a problem with them before today. For the hack of it we decided to just reboot their servers and now things are working correctly.

    For those that were wondering why it would affect DNS; Akamai somehow tinkers with DNS and BGP to redirect content to their edge servers.

    As for Akamai being outdated, it still seems to me that its a good idea for Yahoo and some of the high traffic sites on the net. Akamai has thousands of distributed servers colocated with ISPs and NAPs. And they do seem to absorb nasty bursts in traffic (ie Star Report) better than a centralized server farm. But for their own sake, they better hope to not have another repeat of todays events.

  29. Akamai's DNS black magic by frankie · · Score: 4, Interesting
    Akamai uses (some would say ABuses) DNS in ways the rest of us (even global megacorps) wouldn't dare. Half of Akamai's magic is their 10000+ carefully-scattered servers, but the other half is their routing. Those servers are listed differently depending on where you ask from.

    It's not like a092156fg.akamai.net is in Seattle and k1039665.akamai.net is in Saskatoon. Instead, all of *.akamai.net goes to whatever cluster is "closest" to the requesting IP (based on BGP, Colonel's Secret Recipe, etc)

    So if Akamai's DNS gets screwed up, I would expect major weirdness. And as more sites join EdgeSuite (where you host your entire domain on Akamai's servers & DNS) the effect must magnify.

    Of course, I could be completely wrong. I'm not a routing god, just a guy who thinks Akamai is a cool hack.

  30. NANOG Postings by TheSync · · Score: 5, Informative

    From NANOG:

    From here neither www.google.com, nor www.apple.com work. Both seem to return CNAMES to akadns.net addresses (eg, www.google.akadns.net, www.apple.com.akadns.net), and from here all of the akadns.net servers listed in whois are failing to respond.

  31. Success considered harmful? by DragonHawk · · Score: 3, Insightful

    I was thinking about this while scrambling to answer the phone, check outage reports, and generally calm down customers.

    If a product or service, such as Akamai, does their job very well, everybody will want to use them. If everybody uses them, you create a single point-of-failure. Any design flaw in that product or service becomes a disaster, simply through volume. Does this mean a successful product or service can actually be a bad thing for people?

    Other examples include just about anything from Microsoft, older versions of Sendmail and BIND (worm-of-the-week problem), and Firestone tires.

    (I'm not trying to advocate communism, excessive government regulation, or anything like that. So fanatical libertarians, conspiracy theorists, etc., can put down the rant-o-matic flamethrowers. :) )

    Comments?

    --

    dragonhawk@iname.microsoft.com
    I do not like Microsoft. Remove them from my email address.
  32. Correction by PhuCknuT · · Score: 4, Insightful

    Akamai didn't mess up the net. Akamai messed up some web sites that are akamai customers. Remember kids, www is only a subset of the internet, and akamai customers a small fraction of the www.

  33. Re:Terrorist attacks, anyone? by GlacierPilot · · Score: 5, Informative

    The real cost of a web site dropping is a lot more difficult to figure out than you might imagine. Say Amazon goes down for a couple of hours. Are all those potential sales lost forever? I doubt it. Some people will just come back and order later. The firm is unlikely to see any long term impact unless the outage becomes habitual. Non-retail sites probably have even more flexability. About the only area in which an outage could have a real, long term adverse impact would likely be in financial services. If Schwab goes down for half a day they will suffer big time for a long time. If you're talking "the economy" as in the big picture economy" suffering - forget it. Web based commerace isn't that important yet.

  34. Having an 'incident' of their own... by cuzality · · Score: 3, Funny

    Later they can post an 'incident report' on the slashdotting they're experiencing right now!

  35. Luckily it's 99.45% shit to begin with. by gelfling · · Score: 4, Funny

    Seriously we need a *.sht domain.

  36. Tech details by DragonHawk · · Score: 4, Informative

    It appears that, at around 8:30 AM EDT (US Eastern Daylight Time), Akamai's DNS network experiened some kind of major failure. All of their DNS servers (that anybody could find) were not responding to DNS queries. It appears that Akamai started to come back online at around 10:00 AM EDT.

    Since a great many big name sites use Akamai, this effectively made large parts of the Internet unreachable. The destination servers themselves were up, but clients were unable to turn names (like www.example.com) into network addresses (like 192.0.2.42).

    As Akamai maintains dozens, if not hundreds, of DNS servers across the globe, it is extremely unlikely that this was due to a normal equipment failure or DoS attack. Some kind of internal system trouble is much more likely. Whether a deliberate attack, or an accident, is unknown to me at this time. It could just be an internal configuration change blew up in a really bad way. Sh*t happens.

    I do not know if this was just an Akamai DNS problem, or if other Akamai services were also affected.

    Due to the way Akamai is usually implemented, it happened that, in many cases, the second-level domain names (like example.com) worked, but subdomains (like www.example.com and mail.example.com) did not. This is because most organizations put in CNAME records (pointing to names in *.akadns.net) for the subdomains. You cannot use a CNAME record for a domain that has other records, though, so most domains still had traditional A records, on their own nameservers, at the second-level.

    The following sites/organizations are known to use Akamai: Yahoo, Google, Microsoft, Altavista, FedEx, Xerox, Apple

    --

    dragonhawk@iname.microsoft.com
    I do not like Microsoft. Remove them from my email address.
  37. Reminds me of a story by Venner · · Score: 5, Interesting

    Not too long after 9/11, I was surfing the net and needed to look up something at the Library of Congress for one of my classes. It wouldn't connect. At first I thought we'd just lost DNS (not so uncommon an occurance at my university in those days), but found I could still connect to slashdot.org and some other sites.

    Being a geek, I thought up a list of about 30 sites to ping, scattered across the US. (.govs and .edus mostly.) The ones that replied, I plotted on a US map based on their DNS LOC. (A project I wrote for a previous class.)

    I freaked out a bit when the mid-atlantic seaboard came up missing. I crossed my fingers hoping that it was just some idiot who'd accidently cut one of the main fibers (which it what it ended up being) and not that Washington DC was now a big hole in the ground.

    --
    A preposition is a terrible thing to end a sentence with.
  38. "DNS was not quite designed in such a way" by Ernesto+Alvarez · · Score: 5, Insightful

    you can still get to all those sites. You just have to REMEMBER the ip instead of depending on the computer to look it up for you ;). TCP/IP was designed to have not centeral point of failure and still does it's job well. DNS was not quite designed in such a way.


    DNS was designed to be robust enough. Not one root server but many (ok, that's the weak point, we've all seen many DDoS against them, but it's not THAT bad). All zones are handled by their own servers, and (in theory) multiple servers for each zone. All in all, it's not a bad design.

    If what happened was that someone put all the servers behind one link, it's not DNS' fault, the BOFH there screwed up (and considering it's akamai, they should not have done that).

    (If that's not what happened, sorry, I couldn't RTFA, it's slashdotted or there's some sort of DNS problem there too).
  39. From Akami's Page by esconsult1 · · Score: 3, Informative
    Some info from Akami...

    # Maximizes e-business revenue by guaranteeing 100% availability

    EdgeSuite Enterprise Edition is built on the globally distributed and highly scalable Akamai EdgePlatform, comprising over 14,000 servers deployed in over 1,000 networks across more than 70 countries. With this global reach, users can deliver their content from the edges of the Internet - closest to their users.

  40. How Sites are Coming Back Online by TheSync · · Score: 5, Informative

    From NANOG mailing list again:

    Google pulled references for akamais dns servers a short period ago. they are presently serving their own dns requests.

    Also:

    People seem to be getting around this by changing their DNS entries.

    E.g. www.yahoo.com always used to be a CNAME for www.yahoo.akadns.net. But
    now:

    # host www.yahoo.com
    www.yahoo.com is an alias for www.dcn.yahoo.com.
    www.dcn.yahoo.com has address 216.109.118.64
    www.dcn.yahoo.com has address 216.109.118.65
    www.dcn.yahoo.com has address 216.109.118.66
    www.dcn.yahoo.com has address 216.109.118.67
    www.dcn.yahoo.com has address 216.109.118.68
    www.dcn.yahoo.com has address 216.109.118.69
    www.dcn.yahoo.com has address 216.109.118.70
    www.dcn.yahoo.com has address 216.109.118.71
    www.dcn.yahoo.com has address 216.109.118.72
    www.dcn.yahoo.com has address 216.109.118.73
    www.dcn.yahoo.com has address 216.109.118.74
    www.dcn.yahoo.com has address 216.109.118.75

    Which is owned by Yahoo! (via HotJobs.com).

  41. Whatever happened to my decentralized net... by Lord+Kano · · Score: 3, Insightful

    Whatever happened to my decentralized net with no single point of failure?

    Outsourcing and consolidation.

    LK

    --
    "Hi. This is my friend, Jack Shit, and you don't know him." - Lord Kano
  42. Happy now? by SpinyManiac · · Score: 3, Informative

    Handlers Diary June 15th 2004
    Updated June 15th 2004 14:31 UTC (Handler: Lenny Zeltser)
    Akamai DNS outage
    Akamai DNS problem

    Starting at around 8:30 am EDT (12:30 UTC), a number of sources started to report a widespread Akamai DNS issue. Large web sites, which use Akamai for its DNS service, did no longer resolve. Effected sites are Yahoo, Google, Microsoft, Fedex, Xerox, Apple and likely many others.

    At this time (10:30 am EDT), some effected domains removed the Akamai DNS servers and are reachable again using their own DNS servers.

    Typically, the domain itself (e.g. 'google.com') still resolves, but popular hostnames, like 'www.google.com' will not resolve. As a result, the web site is no longer reachable.

    The effect appears to be world wide. Some of the Akamai servers do respond to pings, but do not respond to DNS queries.

    posts to the NANOG mailing list regarding this issue:
    http://www.merit.edu/mail.archives/nanog/m sg05267. html

    --
    It's never too late to have a happy childhood.
  43. Re:Interesting... by Xaroth · · Score: 5, Funny

    If it weren't slanted, it'd be |.

    (Apologies to whomever I'd seen that from before.)

  44. Created SPoF by Todd+Knarr · · Score: 4, Interesting

    The problem is that those sites created their own single point of failure by all using Akamai for DNS. When Akamai DNS fails, sites that depend on it for their own DNS fail.

    It used to be nearly impossible for this to happen. The original rules for DNS were that you had to have at least 2 nameservers for your domain, preferrably 3 or more, and they couldn't be on the same physical networks. With that rule having a single network go down rarely made any domain unresolvable (backbone networks whose outages could render dozens or hundreds of other networks unreachable being the exception). Maybe we should put the old nameserver-diversity rules back into place.

  45. novell and dns... by ecalkin · · Score: 3, Insightful

    This was years ago (3? 4)... I set up a novell server and setup dns on it as a forwarder and pointed workstations to my novell server for dns.
    One of the neat things was the log screen that showed dns actions and you could follow the trail of dns requests to see how they were resolved. what makes this not O/T is that i beleive that this went into a log.

    The reason that I think about that is, if DNS stopped working, i'm not sure that i have cached numbers that i could easily get to....

    eric

  46. "Caught in a BIND" by stock · · Score: 3, Informative
    Jon Lasser predicted some troubles long time ago : http://crashrecovery.org/bind9.html . His article is on http://theregister.co.uk/content/55/28235.html and titled "Caught in a BIND".

    Robert

  47. Re:Interesting... by digidave · · Score: 3, Informative

    The reason why it's a mysterious "DNS issues" is because we don't know what the problem is. It'd be the same if it was a Windows DNS server (not that anybody uses those for major networks like Akamai). Seeing as Akamai uses more than one DNS server it's more likely a administrator error than a Linux crash. Nobody would be blaming Windows if an administrator screwed up.

    You are also confusing their cache servers with their DNS servers. They're completely different.

    --
    The global economy is a great thing until you feel it locally.
  48. Google down? by thenerdgod · · Score: 3, Insightful

    My god... with google down my effective IQ is 12!

  49. Akamai does use *some* win servers by Jayfar · · Score: 3, Informative

    I wouldn't presume they use any for their dns funtionality, but fact of the matter is Akamai does have a small proportion of windows servers in their distributed clusters. Seen 'em with my own eyes.

  50. Re:Interesting... by Anonymous Coward · · Score: 4, Funny

    I wonder if Microsoft/AdTI will buy the "\." domain? News for Nerds slanted the other way!

  51. Re:Uh by Slime-dogg · · Score: 5, Insightful

    It is misleading to refer to the box as a "Linux" box. Was it really the kernel that was at fault for the machine being cracked, or was it a bug in one of the daemons that the machine was running? There are differences between a Linux box that runs BIND and another that runs EZ-DNS (or whatever).

    How about this: Instead of labelling the Akamai boxes that have problems as "Linux" boxes, label them as "BIND" boxes, or whatever DNS server it is that it runs. Perhaps there's a FreeBSD machine in there that is having similar problems.

    It is allowable, though, to refer to a Windows box as just that. MS ships an all-in-one product, and seldomly do admins use Windows to run BIND, Apache or other OSS servers.

    All of this hand-ringing in an effort to paint "Linux" as bad, or as "just as bad" is dopey. One might as well point a finger at the administrator of the machine that was hacked, the services that were running on it, etc. Most Windows problems are caused by the same thing too. It is wiser to point at the admin (and the services one chooses to run) than to point at the OS, or the kernel.

    --
    You need to restart your computer. Hold down the Power button for several seconds or press the Restart button.
  52. Washingtonpost.com says it was a denial of service by tsu+doh+nimh · · Score: 3, Informative

    ...according to this story at washingtonpost.com The story says it was a distributed denial of service attack against Akamai, among others.

    --
    ...because you never know who you're dealing with.