Slashdot Mirror


One Broken Router Takes Out Half the Internet?

Silent Stephus writes "I work for a smallish hosting provider, and this morning we experienced a networking event with one of our upstreams. What is interesting about this, is it's being caused by a mis-configured router in Europe — and it appears to be affecting a significant portion of the transit providers across the Internet. In other words, a single mis-configured router is apparently able to cause a DOS for a huge chunk of the Net. And people don't believe me when I tell them all this new-fangled technology is held together by duct-tape and baling wire!"

50 of 412 comments (clear)

  1. Few stories back... by Anonymous Coward · · Score: 5, Funny

    A couple of Nuclear Subs probably cut an underwater cable...

    1. Re:Few stories back... by morgan_greywolf · · Score: 5, Funny

      Nuclear Sub? Is that a new sandwich from Subway?

      "The all new Subway Nuclear Sub: It glows in the dark! Get a lotta green for a little green! Now only $5.99 for a 12-inch! Subway: Eat Fresh!"

    2. Re:Few stories back... by mschuyler · · Score: 4, Funny

      That is actually correct. The sub shop in Bremerton (West Coast port for Trident Ballistic Missile Submarines, SSBN-726, etc.) sells the Trident Nuclear Submarine Sandwich with an extra serving of horseradish somewhere in the middle of it. It'll light your hair on fire, or, in my case, turn my scalp red.

      --
      How about a moderation of -1 pedantic.
  2. Half the internet? Are you serious? by Anonymous Coward · · Score: 5, Informative

    A router takes out 'half the internet' and I learn this from Slashdot?

    Seriously, what is/was the impact? I work for a large e-commerce provider and haven't seen a thing that would indicate a problem today.

  3. Sorry by Alcoholist · · Score: 4, Funny

    My bad. I never should have cut that tape.

    --
    Bibo Ergo Sum.
  4. BGP by winkydink · · Score: 5, Informative

    The internet's dirty little secret. It's amazing it works at all.

    --

    "I'd rather be a lightning rod than a seismometer." -Ken Kesey

  5. You get Duct tape? by Forge · · Score: 4, Funny

    Lucky Yankees with all your fancy technology. If I told you what we use, nobody would respond for fear that in attempting to respond I would cause a few fatalities.

    --
    --= Isn't it surprising how badly I spell ?
  6. Yep, Its true by Bryansix · · Score: 4, Informative
    Our Hosted VOIP service took a dump today at 8:40 AM PST. Supposedly it was a server in the Czech Republic. From the Carrier

    2009-02-16 0945 PST CP experienced a core network connectivity issue due to a world wide BGP issue that affected all BGP interconnected networks. A rouge machine in the Czech Republic was making bad AS advertisements that caused systems world wide to fail. We have worked with our providers as well as our internal Engineering department to effectively block this node and restore service to our network. This is an ongoing issue that is still being worked to get a 100% correction. There is a workaround currently in place until a complete fix is available.

    1. Re:Yep, Its true by radish · · Score: 5, Funny

      A rouge machine in the Czech Republic was making bad AS advertisements that caused systems world wide to fail.

      Now I really don't know all that much about large-scale networking so maybe someone could explain this to me. What difference does it make if the router is rouge, versus say, green? or black?

      Thanks for any insight :)

      --

      ---- Den ene knappen er powerknapp, den andre er Bender voice knapp "Bite My Shiny Metal Ass"

    2. Re:Yep, Its true by ChunderDownunder · · Score: 5, Funny

      Since folks on Slashdot seem to like car analogies, I'll just mention that Red Cars Go Faster and assume that the same law applies for routers.

    3. Re:Yep, Its true by pyite · · Score: 5, Funny

      Now I really don't know all that much about large-scale networking so maybe someone could explain this to me. What difference does it make if the router is rouge, versus say, green? or black?

      So they announced a route that was, shall we say, malformed. Part of the problem is that due to a Cisco bug (CSCdr54230), some routers choke on it instead of ignoring it. The bug is fixed. It was fixed some time ago. Nonetheless, it's a pretty bad bug, labeled as "1 - catastrophic" by Cisco (in red letters, even). Routers still running affected code versions are having issues.

      And it's only at this point in writing my reply that I realize you were taking advantage of a pun by way of misspelling. I'll leave my reply anyway ;-)

      --

      "Nature doesn't care how smart you are. You can still be wrong." - Richard Feynman

    4. Re:Yep, Its true by Anonymous Coward · · Score: 4, Funny

      Everyone knows rouges are overpowered, just ask any mage.

    5. Re:Yep, Its true by myowntrueself · · Score: 5, Insightful

      That's the problem. You shouldn't use rouge on your routers.

      I think that a rouged router would possibly be overly promiscuous.

      No wonder problems like this can spread like the clap in a port town!

      --
      In the free world the media isn't government run; the government is media run.
    6. Re:Yep, Its true by Hecatonchires · · Score: 4, Funny

      Yes, Mages are known for powdering their cheekbones. Rogue's on the other hand, like to stab people in the back.

      --

      Yay me!

    7. Re:Yep, Its true by andrikos · · Score: 5, Funny

      Rouge is overpowdered!

    8. Re:Yep, Its true by travbrad · · Score: 4, Insightful

      I'm going to go with option G) Laziness

  7. AS 47868 by Anonymous Coward · · Score: 5, Informative

    There is a post in nanog and on isc.sans.org.

    AS 47868 causing AS paths to become too long...

    http://www.merit.edu/mail.archives/nanog/msg15472.html

  8. I lost a router by Philip+K+Dickhead · · Score: 5, Funny

    And took out THE _WHOLE_ INTERNET!!!!!

    It's true! Ask my wife!

    --
    "Speaking the Truth in times of universal deceit is a revolutionary act." -- George Orwell
    1. Re:I lost a router by biocute · · Score: 5, Funny

      Which one is true? A lost router took out the whole internet, or you have a wife?

    2. Re:I lost a router by zobier · · Score: 4, Funny

      Are you saying that you accidentally the whole Internet?

      --
      Me lost me cookie at the disco.
    3. Re:I lost a router by Random+Destruction · · Score: 5, Funny

      Yes. I accidentally the whole internet.

      Is that bad?

      --
      :x
  9. Ditto the A.C. by khasim · · Score: 5, Informative

    It must have been the "half the Internet" that I don't use. Which would be an interesting half because many of the sites I visit regularly are based in Europe.

    From the thread, it looks like AS 47868 was the route being lost.

    http://en.wikipedia.org/wiki/Autonomous_System_Number

    1. Re:Ditto the A.C. by roc97007 · · Score: 4, Funny

      > It must have been the "half the Internet" that I don't use.

      The non-pr0n half.

      --
      Oliver's law of assumed responsibility: If you're seen fixing it, you will be blamed for breaking it.
    2. Re:Ditto the A.C. by besalope · · Score: 5, Funny

      > It must have been the "half the Internet" that I don't use.

      The non-pr0n half.

      Such a place exists? 0.o

    3. Re:Ditto the A.C. by petecarlson · · Score: 4, Informative

      It wasn't just AS47868, it was kicked off by AS47868 sending real long routes like you can get to a by going through b, c, d, e, f ,g, h... and so on and so forth. Older versions of IOS wack out with the crazy long routes and lose their BGP sessions so it is possible that he lost half of the internet while you were on a network segment which was not seeing the issue. If the OP were to post the ASN or IP block he was on we could run BGP play and see just how much of the net he really lost. I'm going to guess about .5%.

  10. baling wire, not bailing wire by bugi · · Score: 4, Informative

    http://en.wikipedia.org/wiki/Baling_wire

    I think you mean baling wire. One uses buckets for bailing.

  11. Oblig. I.T. Crowd by XanC · · Score: 4, Funny

    What is Jen doing with The Internet??

  12. Re:Half the internet? Are you serious? by Frosty+Piss · · Score: 4, Funny

    A router takes out 'half the internet' and I learn this from Slashdot?

    Non, no, no. You messed up the troll and got modded "Insightful". Let me fix that for you:

    A router takes out 'half the internet' and this is front page news at Slashdot? Slow news day?

    Thank you, I'll be here all week...

    --
    If you want news from today, you have to come back tomorrow.
  13. Outage Cause: Old software by Anonymous Coward · · Score: 5, Informative

    The AS 47868 decided that they wanted to prepend their ASN about 75 or so times to their BGP announcements. When this got re-populated throughout the rest of the world, a bug in older versions of Cisco IOS still in use on many ISP/NSP networks does not like paths this long. As soon as they saw the prefix with that long of a path, the software terminated the BGP session, resulting in the doorway being closed between the two networks -- So on and so forth throughout the rest of the web.

  14. Make sure you are using cat 5 bailing wire. by tlambert · · Score: 4, Funny

    Make sure you are using cat 5 bailing wire.

    -- Terry

    1. Re:Make sure you are using cat 5 bailing wire. by egcagrac0 · · Score: 5, Funny

      Can't. It's Monday. No cheezburgers.

  15. It took out 9000 internets by need4mospd · · Score: 4, Funny

    In other words, a single mis-configured router is apparently able to cause a DOS for a huge chunk of the Net.

    This means the router was able to take out over 9000 internets. Quite impressive.

  16. Re:Intelligence Op by agm · · Score: 5, Funny

    They need to replace it with a network that is designed to survive a nuclear attack. Oh wait, hang on....

  17. Am I being too vauge? by HTH+NE1 · · Score: 5, Funny

    That's the problem. You shouldn't use rouge on your routers.

    They think a rouge router is in vouge, but they're out of their leauge. We should haranuge them! A plauge on them! Rip out their tounges so they cannot aruge! Them and their colleauges. Nothing but demagouges and idealouges I say. There can be no dialouge on this matter. Send them to the moruge!

    Are you intriuged by my ideas and want to subscribe to my travelouge?

    --
    Oh, say does that Star-Spangled Banner entwine / The myrtle of Venus with Bacchus's vine?
    1. Re:Am I being too vauge? by rts008 · · Score: 4, Funny

      *calls 911*
      I think I just witnessed a brutal murder...of a spell checker. Gotta hide my dictionary!

      --
      Down With Slashdot BETA!!! I've been around the corner and seen the oliphant; you can only abuse me from your perspecti
  18. Ye olde versions of IOS by DeadBeef · · Score: 5, Informative

    This only broke BGP implementations that are getting pretty long in the tooth now, on a moderately recent version of IOS all we saw is:

    Feb 17 05:25:03.731 nzdt: %BGP-6-ASPATH: Long AS path 10026 3356 29113 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 47868 received from xxx.xxx.xxx.xxx: More than configured MAXAS-LIMIT

    It was definitely an insane path, our routers were configured to drop anything with an AS path longer than 75, old versions of IOS would often just drop the BGP session ( or even crash with some _really_ old versions ).

    I'm sure there will be some red faced network engineers updating IOS or even doing forklift upgrades of old boxes at their edges in the near future.

    --
    I am a lawyer and this constitutes legal advice and I shall indemnify you against any losses arising from taking it.
  19. Only some old versions of IOS broke by lotaris · · Score: 5, Informative

    This only took down people running fairly old versions of IOS that didn't patch a known bug.

    Did not affect non-cisco.
    Did not affect modern versions of IOS
    Did not affect old versions of IOS that set the knob to limit the max as-path.

  20. Re:Pre-FUD propaganda by Hecatonchires · · Score: 4, Funny

    You left out 'updating your myspace page', 'writing poetry about how no-one understands' and 'cutting yourself'

    --

    Yay me!

  21. Re:Half the internet? Are you serious? by Anonymous Coward · · Score: 5, Funny

    A router takes out 'half the internet' and I learn this from Slashdot?

    Seriously, what is/was the impact? I work for a large e-commerce provider and haven't seen a thing that would indicate a problem today.

    Well I'm not sure about you.

    Personally, I have BIGGER news! A single router in a remote rural US state managed to take down the ENTIRE INTERNETS!!!!

    Yes, indeed when I noticed my cat had unplugged the power adapter, I replaced it. Then the ENTIRE internet came back! It was amazing how I single-handedly brought back the whole internets. Al Gore would be proud.

  22. Re:I'm not sure I follow by petecarlson · · Score: 4, Informative

    If I'm understanding this 'router' thing correctly, its like a faucet connected to the series of tubes?

    If not, exactly what role does this router thing play in tube interaction?

    Your understanding is rather accurate but what your missing is the manifolds. You see, all the tubes connect to big manifolds with valves to control what gets sent where. At each manifold room there is some poor admin who is in charge of opening and closing valves in order to make sure that the right AOL gets sent down the right tube. In order to keep track of what tube to send your AOL down, the admin keeps a list of all the other manifold rooms and how to get to them. Some of the manifold room operators didn't have a wide enough notebook to write down the new directions so they just closed all of their valves and went home.

  23. Lord of the Token Ring!!! by Genda · · Score: 5, Funny

    Welcome to Sauronet... One Router to Rule them ALL!!!!

  24. Re:Intelligence Op by kenj0418 · · Score: 5, Funny

    Don't worry, it wasn't a DOS attack. That was just the Internet becoming self-aware.

    OK, on second thought, maybe worrying is in order.

  25. Re:Intelligence Op by CarpetShark · · Score: 4, Funny

    Yeah, this was my first thought as well. It seems clear that the internet, while designed to route traffic through all sorts of alternate links, is almost certainly being routed through single, centralised listening posts at various intervals.

  26. Re:Intelligence Op by Anonymous Coward · · Score: 5, Funny

    The last time I experienced a DOS attack it evolved into Windows. Didn't come out of that one unscathed.

  27. Re:Intelligence Op by Medievalist · · Score: 5, Interesting

    They need to replace it with a network that is designed to survive a nuclear attack. Oh wait, hang on....

    Wish I had mod points today. Parent should already be SCORE:5 Funny. Apparently not enough Slashdotters know the history/evolution of the net.

    If you're referring to the myth that the Internet was "designed to withstand nuclear attack", perhaps Slashdotters know more than you think.

    The Internet was designed to allow distributed control, and to withstand telephone company malice and incompetence. This was a much more useful goal than withstanding nuclear attack.

  28. TAG THIS ARTICLE KDAWSONSUCKS by Anonymous Coward · · Score: 5, Insightful

    This "article" is incredibly misleading as nothing has really gone awry. It is just another pointless KDAWSON post. These things are getting REALLY old, KDAWSON.
     
    I work for a tier-3 provider, and if "half the Internet" dies, you are going to hear from a half-brained big media outlet (e.g CNN, ABC) VERY fast.

  29. Re:Intelligence Op by hardwarefreak · · Score: 5, Informative

    They need to replace it with a network that is designed to survive a nuclear attack. Oh wait, hang on....

    Wish I had mod points today. Parent should already be SCORE:5 Funny. Apparently not enough Slashdotters know the history/evolution of the net.

    If you're referring to the myth that the Internet was "designed to withstand nuclear attack", perhaps Slashdotters know more than you think.

    The Internet was designed to allow distributed control, and to withstand telephone company malice and incompetence. This was a much more useful goal than withstanding nuclear attack.

    One of the early arguments made by DARPA folks to politicians, in order to secure continued federal funding for packet switched network development, was the ability of the network to route around failed or destroyed nodes. They made this argument in the context of the cold war, of nuclear war.

    It reality, as you state, this argument had little practical impact on the technical development or evolution of the the network. However, it most certainly did have an impact on the commitment of federal/military funding. This is the origin of the "surviving nuclear attack" lore of the development of DARPANET. It's not a myth. It's real.

    Take Obama's current stimulus package as a parallel example. It's not going to solve the recession, but it's being sold as such. And the congress bought into it. Just as this stimulus bill isn't what it's being sold as, most likely DARPANET wouldn't have really given us what it was sold as at one point. Nonetheless, it was sold as such, thus creating the lore that you call myth.

  30. Mod parent up by mbone · · Score: 4, Informative

    Mod the parent up - this is the real cause of the problem.

    bgp maxas-limit 75

    would stop this on most routers.

  31. Re:Intelligence Op by TubeSteak · · Score: 5, Interesting

    One of the early arguments made by DARPA folks to politicians, in order to secure continued federal funding for packet switched network development, was the ability of the network to route around failed or destroyed nodes. They made this argument in the context of the cold war, of nuclear war.

    They made that argument in the context of a widely distributed POTS copper wire network.
    The infrastructure of today's internet is fiber based.
    And most of that fiber is consolidated in a small number of long backhaul runs.

    Remember that grad student whose thesis was classified because he gathered up public documents and mapped out the fiber runs that make up the domestic internet? They classified it (and pulled most of the references he used) because his analysis showed there were a few critical points which, if disrupted, would effectively fracture the domestic internet infrastructure.

    The internet isn't nearly as bulletproof as the DoD would like and there isn't much they can do about it short of laying new fiber that skips over the vulnerable points.

    --
    [Fuck Beta]
    o0t!
  32. Re:Intelligence Op by JWSmythe · · Score: 4, Informative

        Aw heck, someone in Nebraska is going to trip over one power cord, and shut down the Interweb. :)

        In addition to using public maps, I did a lot more research. I had my own little project going for a little while. The project was intended to monitor for faults between datacenters we had equipment in. I added the root nameservers. I also had a few other points, such as friends houses and places they had virtual hostings at.

        Simply enough, it was running traceroutes from everywhere I had control to all points in my "network". I stored what router attached to each hop in a database.

        I located each hop simply by the city it was located in. Some were easy. Some weren't so easy.

        It was fun and games with 100 routers. I was manually setting city and state locations.

        It was a little less fun when it grew to 500 routers. I wrote regular expressions to take known naming conventions and make them into city names. That sounds easy, but it gets pretty hard pretty quick.

        It was a lot less fun when the list grew to several thousand routers.

        Basically, ever time there was a routing change, I found new routers.

        I had a lot of fun using both Google Maps to show the routes (for routers that I could place in a city), and a Graphviz model of the Internet as we observed it. It was a very big map. That was only what we had observed. I doubt we even saw a very small percentage (probably less than 0.01%) of the routes.

        The map got very very very complicated. I could point out choke points. They existed, but there were also alternative routes.

        Hell, even on a single good provider, there are no good choke points. On one Tier 1 provider that I used, in a non-core city, they had 6 diverse routes with OC192's. It wasn't a matter of me trusting them when they told me. I saw the routes showing up.

        There are 4 cities in the US, where if say a big nuke hit each one, ya, the Internet would be hurting. You may not get from Provider A to Provider B, but you'd still have some connectivity within your own provider, and other peerings would start working fairly quickly. More obviously, you'd find that some sites that are hosted in one city would be inaccessible. That's why geographic and topological diversity is very important for anyone who wants to keep their stuff up and running.

        Google puts stuff out all over the place for a reason. If a route, or a dozen routes, go funky, you'll very likely still be able to reach some datacenter.

        My office is connected by 3 uplinks. They're all with different providers. The odds of a provider outage killing the office is pretty slim. Other things can happen though. Lightning hit a transformer across the street, which serviced our building. From what people on that side of the building said, it was very pretty. :) Was our Internet connection dead? No. Well, not totally. We still had 2 uplinks working. We didn't have power for the desktops though. The UPS (a big one, not the little desktop ones) provides for the server room and a very few workstations.

        The biggest effect we saw from that outage was that cell phone service became minimal. The top of our building is also used for cell phone coverage. Without those antennas working, we only had service from the surrounding towers. It probably didn't help that there was now an office building full of people who were evacuated to the ground floor (it tripped the fire alarm), so almost everyone were on their cell phones making calls to customers, friends, family, etc.

        The most upset people were stuck in the elevator. They were already going downstairs for a smoke break, when it got stuck because those aren't backed up with anything at all.

       

    --
    Serious? Seriousness is well above my pay grade.