Slashdot Mirror


Tier One ISPs Dying

xbmodder writes "Two tier one ISPs are down today. At about 23:30PST both Verio and Level 3 starting having problems with routes. According to Level 3 this is a software upgrade gone awry. Is this the end for Level 3?" Many, many reports about this are coming in, and if you're wondering why the stories were rather sparse overnight, it's because it's difficult to post them without internet access. Hope everyone else is back online too.

55 of 394 comments (clear)

  1. Could be good by sdo1 · · Score: 5, Funny

    Maybe I'll get some work done today for a change.

    -S

    --
    --- What parts of "shall make no law", "shall not be infringed", and "shall not be violated" don't you understand?
  2. Flicker by Oculus+Habent · · Score: 5, Funny

    Is there a term for this kind of intermittant site inaccessability due to Internet outage -- not the user or the server being offline, but the Internet failing?

    --
    That what was all this school was for... to teach us how to solve our own problems. -- janeowit
    1. Re:Flicker by aamcf · · Score: 5, Funny

      Actually, I think calling it an Online to Offline Provisioning Suituation in the Internet Environment would be better - "Call the engineers, there's been an OOPSIE!".

    2. Re:Flicker by Ando[evilmedic] · · Score: 4, Funny

      Kill yourself.

  3. Outtage Explained by MMyers5 · · Score: 5, Funny

    It's nice to see something explaining why I was paged at 2:30am. And now, to whom from Level3 do I send my bill?

  4. Call me silly? by Killjoy_NL · · Score: 3, Funny

    But what is a tier 1 ISP?

    Is that like a bandwidth wholesaler or something?

    --
    This is the sig that says NI (again)
    1. Re:Call me silly? by rylin · · Score: 4, Informative

      One that doesn't lease their infrastructure.
      Eg. you have your own large backbone, you own all your equipment.

      In effect, a small and wholly owned internet that peers with other internets.

    2. Re:Call me silly? by CvD · · Score: 5, Informative

      Tier 1 are the huge ISPs, which peer with eachother (and don't pay eachother transit fees) and sell transit services to smaller ISPs (which do pay fees to send traffic through the Tier 1 ISPs). So yeah, bandwidth wholesalers is pretty accurate. See this wikipedia article.

    3. Re:Call me silly? by Anonymous Coward · · Score: 5, Interesting

      In 1994, The National Science Foundation (NSF) awarded contracts to replace the National Science Foundation Net (NSFNet) Internet backbone. These contracts were for backbone transport, routing arbiter and traffic exchange points (NAPs).
                These contracts were awarded for the original 15 NSF sponsored NAPs, and to become a Tier 1 ISP, you had to have atleast DS3 connectivty to all 15 NAPs.
                It's a very old and crappy definition, and I wish people would stop using it, because it is very easy to meet now adays, and most of those original NAPs are now insignificant, compared to the power of the force.

    4. Re:Call me silly? by w4pso · · Score: 3, Interesting
      My understanding is that a Tier 1 ISP is now defined as one that has established free peering points with all other Tier 1 ISPs.

      Considering that free peering is likely only established between 2 networks that have close to a 1:1 bit exchange, this is a very high bar to meet.

  5. What is this about? by PlatinumX · · Score: 3, Insightful

    An ISP's server being down 1 day is unacceptable of course, but to say it is dying already? or is there more to these ISP's? (haven't heard of them before)

    --
    Bottles Of Beer On The Wall - Advertising Fun! Get your bottle of beer on the wall today!
    1. Re:What is this about? by neosake · · Score: 4, Informative

      This is not _just_ their server being down, this is the entire network of two tier 1 carriers.

      The (basic) implications of this is that a good chunk of the internet as a whole is inaccessible to the rest of the internet.

      --
      "When a ball dreams, it dreams it's a frisbee"
    2. Re:What is this about? by raddan · · Score: 4, Informative
      We're not talking about just a server. We're talking about the entire ISP's networking capability. Tier 1 ISPs own huge swaths of networks-- literally miles and miles of cable, and sometimes radio and other links. They route the traffic across these lines.

      When a Tier 1 provider goes down, their customers go down too. That picture on the Boing Boing page shows a list of the Tier 1 providers. Every ISP that is NOT a Tier 1, gets their access from a Tier 1.

      People speculate that Level 3 is dying because they've been making some really bad decisions lately, resulting in a lot of outages. A couple of weeks ago, they actively filtered out traffic from their competetor, Cogent, over a dispute from how much to charge at the point their networks exchanged traffic (called a 'peering point'). Now this. The rumor is that the company is in financial trouble.

    3. Re:What is this about? by Alioth · · Score: 3, Informative

      It's not a rumour that Level 3 is in financial trouble - it's clear for all to see. They have crushing debt repayments right now.

      The Cogent spat isn't over yet either - Level 3 are going to de-peer Cogent again on November 9th. They are trying to force Cogent to pay for transit, but right now it looks like Cogent holds the strongest hand and Level 3 will be once again forced to back down.

    4. Re:What is this about? by Jugalator · · Score: 4, Informative

      Tier 1 ISPs own huge swaths of networks-- literally miles and miles of cable, and sometimes radio and other links. They route the traffic across these lines.

      More precisely, Level3 seem to own 23,000 miles of optic fiber. :-)

      The rumor is that the company is in financial trouble.

      Yeah, not so much of a rumor anymore either -- Level 3 loss widens.

      --
      Beware: In C++, your friends can see your privates!
    5. Re:What is this about? by shokk · · Score: 3, Insightful

      Yeah, that's it. They're dead. You pinned it right on the nose.
      They won't be back tomorrow. All gone.

      Reality check: An internet outage, no matter how big, is no different than a power outage. Yeah, here in the US we would be talking about loss of power to both coasts with only the middle left running. But after the outage life goes on.

      --
      "Beware of he who would deny you access to information, for in his heart, he dreams himself your master."
    6. Re:What is this about? by HavocBMX · · Score: 5, Informative

      The reason that Level 3 isn't happy with the peering arrangement currently is that it's not even remotely even. Level 3 sends almost nothing over Cogent's network and Cogent sends over a vast majority of their traffic through Level 3. A peering agreement is based on the premise that the companies will be sending almost equal amounts of traffic through each network. Level 3 has been analyzing that for a time now but the last straw was when Cogent had a sales blitz targeting Level 3 customers saying that they would dramatically drop their prices to almost nothing to get them to switch away from Level 3. They are now also using the downtime that was experienced due to the peering problem in their advantage even though Cogent is in the wrong. Cogent knew about the depeering and did nothing to resolve it.

    7. Re:What is this about? by Alioth · · Score: 3, Informative

      Oh yes, I'm aware of all of that - but (generally speaking) Cogent has the content, and Level 3 has the users. Guess who catches the most heat from the de-peering from its customers - Level 3 - as their customers will tend to see the problem first.

      I predict that Cogent will do the same again as well - not lift a finger to fix the problem when they are de-peered on November 9th, and Level 3 will probably end up being forced to re-peer as customers whine that they are not getting the whole Internet and threaten to take up Cogent's free 1 year offer.

  6. Guess not by springbox · · Score: 4, Informative

    Take a look at the scoreboard now. The mentioned problems are gone and Level 3 is no longer in the red.

    1. Re:Guess not by LurkerXXX · · Score: 4, Informative

      Uh no. Level3 disconnected from ALL the other Tier1 ISPs. So did Verio. This had nothing to do with last weeks Level3/Cogent spat.

    2. Re:Guess not by Dun+Malg · · Score: 5, Interesting
      The scary thing is it makes you wonder is some terrorist who has intimate knowledge of how Tier 1 ISP's work doing a trial run in the middle of the night by knocking out Level 3 and Verio backbones so later they could try to knock out ALL the backbones in a co-ordinated terrorist attack. (eek!)

      Oh please. You know, it's pretty easy to figure out if it's something likely to be attempted by terrorists or not. The simple test is does it cause mass "terror". As annoying as it might be, lack of internet access is an annoyance. Perhaps a very expensive and exasperating annoyance, but it won't cause mass terror. Terrorists prefer things like bombs, or poison gas, or disease. Some other things people get worked up about but terrorists are unlikely to attempt: sabotaging bridges and tunnels to cause traffic jams; sabotaging electricity distribution to cause blackouts; sabotaging railroad tracks, making commuters late for work!. Think DEATH, not irritation. Quit with the automatic "terrorist hysteria" already, people!

      --
      If a job's not worth doing, it's not worth doing right.
  7. Noticed it this morning by discord5 · · Score: 4, Interesting

    Noticed this this morning when a customer called upset about his hosting services being unreachable. A quick traceroute showed one of level3's ip to be down. A few minutes later more customers had problems with different routers from level3. As soon as I saw level3 I knew enough, shrugged it off and told the customer that it was routing problem we couldn't fix but those responsible were most likely already trying to fix it.

    It seems fixed now though, so no, this isn't the death of the Internet just yet.

  8. This just in... by mtec · · Score: 3, Funny

    The reports of my death have been greatly exaggerated.
    Tier One

    --
    Cake or Death? Cake Please!
  9. TEOTWAWKI by D4C5CE · · Score: 4, Funny
    Is there a term for this kind of intermittant site inaccessability due to Internet outage -- not the user or the server being offline, but the Internet failing?
    Yes. Domesday as predicted in the ancient scrolls. In this day and age it is commonly called The End of the World as We Know It.
    1. Re:TEOTWAWKI by Ripp · · Score: 5, Funny

      The End of the World as We Know It.

      Sure, but I feel fine.

      *shrug*

      --
      Blech. Signatures.
    2. Re:TEOTWAWKI by Lucractius · · Score: 3, Funny

      *calls the RIAA for stealing a line from a song*
      SICK HIM!!!

      --
      XML - A clever joke would be here if /. didn't mangle tag brackets.
  10. /.'ed by connah0047 · · Score: 5, Funny

    Hey look, we slashdotted Level 3!

  11. Non event... for now by elfguygmail.com · · Score: 5, Informative

    While this only lasted a few hours, it still caused a mess across the North American Internet during those hours. The point is a small amount of big networks are responsible for over 90% of the traffic on the Internet. If alter.net went down it would be total chaos. If just one of the major peering points went down, sure the traffic would be rerouted, but overloading the other points at such high latency that it would be almost unusuable. You better hope no one destroys MAE-EAST or we'll have a live example of what ife without the Internet is like.

    1. Re:Non event... for now by Quixadhal · · Score: 4, Funny
      You better hope no one destroys MAE-EAST or we'll have a live example of what ife without the Internet is like.

      Yeah, I remember life before the internet. I used to read books, watch TV, and even occasionally go outside under that big yellow face.

      *shiver*

  12. Re:point of the internet? by Detritus · · Score: 4, Insightful

    Nope. Redundancy and reliability cost money. Fast, cheap, reliable, pick two. Take a look at a typical network and count the single points of failure. Then there are common mode failures, like bugs in router software, that can take down entire networks.

    --
    Mea navis aericumbens anguillis abundat
  13. Link in the article by Alphabet+Pal · · Score: 4, Funny

    I notice the article links back to Slashdot... I wonder is Slashdot is going to get BoingBoing'ed?

    --
    Because you can't spell "slaughter" without "laughter"
  14. What kind of Timeframe by PhraudulentOne · · Score: 3, Interesting

    Is this even an issue? I mean, this was probably scheduled maitenance that went a little longer than expected. I have been through this before. It just sounds like Level 3 dropped some core routers for a few minutes to do a code upgrade - it didn't work so hot, so they were down for a few more mintes, OSPF/BGP decided to tell all the clients that they have no routes, Level 3 gets the routers back up, OSPF/BGP tells everyone that their fine again. Was this like 6 hours, or 45 min?

    --
    You create your own reality - Leave mine to me.
    1. Re:What kind of Timeframe by Jugalator · · Score: 3, Informative

      It was maybe 2 hours or so before new routing tables started spreading to bypass Level3's and Verio's networks, and afterwards it started stabilizing again, then it seems Level3 has since then woke up again. The XO network also had routing troubles from this btw, maybe more too. Sites and services such as AOL, SpeakEasy (when asked, they were stumped and could only say it affected all their customers, hehe), Google, and Wikipedia had access problems depending on where you lived during this timeframe.

      --
      Beware: In C++, your friends can see your privates!
    2. Re:What kind of Timeframe by Anonymous Coward · · Score: 3, Informative

      I dropped my BGP session to Level3 but they did not retract the routes, so not only could they not route my packets but they claimed (via the routing table) that they still could. From my vantage point (Chicago) the problem was resolved in about an hour,

  15. So much for the internet surviving a nuclear war! by Viol8 · · Score: 3, Insightful

    Now it can't even survive a software upgrade on some of the routers!

  16. Is the Internet Down? by PortWineBoy · · Score: 5, Funny

    Why couldn't this have happened during my business day? For just once when a user calls and asks "is the internet down?" I'd like to be able to say "actually, yes, it is."

    --

    this sig deleted by another sig

  17. Re:huh?? by John+Nowak · · Score: 5, Funny

    Don't you get it? They deleted the internet!!

  18. Re:Isn't God trying to tell us something? by m50d · · Score: 3, Insightful

    People have been able to say something like that at every point in history. And I'd hardly call this nastier than hurricanes, and the Tsunami was worse than either them or this. The sky is not falling.

    --
    I am trolling
  19. Nitpick by s20451 · · Score: 4, Informative

    Domesday is something like a census of Britain circa 1085. It has nothing to do with internet outages, which is more akin to doomsday.

    --
    Toronto-area transit rider? Rate your ride.
  20. Overlay Routing by omnirealm · · Score: 4, Informative

    This sort of event provides motivation for overlay routing schemes, which can compensate for major outages along various routes of the backbone:

    http://www.usenix.org/events/nsdi04/tech/full_pape rs/subramanianOver/subramanianOver.pdf
    http://www.eecs.umich.edu/~farnam/pubs/2005-hwj-in focom.pdf

    --
    An unjust law is no law at all. - St. Augustine
  21. flapping by SpectralDesign · · Score: 5, Interesting

    Way back in the day when I was a Network Controller at BBN Planet, if we began to have cascading routing outages we'd call it "Flapping"... Visualize a wounded bird squirming around on the ground flapping...

    Takes me back... My first night on the job a rat in Berkeley chewed through the wrong cable and got himself fried -- he also happened to take the entire west-coast off the internet for the better part of a day.

    Then there was the time an electrical worker got vaporized in a hole near MIT which caused quite a problem too as it overloaded the MIT power station, but the fallout wasn't nearly as bad as the day of the rat...

    --
    Be who you are and say what you feel, because those who mind don't matter and those who matter don't mind. - Dr. Seuss
    1. Re:flapping by rah1420 · · Score: 5, Interesting

      halon defeat

      OT, but it brings back memories of working at Purolator Courier in the machine room. IBM mainframe shop.

      We had had trouble with the damn fire suppression all day. On third shift, around 3 AM, the trouble alarm went off (again) for the umpteenth time. One of the operators, a nervous fellow who was a little bit green, went over to the annunciator panel and opened it to see what the Trouble Might Be.

      A fire technician he was not, and he apparently didn't know the difference between the trouble bell and the klaxon that would sound when a halon dump was about to occur; so he reached around the open panel door and hit the halon defeat.

      Or so he thought.

      It was actually the Big Red Switch.

      The whole room (full of 3420 and 3480 tape drives, the 3745s, the 3800 laser printers; and the floor above, containing trivial bits like the DASD and the CPU all plunged into a deafening silence.

      We all stared at each other and at the newbie BOFHeck.

      A few minutes later, the phone rang. It was the Indianapolis air hub for Purolator, wondering why (when they were about to receive about 150 planes from all over the country) they didn't have anything useful displayed on their green screens.

      That was a fun morning.

      Ah, those were the days indeed.

      --
      Mit der Dummheit kämpfen Götter selbst vergebens.
  22. Re:Looks good now... by afd8856 · · Score: 5, Funny

    First you fail to create a good link, and then that link goes to a login screen? Your link posting rights have been removed.

    --
    I'll do the stupid thing first and then you shy people follow...
  23. The main problem areas... by dantheman82 · · Score: 3, Interesting

    happened in Detroit in the last 24 hours. Apparently all ingoing/outgoing traffic to other Tier One ISPs had problems in that city. Also, Philadelphia had really slow traffic within Level3 (and slower to all the others), and had major problems connecting to Verio. San Diego also had some problems, especially within the Level3 network. St. Louis was the only area without major problems...

    For a breakdown, check out this view of the data.

    --
    This sig donated to Pater. Long live /.
  24. Microsoft? by Frankie70 · · Score: 3, Funny

    Is there any way we can blame Microsoft for this?
    Were they upgrading to one of the Beta builds of Windows Vista Home Edition?

  25. You insensitive clod! by Anonymous Coward · · Score: 3, Funny

    I'm not back online

  26. FFS, what a fucking dreadful summary by Cally · · Score: 5, Insightful
    Is this the end for Level 3?

    No, of course not, you blithering imbecile. L3 had a 2 hour global routing meltdown. Now, it's fixed. Whilst their routes were flapping, other carriers saw transient increases in latency and some problems with reachability, to some sites. However, everything continued to work properly for non-L3 customers. Two hours later L3's routes are back and working properly. End of story, nothing to see here, move along please.

    Slashdot editors, do you really expect us to believe that no-one had submitted a more coherent or accurate story than this one? Come on, for heaven's sake.

    Anyway, a network engineer's view can be seen in the overnight traffic on NANOG: http://www.merit.edu/mail.archives/nanog/2005-10/ "Tier One ISPs dying" indeed. Worst. Story. EVER.

    --
    "None are more hopelessly enslaved than those who falsely believe they are free." -- Goethe
  27. X is Dying by Christianfreak · · Score: 4, Funny

    Glad to see that Tier 1 ISPs are joing the ranks of BSD and Apple.

  28. Re:Tier 1's dying? OMG! by hoborocks · · Score: 3, Funny
    --
    AccountKiller
  29. Clarification by Just+Some+Guy · · Score: 4, Informative

    I'm sure you know this, but for the rest: "flapping" is the common term for when a router's routing tables rapidly cycle between two invalid states. The dead bird analogy is pretty descriptive, but the term "flapping" has technical and not allegorical origins.

    --
    Dewey, what part of this looks like authorities should be involved?
  30. Nitpick to the max: Ye Olde Spell(ing)s by Anonymous Coward · · Score: 3, Informative
    nothing to do with
    Your own first link explains:
    "Domesday" (Middle English spelling of "Doomsday")
    We regret to inform you that have failed the nitpicker's exam. Better luck next tyme.
  31. Comment removed by account_deleted · · Score: 4, Funny

    Comment removed based on user account deletion

  32. Re:Looks good now... by EnronHaliburton2004 · · Score: 4, Funny

    Don't blame him! This was a software upgrade gone awry.

  33. Re:Nitpick - gang of thugs by Kenshin · · Score: 3, Interesting
    It doesn't speak much for the Slashdot community when Wikipedia has to put this warning at the top of a Slashdot-linked page:

    This article has recently been linked from Slashdot.
    Please keep an eye on the page history for errors or vandalism.
    --

    Does it make you happy you're so strange?