Slashdot Mirror


LiveJournal Servers Go Down

Wind writes "According to any journal hosted off of LiveJournal.com, the LiveJournal data center Internap has suffered a critical power failure, leaving all of LiveJournal and its content temporarily offline and requiring the revival of 100+ servers. Perhaps Six Apart wasn't quite prepared for the responsibilities of a website of this size? Updated information is posted here."

596 comments

  1. Lights out by r_glen · · Score: 5, Funny

    Sounds like someone was taking a nap over at Internap

    1. Re:Lights out by nocomment · · Score: 1

      hehe at least they know what's going on, and have a chance at fix%^J)g...tk%503 Service Unavailable.

      --
      /* oops I accidentally made a comment, sorry */
      /* http://allyourbasearebelongto.us */
  2. The Pain ... by webfiend · · Score: 5, Funny

    You can't imagine the withdrawals I'm going through. It's like the great Slashdot brownouts of '98.

    I need my fix, man!

    1. Re:The Pain ... by MightyTribble · · Score: 1

      my god, I remember those! Days without Katz ... how did I survive?!

    2. Re:The Pain ... by thoughtcriminal87 · · Score: 4, Funny

      Or the great Slashdot 503s of '04

    3. Re:The Pain ... by SunPin · · Score: 1

      You mean 98 days ago?

      --
      Laws are for people with no friends.
    4. Re:The Pain ... by spitstatic · · Score: 1

      i've been hitting my friends page for hours and hours -- i am there with ya.

      --
      This is my signature.
    5. Re:The Pain ... by DrEldarion · · Score: 5, Funny

      Honestly! Now we have to wait a day or so to find out what MelissaMinx492 ate for breakfast today!

    6. Re:The Pain ... by Anonymous Coward · · Score: 0

      Usernet porn.

    7. Re:The Pain ... by Lord+Omlette · · Score: 1

      I think the really big slashdot knockout, where it went down for a couple of days (and there was all this drama), generated some great discussion. Especially when a short while afterwards, Hotmail (or MSN?) went down for almost a week and when it came back, Microsoft didn't say a word about it.

      --
      [o]_O
    8. Re:The Pain ... by webfiend · · Score: 1

      I'm trying to remember. Was the Hotmail outage before or after they switched off of FreeBSD?

    9. Re:The Pain ... by Anonymous Coward · · Score: 0

      There's a thread in the fansite forum which describes today's meals.

    10. Re:The Pain ... by Kenshin · · Score: 1

      I was planning on going out tonight... but got caught late at the office this evening and got home too late to.

      I was gonna bitch about it on LJ, and now I can't. /admittedly pathetic //wants to go out, drink, and pretend to dance

      --

      Does it make you happy you're so strange?

    11. Re:The Pain ... by Nykon · · Score: 1

      Are you crazy if it wasn't for LJ I'd have to leave my house to find out what was going on in my friend's lives.

      --
      "It's better to be a pirate then join the Navy"
    12. Re:The Pain ... by MikeXpop · · Score: 4, Funny

      Don't worry, it was waffles. But her dad used the last of the syrup. Man, he never can think of her needs, can he? I mean really, what's his problem?

      --
      Etiquette is etiquette. He kills his mother but he can't wear grey trousers.
    13. Re:The Pain ... by RichardX · · Score: 1

      ...Or the great Wikipedia inaccessability of uh.. last night

      --
      Curiosity was framed. Ignorance killed the cat.
    14. Re:The Pain ... by Issue9mm · · Score: 1

      After, and then after again... and then there was that time they forgot to renew the domain name, and then there was that time they went down again... and then again. And then... [loop]

      -9mm-

    15. Re:The Pain ... by RichardX · · Score: 4, Funny

      Man, I need them to get those servers back up! I've got a whole pile of journals to not read. I'm getting behind on my ignoring

      --
      Curiosity was framed. Ignorance killed the cat.
    16. Re:The Pain ... by AndroidCat · · Score: 1

      Bah! I ignore a far higher grade of journals than that!

      --
      One line blog. I hear that they're called Twitters now.
    17. Re:The Pain ... by nike2422 · · Score: 1

      I'm with you there. I'm seriously jonesin because I can't have my LJ fix.

      They must have an AA group for us somewhere.

      --
      What Would Scooby Do?
    18. Re:The Pain ... by elemental23 · · Score: 1

      No one's breakfast journal could beat LJ user liza31337's. Unfortunately, she stopped updating it a while back after doing it for a full year.

      That link isn't going to work right now, obviously.

      --
      I like my women like my coffee... pale and bitter.
    19. Re:The Pain ... by henni16 · · Score: 1

      Didn't you mean:
      "Don't worry, it was, like, waffles. But her dad like used like the last of the syrup. Like, man, he never can think of, like, her needs, can he?"

    20. Re:The Pain ... by Anonymous Coward · · Score: 0

      Ahhh! So Anne Thomlinson works for LiveJournal now.

    21. Re:The Pain ... by Zarhooie · · Score: 1

      It's Microsoft. That's all you can really say on that subject. Microsoft doesn't give a shit what its users think about their product, as long as they keep using it. They're going to need to change their policies with the advent of useable Linux and Macintosh interfaces. You don't have to be super special or talented with an OS to use the alternate OS's anymore, so there's been a lot of drift towards their products. Add in the fact that any downtime on a website that has a comparable popularity to hotmail is explained almost immediately, and you find millions of users migrating towards services like Gmail.

      --
      Why die today if you can live tommorrow?
    22. Re:The Pain ... by IrishMASMS · · Score: 1

      Do not fret, her boyfriend took care of her needs last night! (or was that his needs?)

      Unless they live in 'Arkansas', then it is her & her Dad's needs! ;)

  3. Perhaps not. by entitude · · Score: 0, Offtopic

    We should tell them so. =)

    --
    ----geppy -
  4. hmm... by signingis · · Score: 1

    Oops?

    --

    I prefer a void in conversation to a vacuous one.
  5. Elsewhere by jm92956n · · Score: 4, Funny

    In related news, 6,000 teen-age girls were heard yelling "OMG! WTF! How will John know I life him if I can't blog about it!"

    --
    An effective signature identifies a particular user amongst a base of thousands.
    1. Re:Elsewhere by barryman_5000 · · Score: 3, Funny

      Thats right, sometimes I go by the name John.

    2. Re:Elsewhere by Anonymous Coward · · Score: 0

      (not a spelling nazi post) is "life" some term young people use, or a misspelling of "love"?

    3. Re:Elsewhere by barryman_5000 · · Score: 1

      I think he implies that the girl live for Me.

    4. Re:Elsewhere by Anonymous Coward · · Score: 0

      Or just a jest on the fact that these people can't type and never, ever, proof read.

    5. Re:Elsewhere by nikanorov · · Score: 1

      Only in USA. In russia for ex. it's best place to found all web developers and famous people.

    6. Re:Elsewhere by sendai2ci · · Score: 1

      6000? They have over five million members, with half of those active...

    7. Re:Elsewhere by kd5ujz · · Score: 1, Insightful

      I think he intended "like".

      --
      -William
      God is everything science has yet to explain.
    8. Re:Elsewhere by Fractal+Dice · · Score: 1

      So you're saying it should be called "she-dot"?

    9. Re:Elsewhere by Anonymous Coward · · Score: 0

      I heard in russia, LJ blogs you.

    10. Re:Elsewhere by spac3manspiff · · Score: 1

      I wish I was this john you speak of.

    11. Re:Elsewhere by sommie · · Score: 0

      Quite possibly 'love'.

    12. Re:Elsewhere by LGagnon · · Score: 4, Funny

      This is LiveJournal we're talking about; I think his spelling came out pretty accurate.

    13. Re:Elsewhere by metamatic · · Score: 0, Troll

      Never mind the teenage girls, where will the poor trolls go now that the community that nurtures them is gone?

      --
      GCHQ Quantum Insert installed. If only our tongues were made of glass, how much more careful we would be when we speak
    14. Re:Elsewhere by DaRiachu · · Score: 1

      Wrong; he used punctuation. :)

    15. Re:Elsewhere by WebCrapper · · Score: 1

      I seriously thought it was supposed to be like that though. I laughed when I saw it since that's how they write these days.

    16. Re:Elsewhere by Anonymous Coward · · Score: 0

      I think some of them have found this forum....

    17. Re:Elsewhere by TsukiKage · · Score: 1

      Well, actually...

    18. Re:Elsewhere by Anonymous Coward · · Score: 0

      I am one of those 6,000 teenage girls! but his name is matt, not john.

  6. ./ed !!!! by ikkibr · · Score: 1

    Well, it wasn't slashdot atleast... Bringing 100+ servers back online isn't an easy task lol ^^.
    Good luck to them.

    1. Re:./ed !!!! by idiotfromia · · Score: 1

      Yeah, but the slashdotting will do nothing but further hinder their work to get everything back up.

    2. Re:./ed !!!! by Anonymous Coward · · Score: 0

      How do you 'slashdot' a server that is only serving up a 276 byte text file?

    3. Re:./ed !!!! by Anonymous Coward · · Score: 0

      By linking from slashdot

    4. Re:./ed !!!! by Xiridion · · Score: 1

      Bringing 100+ servers back online isn't an easy task lol ^^

      Not too bad if you realize that you just triped over the power cord to the server farm....

      O.O;;

    5. Re:./ed !!!! by Anonymous Coward · · Score: 0

      No you silly prick...it's dynamic content that brings servers down...when /.ed any how...

    6. Re:./ed !!!! by Anonymous Coward · · Score: 0

      ah, dynamic content or no, without the bandwidth, you're screwed

    7. Re:./ed !!!! by Anonymous Coward · · Score: 0

      Er...it's only 276 bytes. If you hit it a million times, it would still be only 276 Megabytes... Most basic hosting packages offer gigabytes of bandwidth per month with usually 100 MBps throughput.

    8. Re:./ed !!!! by Hooded+One · · Score: 3, Informative

      You do realize that LiveJournal handles far more traffic than Slashdot, and when Slashdot got linked on the front page of LJ, Slashdot started spewing out errors (more than normal).

      Oh hey, Slashdot just went down as I was typing this. Smooth.

    9. Re:./ed !!!! by WebCrapper · · Score: 1

      I had to look at this to see for myself. While I can see where its possible, after looking at it and thinking about it - I don't think its true in the context that you meant it.

      Alexa/Amazon has no way of knowing true traffic of a website without actually getting access to the logs. I would believe that this report is based off of searches and other factors (link traffic and response time maybe?).

      Now, in that context, I can believe that Mary, the 12 year old, is going to search for her website instead of actually typing it in...

      I decided not to be a lazy as normal. Further research shows:
      "Over the last six years Alexa has been crawling and archiving the net at an unprecendented rate, delivering web intelligence to millions of toolbars, and mining the net and web usage paths for traffic data, related links and other data."

    10. Re:./ed !!!! by Anonymous Coward · · Score: 1, Insightful

      No, it is based off them having toolbars installed on people's computers monitoring (with the user's semi-agreement) what URLs they hit.

      It all depends on their sample representing the internet as a whole, whatever that is.

    11. Re:./ed !!!! by WebCrapper · · Score: 1

      Thus the "mining" in the above quote... So my initial theory was flawed - at least I did the research to point it out :-P

      I don't agree with what Alexa does, but your average geek won't have alexa installed. I expect most of the "Slashdot" hits to be the younger crowd or people that are curious. While my wife has the Google toolbar installed on her laptop, I actually just enjoy opening a new tab and going directly to the site. Call me old fashioned, but thats how I work. These days, anything can watch you.

    12. Re:./ed !!!! by Hooded+One · · Score: 4, Informative

      The Alexa link was the only tangible example I could find. I distinctly recall seeing a post by Brad himself mentioning how much more traffic LJ handles, but obviously I can't link to it at the moment.

      Anyway, as of Google's last crawl of the stats page (shortly before the outage), there were almost 6 million LJ users, a little under half of those "active." I don't know if /. has any stats available, but skimming through this page, the highest UID I see is in the 800,000 range. I'm not going to even attempt to guess what the relative activity level of LJ users is compared to /., or which has bigger pages or whatever, but I would offhand say that LJ probably handles more image traffic (user pictures, and now the in-testing photo hosting service). I know they used to use Akamai for that, but I seem to recall that fairly recently they switched over to doing something else. (I think they handle it themselves again, but I'm not sure.) There's also the audio files from phone posts. I'd say there's little question that LJ is the more heavily trafficked site.

      Besides, a lot of the DB load on Slashdot is eased tremendously by Memcached, developed by... Danga Interactive, i.e. LJ. Wikipedia uses it too, and just started using Perlbal. (And I do mean "just") Ditto for Audioscrobbler/Last.fm. So /. isn't in much of a position to pooh-pooh the technical ability of Brad/LJ.

    13. Re:./ed !!!! by Bob+Wehadababyitsabo · · Score: 1

      The case could be made that /.'s traffic totals are depressed compared to other sites because of the more techy crowd that visits. They're not likely to have the alexa spyware installed.

      --
      fsck -u
    14. Re:./ed !!!! by utopianfiat · · Score: 1

      even better is the fact that xanga gets far more traffic than either one. Imagine what would happen if THEY blacked out. :o

      --
      +5, Truth
    15. Re:./ed !!!! by Hooded+One · · Score: 1

      The difference is that Xanga has all the lame 14-year-old walking stereotypes as LJ, but not a single redeeming quality.

  7. In other news... by Anonymous Coward · · Score: 5, Funny

    ...the collective IQ of the internet has raised about 20 points.

    1. Re:In other news... by Anonymous Coward · · Score: 2, Funny

      ...the collective IQ of the internet has raised about 20 points.

      So just imagine the result of when Slashdot dies!

    2. Re:In other news... by Rie+Beam · · Score: 1

      ...to a grand total of 20...

    3. Re:In other news... by studboy · · Score: 1

      err, have you read Slashot at -1 lately?

    4. Re:In other news... by Anonymous Coward · · Score: 0

      ...and you're doing your best to reduce it.

  8. Sucks for LiveJournal... by whiteranger99x · · Score: 2, Funny

    but that's ONE HELL of a Slashdotting! :)

    --
    Join the TWIT army now!
  9. slashdot has repeated 503 errors, by Anonymous Coward · · Score: 5, Insightful

    and search.pl is constantly being trashed by distributed xanga botnets. perhaps michael wasn't quite prepared to be an editor of slashdot?

    1. Re:slashdot has repeated 503 errors, by stupidfoo · · Score: 5, Insightful

      How is this a troll? It's funny that an "editor" at site with as many problems as slashdot has feels that it isn't amazingly hypocritical to mock another site that is currently having problems. People in glass houses indeed.

      Slashdot has semi-major problems almost every day. 503 errors, "nothing for you to see here" annoyances, and a search engine that goes down more than a Thai hooker.

    2. Re:slashdot has repeated 503 errors, by webfiend · · Score: 2, Funny
      Slashdot has semi-major problems almost every day. 503 errors, "nothing for you to see here" annoyances, and a search engine that goes down more than a Thai hooker.

      Oh man, that would be one *fantastic* search engine! Is there a Google beta for this?

    3. Re:slashdot has repeated 503 errors, by Anonymous Coward · · Score: 0

      It's the /. search engine, that means two things:

      1) it's essentially beta anyways
      and
      2) even if it goes down more than a Thai hooker, it still won't find what you want it to find

    4. Re:slashdot has repeated 503 errors, by elemental23 · · Score: 3, Informative

      Perhaps you're new here, but italicized text in Slashdot stories is written by the story submitter. Editorial comments, if any, are not in italics. In other words, Michael didn't say anything at all in this story.

      That said, the story submitter is clearly trolling himself, as neither 6A's nor LJ's staff had anything to do with the massive power failure at their co-lo.

      --
      I like my women like my coffee... pale and bitter.
  10. Got petrol? by Anonymous Coward · · Score: 0

    Someone forget to buy gas?

  11. Internap is *down*? by MightyTribble · · Score: 5, Informative
    Internap *down*?
    Bush just appointed Internap's CEO to his National Infrastructure Advisory Council, yet the man can't keep a co-lo facility switched on.

    I'm not sure what that says of Bush or of Interap. And it certainly doesn't seem to have anything to do with SixApart.

    1. Re:Internap is *down*? by Anonymous Coward · · Score: 0

      It says nothing of Bush or Internap. It says everything about cheapskate blog admins who think they can run servers without paying for battery backup.

    2. Re:Internap is *down*? by gl4ss · · Score: 1

      * Re:Internap is *down*? (Score:0)
      by Anonymous Coward on Friday January 14, @10:52PM (#11370963)
      It says nothing of Bush or Internap. It says everything about cheapskate blog admins who think they can run servers without paying for battery backup.*

      not really. 100+ servers going down and getting major downtime because turning them on is slow is a major cockup. what would be intresting to know would be if some of those 100+ servers were indeed on some plan or another that implied they'd be on battery backup. perhaps even the switches were down...

      --
      world was created 5 seconds before this post as it is.
    3. Re:Internap is *down*? by gl4ss · · Score: 2, Interesting

      replying to myself. mod my other post down or something.

      "Our data center (Internap) lost all its power, including redundant backup power, for some unknown reason. (unknown to me, at least) We're currently dealing with bringing our 100+ servers back online. Not fun. We're not happy about this. Sorry... :-/ More details later.

      Update #1, 7:35 pm PST: we're up on 'dirty' power for now (it works, but it's unreliable), and we're working to assess the state of the databases. The worst thing we could do right now is rush the site up in an unreliable state. We're checking all the hardware and data, making sure everything's consistent. Where it's not, we'll be restoring from recent backups and replaying all the changes since that time, to get to the current point in time, but in good shape. We'll be providing more technical details later, for those curious, on the power failure (when we learn more), the database details, and the recovery process. For now, please be patient. We'll be working all weekend on this if we have to."

      soo.. EVEN THE BATTERY BACKUP WAS FUCKED. GOOD GOING.

      --
      world was created 5 seconds before this post as it is.
    4. Re:Internap is *down*? by slashrogue · · Score: 4, Interesting

      Sorry, but one man can't control power to an entire co-location. I used to work for a local telecom and one day our fiber went down for about 1/3 of our customers. The reason? Some guy shootin' squirrels blew the fiber lines apart. It was on a Sunday too, when there was minimal phone tech support, I think one guy ended up fielding 350+ calls by himself.

    5. Re:Internap is *down*? by Cramer · · Score: 4, Interesting

      I recall a story from a CP&L (now Prgress Energy) lineman about how someone had shot the fiber line strung on the high voltage tower -- between the million volt lines. As he put it, you put a pencil through the hole and line up where the "hick" (his word) proped up the rifle to shoot what he must've thought was a power line. It no fun working up there. And even less fun having to make fusion plices surrounded by million volt lines. (linemen work up there, not fiber techs.)

    6. Re:Internap is *down*? by slavemowgli · · Score: 1

      Still - if you're providing hosting, especially for large customers that probably pay quite a bunch of good money, then you should have decent UPS systems in place, and if one man *can* control that, then it probably is the CEO.

      Now, you might say that they *do* have good systems, but quite honestly, I don't think that's the case. The point of an UPS is, after all, not that it can sit there and look pretty and enable you to tell customers how great your company is and how well-prepared you are for all eventualities; its point is to get the job done and provide backup power in case there is a power outage, for as long as is necessary until the main problem that led to the power outage in the first place can be fixed.

      It's still entirely possible that you can't ultimately do something about the whole thing, of course - if the whole power grid in the state you're in goes down for a week or something like that, then you can't really blame the hosting company.

      But when it's not something like that, well... I, at least, do expect a UPS that allows me to run my sites without having to shut them down, then. If that's not possible, then either the UPS is not good enough (because it doesn't last for as long as necessary to allow the tech guys to fix the problem), or the tech guys aren't (because they aren't able to fix the problem before the UPS goes down, too). And in either case, I'd definitely consider switching to a different (read: better) hoster, *especially* when something like this happens more than once (in the case of Livejournal, it's the second time already).

      --
      quidquid latine dictum sit altum videtur.
    7. Re:Internap is *down*? by Koutarou · · Score: 1

      Things can also happen even if there are redundant UPS and generators in the picture.

      Last year I had a power failure on some of my equipment hosted at C&W in Tokyo where one of the power distribution bars between the UPS/generators and my equipment failed.

    8. Re:Internap is *down*? by gblues · · Score: 1

      Ah, yes. I used to do support for MSN, and once we experienced an honest-to-God BIF.

      "What's a BIF?" you ask yourself?

      Backhoe-Induced Failure.

      Nathan

    9. Re:Internap is *down*? by ces · · Score: 2, Informative

      The co-location facility in question has plenty of backup UPS power with plenty of generator capacity behind that. Supposedly there is enough generator capacity to fully power everything in the building including the network TV station even with one generator out.

      The UPS gear in Internap's space is all top-of-the-line big datacenter grade stuff. Apparently there was some sort of wiring fault in one of the new UPSes they were bringing online that caused both building power to fail and the self-protection circuits in all of their UPSes to trip.

      IOW it was either a faulty UPS or a faulty wiring job by the electrical contractor.

      Livejournal isn't the only ones who got burned by this outage. The colocation facility in question is supposedly one of the most solid in the state and nothing short of a direct strike from a comet is supposed to be able to take it offline. My company was in the same boat as our gear is in the same facility as LiveJournal's.

      Sure both LiveJournal and the company I work for could have hedged our bets by having redundant gear in another facility in another state, but that is a pain in the ass especially when backend databases are involved. To tell you the truth it probably isn't really worth the bother unless you truely have a need for six nines of uptime.

      --
      Happy Fun Ball is for external use only.
    10. Re:Internap is *down*? by Detritus · · Score: 1

      If there are two ways to wire an electrical circuit, it will be wired in the way that causes the maximum amount of equipment damage.

      --
      Mea navis aericumbens anguillis abundat
    11. Re:Internap is *down*? by imsabbel · · Score: 1

      True story: back when i was in the (german) army, i was in the it department of a translation/signal tracking station (dont know the english name for such a thing).
      THe whole building had two power circuits. Normal one and failsafe one (with all the worksations and servers on it)
      We had plenty of usv, a 50kW generator in the basement, all good.
      Until they started a building job and needed to disassable some wirering going through a cabletunnel into the room they were cleaning up.
      And of course somebody cut the main line of the secure grid....
      Was really funny. Light stayed on, radio stayed on, coffee machine went on, all servers went down...

      There isnt anything that "cant" break.

      --
      HI O WISE PRINCE. WHT TOOK U SO DAM LONG?
    12. Re:Internap is *down*? by 1lus10n · · Score: 1

      I could be wrong, but as it currently stands LJ has all of their equipment in just the one location. If your running something as widely used (and has heavily dependant on usage) as a major blog site you should have some form of redundancy. Its not as much of a pain to setup as it is to recover from something like this. Last I checked they were already working on 12 hours of downtime.

      Not sure about the stats regarding LJ in specific but I am guessing this is costing them some major money (in relative terms) in the short-term, and untold amounts in the future. Mistakes like this and the constant problems they have had in the past (both with their setup and apparently with their colo) are the type of things that kill companies.

      --
      "Two things are infinite: the universe and human stupidity; and I'm not sure about the the universe." --Albert Einstein
    13. Re:Internap is *down*? by nicolaiplum · · Score: 1

      You can have one person controlling the power for a building.
      About 7 years ago, a company I used to work for had one of their major datacenters in a room in Telehouse Docklands (in London, in the UK). It's a big building and it also hosts the London Internet Neutral eXchange. One day half the building lost power. An electrical technician in the basement turned the wrong switch and then fainted when he realised what he had done. This meant it took a while to turn the power back on, because you don't just walk up and touch an unconscious person next to an anomalous electrical power situation.
      We had in-room UPS but they had to be slaved to the main power switch so we couldn't have an independent electrical fire in the room, so they went off when the power was switched off.
      Most of our servers recovered OK, a couple did not.

      As for it taking some time to get the servers back online, databases always suck for that. None of them seem to start up reliably all the time without DBA intervention; not PostgreSQL, not Oracle, not Sybase. They all have a tendency to eat their innards every so often after an unplanned shutdown.

      --
      "For a successful technology, reality must take precedence over public relations, for Nature cannot be fooled"
    14. Re:Internap is *down*? by JKarp · · Score: 1
      Mod parent down clueless.

      Fire the facilities manager who actually know how the co-lo runs and bring in a clueless dolt. Presto. You're a faulty breaker and a couple of bypass lever pulls from going dark. Qwest has used this recipe for darkness several times in their co-los. My team spent 36 hours bringing up 200 servers and numerous database clusters (and lost 100% of bonus that year from the SLA violation) - I feel these guys pain...

    15. Re:Internap is *down*? by slavemowgli · · Score: 3, Insightful

      That's debatable, and depending on what database you use, having more than one database server (or pool of database servers) in different physical locations that are kept in sync at all times is definitely possible. I'm not sure whether MySQL allows this, but I think if you have a site that has nearly 6 million users, more than 100000 of which are paying you for the service you provide (I'm one of those, one might add), then you really should look into doing just that - or at least I hope the LJ people will do now (I don't really want to blame them for the problem).

      That being said, I think you didn't quite understand what I was trying to say. I really don't care whether they have "plenty of backup power", "plenty of generator capacity" and "top-of-the-line big datacenter grade stuff" (which really sounds more like a collection of buzzwords than anything else, anyway). If a wiring fault (of whatever kind) can bring up the entire UPS system as well as the "generator capacity behind that" and all other safeguards they supposedly had in place as well, then it's just worthless and a waste of money - a UPS is supposed to be an *uninterrupted* power supply.

      And while I admit that it's not possible to guard against *all* problems, saying that the colo facility is "one of the most solid in the state" and supposedly can't be taken offline by something "short of a direct strike from a comet" is just silly when a "wiring failure" can bring down the whole thing, and even more so when it's not the first time that happens.

      Really, this just stinks of an attitude that's all too prevalent in parts of the IT industry - just piecing together the components of a reliable system won't necessarily give you one, and if you can't build one properly, then don't go advertising that you have one. Don't you think the fact that the LJ people are now planning to buy their own UPS equipment to use on top of the facility's should tell you something?

      Oh, and regarding six nines of uptime - I don't think you actually realize for how little downtime that actually would allow. It's about 30 seconds per year, and Livejournal has been down for at least 16 hours, which corresponds to an uptime of about 99.8% - only two nines left. They probably (hopefully!) won't fall down to one, but things are bad enough as it is, and I, at least, fully blame Internap for that (and, again, I'm a paying user on LJ, so I reserve the right to do just that. ^_~)

      --
      quidquid latine dictum sit altum videtur.
    16. Re:Internap is *down*? by slavemowgli · · Score: 1

      Oh, yeah, it *can* happen, but it shouldn't - and in particular, it shouldn't happen more than once.

      --
      quidquid latine dictum sit altum videtur.
    17. Re:Internap is *down*? by jafiwam · · Score: 1

      Heh that event took a bunch of my (ISP part time job) customers offline too.

      You should have seen the look on my boss' face when he got off the phone ofter hearing that.

      It was great fun explaining it to the customers:

      "I guess the fiber line was shot, they are re'splicing it"

      "No, it was tacked down properly, it was shot."

      "No, they normally don't wear out. It was SHOT. BANG BANG bullet SHOT."

      They believe ANYTHING we say about why we are down now. :)

    18. Re:Internap is *down*? by Anonymous Coward · · Score: 0
      > Sorry, but one man can't control power to an entire co-location. I used to work for a local telecom and one day our fiber went down for about 1/3 of our customers. The reason? Some guy shootin' squirrels blew the fiber lines apart.

      Corrected as follows: One man can't control power to an entire colo, unless he has a squirrel gun.

      /your linemen want squirrel guns.

    19. Re:Internap is *down*? by ces · · Score: 1

      That's debatable, and depending on what database you use, having more than one database server (or pool of database servers) in different physical locations that are kept in sync at all times is definitely possible. I'm not sure whether MySQL allows this, but I think if you have a site that has nearly 6 million users, more than 100000 of which are paying you for the service you provide (I'm one of those, one might add), then you really should look into doing just that - or at least I hope the LJ people will do now (I don't really want to blame them for the problem).

      Even with Oracle or DB2 running database servers in multiple locations that need to stay in synch is a massive pain in the ass that you avoid if you can. Given that other problems are at least or more likely to cause your site to be unreachable to at least some customers for longer periods of time than problems with a decent co-location center it really isn't worth the bother for the majority of sites out there.

      Hell last time I checked Slashdot still had all of their servers in a single datacenter.

      For that matter I've seen times with my Verizon DSL line that I've been unable to reach Google, Yahoo, Amazon, Expedia or other internet sites with a high amount of redundancy including multiple datacenters for more than a day due to some problem on Verizon's network or with intermediate backbone.

      But I don't blame say Google for the problems because they really have done everything they reasonably can to make sure their stuff is up and as many people as possible can reach their servers at any given point in time.

      That being said, I think you didn't quite understand what I was trying to say. I really don't care whether they have "plenty of backup power", "plenty of generator capacity" and "top-of-the-line big datacenter grade stuff" (which really sounds more like a collection of buzzwords than anything else, anyway). If a wiring fault (of whatever kind) can bring up the entire UPS system as well as the "generator capacity behind that" and all other safeguards they supposedly had in place as well, then it's just worthless and a waste of money - a UPS is supposed to be an *uninterrupted* power supply.

      By their nature multiplely redundant power systems are a tricky thing that even experts fail to get right. Even then someone hitting the BRS by the door that is required by fire code can still take power to the whole datacenter down. InterNAP is in the process of expanding the datacenter in question which means there are contractors running around all over the place. It is quite likely that someone miswired a new UPS. Shit happens, even to the datacenters of major banks, stock exchanges, or airline reservation systems.

      And while I admit that it's not possible to guard against *all* problems, saying that the colo facility is "one of the most solid in the state" and supposedly can't be taken offline by something "short of a direct strike from a comet" is just silly when a "wiring failure" can bring down the whole thing, and even more so when it's not the first time that happens.

      Really, this just stinks of an attitude that's all too prevalent in parts of the IT industry - just piecing together the components of a reliable system won't necessarily give you one, and if you can't build one properly, then don't go advertising that you have one. Don't you think the fact that the LJ people are now planning to buy their own UPS equipment to use on top of the facility's should tell you something?


      The Fisher Plaza datacenter really is one of the best in the state. It is far from "pieced together". The InterNAP people aren't a bunch of clowns and do acutally know what they are doing (or at least as much as any other top-tier colocation vendor).

      Especially in the cost vs. quality tradeoff InterNAP does really well which is why a fair number of companies around here buy rack space from them.

      I do find it somewhat amusing that InterNAP's first datacenter, which really is

      --
      Happy Fun Ball is for external use only.
  12. emo by Anonymous Coward · · Score: 1, Funny

    Man I am sooo putting this in my LiveJournal!

    1. Re:emo by OAB_X · · Score: 1

      omg, like noone like so totally omg uses the whole like actual name of LiveJournal to like omg describe it. Its just lj

  13. Damn you Six Apart! by rolocroz · · Score: 1

    Why did you have to go and cause a power outage?

    --

    I meta-mod all positive moderation Unfair, because it's abuse of the system.

  14. What a cock by realdpk · · Score: 5, Insightful

    "Perhaps Six Apart wasn't quite prepared for the responsibilities of a website of this size?"

    Perhaps shit happens, and a blog service doesn't warrant the necessary investment to survive whatever caused this outage?

    1. Re:What a cock by Anonymous Coward · · Score: 0

      What do six apart, flickr, and technorati have in common? (flickr and technorati are mentioned in the last /. blurb before this one) They are all either run or were initially funded by Joi Ito, one of Japan's top CEOs, who is on the board of ICANN. Trust me, they are more than prepared.

    2. Re:What a cock by gl4ss · · Score: 1

      well they did(they thought they had prepared for this and bought reassurance for that).
      but internap didn't deliver.

      read the site.. they were on redundant power.. which turned out to not be that redundant after all(only possible explanation really is a major cockup by someone...).

      --
      world was created 5 seconds before this post as it is.
    3. Re:What a cock by qcubed · · Score: 2, Insightful

      i'm sorry, how exactly does this reflect poorly on sixapart?

      THIS doesn't reflect poorly on them. their licensing scheme for movabletype does.

    4. Re:What a cock by rolocroz · · Score: 1

      I'm sorry, how the hell does the MovableType licensing scheme reflect poorly on Six Apart?

      --

      I meta-mod all positive moderation Unfair, because it's abuse of the system.

    5. Re:What a cock by aliens · · Score: 1

      Hate to say it but I have to agree. Anyone who has dealt with trying to manage a large site should never poo-poo the efforts of someone else. It's childish, and unprofessional. But hey it's slashdot, not an upscale establishment.

      The only hope is if he has some sort of vendetta against someone at the company, like they screwed his sister or something.

      To all admins every where I wish you the best of luck in keeping your networks up and running (except you spammers you go rot in hell)

      --
      -- taking over the world, we are.
    6. Re:What a cock by Cramer · · Score: 2, Interesting

      Power can only be so redundant. More than once, the entire server room at a previous employer went dark. Multiple circuits and multiple power supplies won't do any good when a battery in the UPS explodes and blows the main (150A) breaker -- the entire floor went dark... disconnected from UPS, generator, and utility.

      (Once from a battery failure, and once again from a phase mismatch coming off generator that no one caught before the UPS was drained -- the alarm panel was outside the break room and no one that knew what it was walked past it.)

    7. Re:What a cock by bob+beta · · Score: 1

      What's the point in multiple circuits and multiple power supplies and only one UPS?

    8. Re:What a cock by Cramer · · Score: 1

      Marketing and appearance. UPSes ain't cheap or small.

    9. Re:What a cock by gl4ss · · Score: 1

      a fresher update on the site says that they're going to buy a bunch of ups's and tie them into the racks as they obviously can't trust internap to not fuck again. and that it wasn't the first time either..

      makes you wonder if they're thinking of re-locating...

      --
      world was created 5 seconds before this post as it is.
    10. Re:What a cock by 1lus10n · · Score: 1

      What they should do is have half at one co-lo and the other half at another co-lo. For the ammount of customers they have (the people who pay for their premiere accounts, and buy LJ merchandise) they should be able to afford a decent setup and at least one or two decent Linux admins. I really think they have the developers trying to do the admin stuff as well .... which leads to decisions like "lets put them all in one place".

      Perhaps they should bite the bullet and hire some people with prior experience running large setups (perhaps somebody from google is looking for a change in scenery ?).

      --
      "Two things are infinite: the universe and human stupidity; and I'm not sure about the the universe." --Albert Einstein
    11. Re:What a cock by casparianaremi · · Score: 2, Insightful

      I was prepared for my friends page on LJ to be full of "It's SixApart's fault" but didn't expect to see it here! Six Apart bought the company but have made no changes, the servers would be down whether they'd done it or not, so to claim it's their fault is just pretty dumb, IMO.

    12. Re:What a cock by qcubed · · Score: 1

      actually, it doesn't really. =p

      i just don't like it, is all. the original point remains, though, which is that the livejournal servers going down doesn't reflect at all upon six apart.

    13. Re:What a cock by mlefevre · · Score: 2, Insightful

      "half at one co-lo and the other half at another co-lo"

      Then they'd either need multi-gigabit bandwidth between the two co-los (which would probably cost for a week what they make per year), or they'd have to make separate, semi-independent communities. Google's servers don't stay in sync - you get different results according to which servers you hit, which isn't something you can do with "live" journals.

  15. Internap is up by Cooler+Heads · · Score: 1

    Internap.com is still up, they aren't stupid enough to use their own servers.

    1. Re:Internap is up by LinuxHeadMN · · Score: 0

      The only data center which went down was their Seattle/Fisher Plaza Colocation. Most of Internap's main servers are housed in Atlanta, where they are based.

      There aren't alot of details coming out of internap, but their NOC lines are fairly busy at the moment.

    2. Re:Internap is up by CypherXero · · Score: 1

      Web Servers and Data Servers are most often seperate. Like the saying "Don't keep all of your eggs in one basket". This helps to manage your servers, because if you main site went offline, then your data servers would die as well, and visa versa.

    3. Re:Internap is up by Anonymous Coward · · Score: 0

      And so are their customers (at least, their customers as Internap listed them).

      Gateway, Speakeasy, Union Bank of California, and Ticketmaster: all up.

      Did anyone tell them that Internap was down? Or is it that LiveJournal didn't have the proper disaster recovery in place in order to recover from this?

      Linux has the capabilities to restore 100 servers in no time flat: Red Hat has Kickstart; SUSE has AutoYAST, and there are others.

      Windows has auto-rollout (though I don't know what).

      And then there are clusters..... sigh.

      Sounds like LiveJournal wasn't ready for this.

    4. Re:Internap is up by xrayspx · · Score: 1

      Internap has many datacenters, they are most likely globally load balanced between them, or at least fully redundant. For instance, all indications are that this is confined to Internap Seattle. That won't affect Internap in Boston, or Atlanta or wherever.

    5. Re:Internap is up by Anonymous Coward · · Score: 0

      yap, just the Seattle Fisher facility. Power outage on the whole deal for about 30-45 mins. Sux hard, I feel sorry for all the folks there. Internap is known for proactive SLA stuff. Meaning you don't have to jump through hoops or request them, they are automatic. So atleast there's that. But yah, sux the power and ups croaked.

    6. Re:Internap is up by xrayspx · · Score: 1

      Dammit, I stand corrected, sort ot. Internap.com is up, true, it's being served out of Atlanta from the looks of things. However, Internap.net is in Seattle (ONLY???) and is down in a browser.

      Tools.

    7. Re:Internap is up by Anonymous Coward · · Score: 0
      Linux has the capabilities to restore 100 servers in no time flat: Red Hat has Kickstart; SUSE has AutoYAST, and there are others.

      Why the fuck would they need to re-install all their servers after a power outage? Are you retarded?

    8. Re:Internap is up by markana · · Score: 1

      The preliminary report I saw was a failure in one of the UPSs. I'm going to spend a few hours this weekend compensating for a network card that fried when one of our servers came back up.

      Another argument for geographically-dispersed colos. sigh.

  16. i don't get it by dazedyugo · · Score: 2, Funny

    so it's deadjournal now ?

    1. Re:i don't get it by Anonymous Coward · · Score: 0

      No, deadjournal is the even whinier version of Livejournal.

      http://www.deadjournal.com/

    2. Re:i don't get it by TupperTrenine · · Score: 1

      nope, that's a different service :P

  17. OMG by Anonymous Coward · · Score: 0

    How will the world find out how it sucks to be a freshman in Podunktown High School and how Angela is such a slut for making out with that guy in History class and how my parents are total dimwits when it comes to gangsta rap?

    1. Re:OMG by Neduz · · Score: 1

      Just leave your computer for a minute, go outside and socialize with some people. You'll find out that the shit you mentioned also exists outside the Net.

      --
      This is one lame signature, please read the message above instead.
    2. Re:OMG by BosstonesOwn · · Score: 0

      By downloading the txt version on p2p ? Which would then the user would be promptly sued by the RIAA sighting the use of letters in tandom to actually create a sentence is property of the RIAA.

      --
      This package Does Not Contain a Winner
  18. sounds like good news by freakybob · · Score: 4, Funny

    Well now the millions (?) of users might actually have something to write about when the servers are back up. "Today I went outside. My pupils have never been tinier..."

    1. Re:sounds like good news by spamdog · · Score: 1

      Welcome to my friends list. =)

  19. Was that really called for? by Anonymous Coward · · Score: 3, Insightful

    Perhaps Six Apart wasn't quite prepared for the responsibilities of a website of this size?

    Ok, I understand that you don't like Six Apart; I'm no fan of their new licensing scheme either. However, I really doubt that SixApart has any control over any power failures that might occur at Internap.

    1. Re:Was that really called for? by TheGavster · · Score: 1

      Indeed. It doesn't even sound like they changed the hosting; just a new guy paying the bill.

      --
      "Because Science" is one step from "Because old book". Try "Because of my experiment testing my falsifiable assertion".
    2. Re:Was that really called for? by BosstonesOwn · · Score: 0

      It was microsoft. Billy Gates pulled the plug after throwing a tempertantrum because they refused to load windows 2003 on thier servers ? Anything else we can blame on them ? :)

      --
      This package Does Not Contain a Winner
    3. Re:Was that really called for? by Anonymous Coward · · Score: 0

      'Twas Darl McBride, activating all those back-doors that the terrorists submitted and Linux (citizen of communist finland) gladly accepted.

    4. Re:Was that really called for? by Peter+Cooper · · Score: 2, Informative

      To be honest, a deal was announced what.. a week ago? I seriously doubt Six Apart has control over anything at this point.

  20. Re:GOOD! by Starji · · Score: 1

    What a sad, lonely, despicable person you must be.

  21. ANGST!!! by Anonymous Coward · · Score: 2, Funny

    Where will I write about my depression over this event?

    Oh. Slashdot.

  22. Coffee by Rie+Beam · · Score: 0

    "Our data center (Internap) lost all its power, including redundant backup power, for some unknown reason. (unknown to me, at least)"

    Coffee is a hell of a drink.

    1. Re:Coffee by Anonymous Coward · · Score: 0

      good 1 !

  23. Re:GOOD! by Anonymous Coward · · Score: 0

    That's too bad. [sarcasam]I really enjoyed the great editorial features![/sarcasam]

    There are other hobbies in this world besides complaining about something you can't control.

  24. That's what you get by digitalgimpus · · Score: 1, Funny

    That's what you get when you hire Tim Allen as your electrician at a Data Center.

    Al Borland was nailing Heidi behind the stage when the outage occured.

    Where were the APC backups?

  25. Internet smarter? Hardly. by Anonymous Coward · · Score: 0

    No, because there are still such things as chat rooms and forums.

  26. What? by Anonymous Coward · · Score: 0

    Perhaps Six Apart wasn't quite prepared for the responsibilities of a website of this size?

    Perhaps the submitter doesn't have a clue what the fuck he is talking about? Their hosts suffered a complete power failure. How the hell do you protect against that, short of buying out the data centre company and running it yourself?

    1. Re:What? by BosstonesOwn · · Score: 0
      Ah I always thought there was redundant power backups for just such an occasion ? Maybe the systems monitoring those same generators were running an old version of windows and bsod'ed.

      Wow talk about a bsod.

      --
      This package Does Not Contain a Winner
    2. Re:What? by Anonymous Coward · · Score: 1, Informative

      Ah I always thought there was redundant power backups for just such an occasion ?

      No doubt that something went wrong there, but that doesn't change the fact that it's the data centre's responsibility to supply power, so only a complete moron would suggest SixApart were to blame.

  27. Please, Please, Please! by La+Camiseta · · Score: 3, Informative

    Use the Coralized link. No sense in crashing their status page. Plust it'll respond a lot quicker than loading the actual web page.

    1. Re:Please, Please, Please! by moosesocks · · Score: 1

      The point of the LiveJournal status page is that it's hosted at a completely different facility by a different provider.

      If it goes down, the rest of LJ is uneffected

      --
      -- If you try to fail and succeed, which have you done? - Uli's moose
    2. Re:Please, Please, Please! by Anonymous Coward · · Score: 0

      a livejournal girlie! i knew it!

    3. Re:Please, Please, Please! by coyotecult · · Score: 1

      LJ doesn't really get the Slashdot effect; they get way more traffic than we can give them anyway. I doubt we'll crash the sucker -- especially considering how curt of a page it is.

    4. Re:Please, Please, Please! by La+Camiseta · · Score: 1

      When this first got posted, the status page was running horribly slowly. I guess that one server still has trouble handling the load that a bunch of Slashdot users can put onto it, even if the content is static.

    5. Re:Please, Please, Please! by coyotecult · · Score: 1

      Yes, but is that from Slashdot or a million deprived LJ users obssessively hitting reload?

    6. Re:Please, Please, Please! by jonabbey · · Score: 1

      I guarantee you that it's from LJ users, not Slashdot users. Among my circle of acquaintances, the LJ/Slashdot ratio is about 20-1.

  28. Not their fault by tektek · · Score: 1

    It was an on-site power failure -- I don't see how you can blame them (new owners) on that...

  29. denial of service by spitstatic · · Score: 0, Redundant

    http://www.google.com/search?q=livejournal%20outag e&sourceid=mozilla2&ie=utf-8&oe=utf-8 lj_maintenance: Livejournal Outage ... Livejournal Outage LiveJournal is currently under a Distributed Denial of Service attack, and has been since about 5:30pm PST (1:30 AM GMT) tonight. ... www.livejournal.com/go.bml?journal=lj_ maintenance&itemid=55410&dir=next - 11k - Cached - Similar pages lj_maintenance: LiveJournal Outage Update ... LiveJournal Outage Update The Distributed Denial of Service attack that began on Thursday has not subsided. We were able to make ... www.livejournal.com/community/lj_maintenance/55947 .html - 10k - Cached - Similar pages [ More results from www.livejournal.com ]

    --
    This is my signature.
    1. Re:denial of service by Anonymous Coward · · Score: 0

      Note the date on that page.
      More specifically, note the year on it.
      Yeah.

    2. Re:denial of service by spitstatic · · Score: 0

      I don't see a date... I read it three times now.

      --
      This is my signature.
    3. Re:denial of service by rel4x · · Score: 1

      While I'm not exactly a security expert, I feel pretty confident saying that the DDOS attack was probably completely unrelated to the power outage...sorry SpitStatic(yes, I captialized your name so it looks better).

      --

      Before you mod me funny, think, perhaps I was insightfully funny?
    4. Re:denial of service by spitstatic · · Score: 0

      I don't know much about DDOS attacks - I just googled "Livejournal Outage" - and that's what I saw. You can't blame me for thinking it was the case.

      --
      This is my signature.
    5. Re:denial of service by Anonymous Coward · · Score: 0

      The google cache of the page you mention (top google link for "livejournal outage"):
      cache
      Quoted below, but to save you the time that dos was nearly two years ago:

      Swiss Army Spork ([info]lisa) wrote in [info]lj_maintenance,
      @ 2003-02-20 21:01:00
      Previous Entry Add to memories! Next Entry
      LiveJournal Outage Update
      The Distributed Denial of Service attack that began on Thursday has not subsided. We were able to make some large improvements in our load balancing system today that allowed us to remove the filters placed for groups of IP addresses at our upstream providers. This means most of the internet should be allowed to at least reach our site, and any that can not we are still working with our provider to allow.

      As we recover from the attack, the site will continue to be slow. We are working as hard as possible to speed things up, and thank you for your continued patience.

    6. Re:denial of service by spitstatic · · Score: 0

      I apologize. Here I thought I was helping. :/

      --
      This is my signature.
    7. Re:denial of service by Xoder · · Score: 1

      That happened like 6mo ago.

      Can you tell I read status.livejournal.org too often?

      --
      The previous sig has been removed due to /. protecting your best interests
    8. Re:denial of service by FLEB · · Score: 1

      For further reference, you'd probably want to use Google News, or just check reliable sources. I'm pretty sure Google only updates their indexes weekly on... is it Tuesdays or Thursdays? I forget.

      --
      Information wants to be free.
      Entertainment wants to be paid.
      You just want to be cheap.
    9. Re:denial of service by spitstatic · · Score: 0

      Yeah, I gathered that already. Thanks for the redundancy.

      --
      This is my signature.
    10. Re:denial of service by spitstatic · · Score: 0

      Yeah - I definitely won't jump to try to offer any possible info... I'll let the slashdotters handle it. :)

      --
      This is my signature.
    11. Re:denial of service by Xoder · · Score: 1

      Happy to be of service, happily.

      -Xoder, Minister of the Department of Redundancy Department.

      --
      The previous sig has been removed due to /. protecting your best interests
  30. Bad IDea. by barryman_5000 · · Score: 1

    I think it was a bad idea to have a site slashdotted while its down . . . . it shouldn't be able to stand a chance. No really, I wish they would have waited a little while. Now the admins are wondering why they suddenly are getting 200,000 hits.

    1. Re:Bad IDea. by nyquist · · Score: 0

      Wouldn't you think slashdot traffic is a drop in the bucket, compared to the regular LJ user traffic?

    2. Re:Bad IDea. by barryman_5000 · · Score: 1

      hmm . . . well imagine all those people constantly hitting F5 . . . and waves of slashdotters on top of that . . . lol, its so evil!

    3. Re:Bad IDea. by jamie · · Score: 2, Informative
      It is. Slashdot gets about 1/10th the pageviews of LJ.

      The Slashdot effect is more visible because we send all our readers to one place at the same time, while LJ is highly distributed.

    4. Re:Bad IDea. by thephotoman · · Score: 1

      It's like an HP RPG (with lots of underage wizard sex)...only less erotic.

      --
      Haec merda tauri est. Ceterum censeo Carthaginem esse delendam.
  31. It can happen by adeydas · · Score: 1

    Well the power outage was not a person's or organisation's fault, it just happened. I wonder what Danga would have done that Six Apart is not doing to bring the servers back online. By the way, don't they have diesel generators for backup?!

    1. Re:It can happen by TeraCo · · Score: 1

      Their backup power supplies didn't kick in, causing the whole problem in the first place. An article reader is not you. [And in this case, the article was just a few lines of text typed in while they break their back getting the regular servers back up.]

      --
      Not Meta-modding due to apathy.
    2. Re:It can happen by Lehk228 · · Score: 1

      not having enough battery backup available to keep the machines running untill the generators can be switched on is their fault, as well as the previous owners fault.

      --
      Snowden and Manning are heroes.
    3. Re:It can happen by Kiryat+Malachi · · Score: 1

      -"the current owners" -"the previous owners"

      +"the data center"

      You do realize that they don't own Internap, and that the *data center* died, right? They (Danga/Six Apart) probably were paying for battery backup and generator backup. It is not even *close* to their fault that the co-lo center didn't have such things, or had them improperly set up.

      --

      ---
      Mod me down, you fucking twits. Go ahead. I dare you.
      (I read with sigs off.)
    4. Re:It can happen by Lehk228 · · Score: 1

      they should have investigated the setup better, if I was in charge of ordering or maintaining 100+ co-lo'd servers I sure as hell would make sure the co-lo center knew what they were doing.

      --
      Snowden and Manning are heroes.
    5. Re:It can happen by KFK2 · · Score: 1

      From what I understand, the Building UPS "blew" up (as somebody I know that works there said) and the fire alarm shut the entire building down.. so it's not their fault.. the reason it's taking forever to come up, is they have tons of data, and they want to be sure it's all not corrupt..
      I know if I were paying for my 100+ servers to be hosted with a building UPS, I wouldn't bother to have UPSs for every server.

      in other words.. OMG LJ is DOWN.. WTF am I going to do ?!?one?!?eleven!?!

    6. Re:It can happen by Lehk228 · · Score: 1

      a Building UPS is perfectly fine, and if what you say is correct than it really wasn't anyones fault as redundant UPS's for something like that would be somewhat excessive

      --
      Snowden and Manning are heroes.
    7. Re:It can happen by isometrick · · Score: 0, Troll

      There's an oft used, almost cliche saying that applies here: "Easier said than done." How the hell are you, the sysadmin, supposed to go around checking out the power redundancy of a huge, well-known colocation facility? That is what THEY are supposed to do, and NO facility will let you sample the diesel generator when you see fit. Of course, it's really easy to sit on the sidelines and yell at the players. You are an idiot. Go back to your parents' basement and "make sure as hell" that your head is screwed on before you feel like commenting again.

    8. Re:It can happen by Anonymous Coward · · Score: 0

      Danga would have done exactly the fucking same.
      Because Danga is still doing exactly the fucking same.

      Did you even _read_ the announcement about the sale?

    9. Re:It can happen by Cramer · · Score: 1

      "THESE THINGS HAPPEN" -- however rare. Short of installing your own UPSes in your co-lo racks, which many co-lo's either frown upon or strongly discourage, you are at the mercy of the facilities which means their utility power, generator(s), and UPS(es).

      I'd trust Internap to know how to power a co-lo facility. This is just one of those events you look at and shake your head. It's too rare an event to take steps to avoid it (like integrated system power supply and UPS, which I don't think exist anymore.)

    10. Re:It can happen by Anonymous Coward · · Score: 0
      Short of installing your own UPSes in your co-lo racks

      Even if you have your own ups it doesn't matter because their herd of cisco 6500's and Junipers, etc are dead in the water because their UPS's are down.

      So your little servers would be up, crying out
      Help me!
      Heeeelllpppp mmmmeeee!
      and no one would hear them...

      Yes I realize it may be better for you DB servers, but obviously it is just a bad situation.
    11. Re:It can happen by deathazre · · Score: 1

      ... battery... backup?

      I'll take a flywheel UPS over a battery one any day. They take up less space, you don't have to have a dedicated AC unit for the room, and they actually filter the power.

      (disclaimer: I work for a Caterpillar dealer, or rather will be when I get back out of school)

      --
      Karma: Negative (Mostly affected by dorm trolling)
    12. Re:It can happen by dtdns · · Score: 1

      Short of installing your own UPSes in your co-lo racks, which many co-lo's either frown upon or strongly discourage...

      Or disallow in their terms of service. My understanding is that the reason for this is in case of electrical fires. If a fire breaks out and their gas-based suppression system cannot immediately contain it, one of the first things that will happen is they will cut off all power to the racks. If someone has their own UPS in the rack, the facility cannot do an emergency shutdown. If a server connected to that UPS is the source of the fire, then they might have a VERY hard time putting it out.

      At least, that's how it was explained to me by one of the facilities we looked at for our servers.

      Also on the subject, I heard (but have not been able to confirm) that Hi-Velocity, a small hosting/co-lo facility in Tampa was without power for over 24 hours due to a transformer problem in their building. They do have battery backup, and claim to have a generator, but some people I know who have servers there were out for over a day.

      And to make matters worse, Slashdot is giving me the classic 503 when I tried posting this the first few times...

    13. Re:It can happen by Anonymous Coward · · Score: 0

      This is the second time that Internap has had a total power outage in the time that LiveJournal's been there... and LiveJournal's only been there for a few years.

    14. Re:It can happen by Anonymous Coward · · Score: 0

      Read the update page(in all fairness, this was posted six minutes after your post): "We're going to be buying a bunch of rack-mount UPS units on Monday so this doesn't happen again." If they had done that in the first place this wouldn't have been nearly as big of a problem. They don't have much control over the data center dying, but there are things they can do to make sure that if the data center dies that their equipment shuts down cleanly so it can be brought back up as soon as possible.

  32. A disturbance I feel by philkerr · · Score: 4, Funny

    I feel a great disturbance in the force..... It's as if a million bloggers cried out all at once..... and became silent.

    1. Re:A disturbance I feel by Anonymous Coward · · Score: 0

      Damn you, now I will have to root for the emperor!

    2. Re:A disturbance I feel by FEEBLE*BMX · · Score: 1

      Way to repeat some one else's joke, poorly, and still get modded up for it! Congrats.

    3. Re:A disturbance I feel by Anonymous Coward · · Score: 0

      Hey dickhead - look at the post time before you decide to look like an ass.

  33. Tomorrow's news by Anonymous Coward · · Score: 2, Funny

    The population of depessed pre-teens has just dropped by 20%

    1. Re:Tomorrow's news by Miss_Saturnine · · Score: 2, Funny

      ...in other news, Bic announces all-time high profits in sales of disposable razors...

    2. Re:Tomorrow's news by lewp · · Score: 1

      Dare to dream.

      --
      Game... blouses.
  34. I wonder by TCM · · Score: 1

    are servers with LOM (lights out management) superior in this case?

    --
    Of course it runs NetBSD. BTC: 1NT7QvbetmANwaMzhpVL6
    1. Re:I wonder by htmlboy · · Score: 1

      lom requires power to operate, but once power's restored, it can be used to power on machines remotely.

    2. Re:I wonder by TCM · · Score: 1

      No shit, Sherlock?

      --
      Of course it runs NetBSD. BTC: 1NT7QvbetmANwaMzhpVL6
    3. Re:I wonder by htmlboy · · Score: 1

      i was wondering if you were asking for a clarification on which lights can be off, or if it was just another case of a slashdot user being unable to communicate via text.

      thank you for clearing that up, and i do apologize for giving you the benefit of the doubt.

  35. Melodramatic by CypherXero · · Score: 2, Funny

    It's not like most LiveJournal user's have enough to worry about, here's something for most LJ users to get melodramatic about. I'm serious, randomly pick 5 LiveJournal blogs, and I guarantee 4 out of 5 are going to be "Fuck the World" posts.

    1. Re:Melodramatic by whiteranger99x · · Score: 1

      It's not like most LiveJournal user's have enough to worry about, here's something for most LJ users to get melodramatic about. I'm serious, randomly pick 5 LiveJournal blogs, and I guarantee 4 out of 5 are going to be "Fuck the World" posts.

      True, but let me expand this out a bit.

      1 will say "fuck the world"
      1 will say "fuck the world" because they got dumped by their bf/gf for being such a whine ass and figure out why
      1 will say "OMFGWTFBBQ FuK Da WorLD!!"
      1 will say "world the fuck" while whining about being dyslexic
      1 will say "Fuck the world, now i will cut myself to release the pains of the world"

      --
      Join the TWIT army now!
    2. Re:Melodramatic by Grendel+Drago · · Score: 1

      Indeed. I was unaware of the true degree of wankery on LiveJournal until I discovered the 'otherotherkin' community, which exists solely to mock the 'otherkin' community. You know, people who "identify" as dragons, pixies, humans who will some day actually get laid, elves or magical goats.

      And though I don't like kids, it's sobering to see the level of discussion on the 'childfree' community, where it's frequently "I saw a child today, and did not tear off its face and eat it. Why am I so tolerant? They don't deserve my grace."

      Or, you can read six thousand copies of the following message on 'sextips': "OMG I WAS PLAYIN WIT MY BF AN HE CAME IN MY B-BUTTON AM I PREGGERZZ?"

      Ah, good times. At least there are the porn communities.

      --grendel drago

      --
      Laws do not persuade just because they threaten. --Seneca
    3. Re:Melodramatic by CamTarn · · Score: 1

      LJ is remarkably like the Internet, in fact: there may be many gems, but there's a hell of a lot of shit.

  36. You can just imagine it... by dysprosia · · Score: 1

    Mood: anxious...

  37. A Good Laugh by loid_void · · Score: 1

    Check out this page on the Iternap site for a real laugh. The flash page is a real hoot too.

    --
    Anyone seen my jagged little pill?
    1. Re:A Good Laugh by mtrisk · · Score: 1

      Well since it's full of buzzwords, including one I personally loathe, "information superhighway," it seems to be marketed at upper-management positions. But of course...you don't think people with any technical knowledge make descisions on a colo facility do you? They'll see that Internap has more buzzwords, promises more through huge vagaries (I love the part about traffic jams on the information superhighway!) and a nice little flashy presentation, and they get picked over a facility with a more technical overview. It's called marketing, my friend. Know your audience.

      --

      Without a proper flamewar, Anonymous was undecided on what shell to run.
  38. Actual Link by AdamJ · · Score: 1

    LiveJournal's offsite status page is status.livejournal.org.

    1. Re:Actual Link by mlk · · Score: 1

      LiveJournal's offsite status page is Slashdotted :)

      --
      Wow, I should not post when knackered.
  39. There's a logical expanation... by Akki · · Score: 0, Offtopic

    It's terrorists, obviously.

  40. OMFG! I'm totally blogging this! by Anonymous Coward · · Score: 0

    OMG OMG OMG!

    Gonna tell my LJ friends! They're going to FREAK!

    OMG OMG!!!!!!! WAIT LOL!!!! How do you blog that the blog is down when the *blog is down*? This sucks! What do I do! OMGLOLWTF!

    I"m going to take a picture of my cat and put it in flikr like, ten times. LOL OMG !!

  41. Update #1 by Captbaritone · · Score: 1

    "Update #1, 7:35 pm PST: we're up on 'dirty' power for now (it works, but it's unreliable), and we're working to assess the state of the databases. The worst thing we could do right now is rush the site up in an unreliable state. We're checking all the hardware and data, making sure everything's consistent. Where it's not, we'll be restoring from recent backups and replaying all the changes since that time, to get to the current point in time, but in good shape. We'll be providing more technical details later, for those curious, on the power failure (when we learn more), the database details, and the recovery process. For now, please be patient. We'll be working all weekend on this if we have to."

    --
    - Captbaritone
  42. When finally unleashed... by Anonymous Coward · · Score: 0

    ...the collective blogger angst should be a sight to see.

  43. Can't get the staff? by Psychotext · · Score: 0, Flamebait

    Those fools!

    Didn't they see the option in the BIOS clearly marked as "Restart after power failure"? You just can't get the staff these days! * rolls eyes *

    --
    People that believe in their opinions don't post AC.
    1. Re:Can't get the staff? by Psychotext · · Score: 1

      For the humour impaired... that was a joke. * sigh *

      --
      People that believe in their opinions don't post AC.
  44. poor internap by Indy1 · · Score: 2, Interesting

    sounds like all the fucking spammers they host overtaxed spammer-nap's power resources and brought it all down.

    Seriously though, spammer-nap is a massive spam haus, see for yourself

    --
    Lawyers, MBA's, RIAA? A jedi fears not these things!
    1. Re:poor internap by bigberk · · Score: 1

      That's interesting, I did not know that internap was harbouring so many spammers... check out those huge /24's in there, including Bluestream media. thanks for bringing this to my attention

    2. Re:poor internap by ces · · Score: 1

      Whatever are you on about? InteNAP isn't even in the top 10 spam hosting ISP's. They don't tolerate spammers on their network as a general rule.

      On the other hand they aren't going to boot a customer just because some crank complains about getting a commercial email from that customer. It does happen too. I've seen people sign up for a product newsletter. Confirm the subscription and then complain that the newsletter was spam to every single place they could possibly think of reporting it.

      For contrast compare them to MCI, XO, or Above.net. All of those providers are far more spam friendly.

      --
      Happy Fun Ball is for external use only.
    3. Re:poor internap by Indy1 · · Score: 1

      they host spammers and ignore all complaints. Roving.com is one of their worst scumbags.

      They wont boot a customer if the customer signed a pink contract. Spammer-nap is a nasty piece of filth, and i ended up firewalling all of their netspace as a result of them attacking my mail server. And if your curious, my slashdot email (which is a spamtrap of course) was one of the addresses getting hammered. So dont give me any bullshit about opting in.

      --
      Lawyers, MBA's, RIAA? A jedi fears not these things!
  45. Disclaimer: I am Not an Electrical Engineer by ebooher · · Score: 5, Informative

    I know nothing of how InterNap is set up. I just want to throw that out there ahead of time. Now, it's time for my patent pending "Bull Shit Theory of the Day."

    Ok, here is the rant. I used to work for a Colocation facility. Nothing special, small by Telco terms. The whole facility only had about 1500 cabinets. (Though I hear they are now full, and going to be expanding.)

    We had a main power draw off of the local grid. We had a backup power draw off of the *next* cities power grid. (ie, when all the offices around us went dark, we still had power.) And you don't even want to know the kind of red tape we had to go through for *that* pull. I'm still not sure how they did it. We had fly wheel kinetic electricity storage systems, battery backups, and a diesel engine from a train so large it had it's own building.

    We used to joke that if we lost power, we had more important things to worry about. And again, we were small time compared to some of the massiveness that is out there. *cough*AADS Chicago*cough*

    So I'm kind of in agreement with the statement currently on LiveJournal. It's unknown to me how any self respecting colo facility can say "We've had a power outage that also took our redundant systems."

    I have to call bullshit on that entire train of thought. If that's true then they don't *have* any redundant systems, and I'd be looking for a new provider. The most likely thing (at least in my mind) is that someone, somewhere got mad at something specific and decided to make a point by popping the main breaker to their portion of the facility.

    Oh, that was another thing, each room had several "main" breakers. It took a hell of a power surge to pop all of them, and the Liebert systems had power filters of some kind, really really big capacitors or something I think, so a surge really never made it to the other side anyway, it got stored in the cap and then trickled out like the rest of the power.

    But I was a UNIX admin, not the EE that was planning the power generation aspects of the facility. So take some of it with grains of what ever white powdered spice you prefer.

    --
    "Genius may shine aloof and alone, like a star, but goodness is social, and it takes two men and God to make a Brother."
    1. Re:Disclaimer: I am Not an Electrical Engineer by DrBlubGut · · Score: 2, Insightful

      Happens some days. A key breaker at a data center some of my friends work at went bad and took the whole floor with it. Generators didn't even get a shot at takeing the load because the breakage happend later in the circut. No matter how big and bad your infrastructure some points in the design will not be 100% resistent to all problems. We do our best, make plans, design good systems but the world teaches us Shit happens.

    2. Re:Disclaimer: I am Not an Electrical Engineer by CrankyFool · · Score: 4, Interesting

      Back in the great days of the .com boom, people were building their colo facilities to insane (in a beautiful way) standards. I remember touring Exodus and Above.net (I don't know who you're referring to, though I only ever heard of above.net adopting flywheels) and being just very amused at the cool stuff they were putting in place.

      I recently (~8 months back) did some contract work for a small company whose servers were based in some colo facility in San Francisco. One of the first things I noticed was a damn heavy UPS at the bottom of their rack. Weird, I thought -- why not rely on the colo's battery system?

      Because they don't have one.

      Mind you, this was also the colo that had a cardkey system that had long ago stopped being usable, so when you needed access you used a Radio Shack $29.99 wireless intercom system and someone would come to open the door, and when you checked in they carefully wrote your name on a little nametag.

      I think standards have slipped, significantly. In some respects, this is likely a good thing -- it means you have more options now, because you can choose either the super duper "we hook up to two countries' power grids, have eight flywheels and a direct feed from microwaves in orbit" or the "err, here's your cabinet. We'll give you decent power until we don't" options.

      So ... how much are all these people paying LiveJournal again? Couldn't they request some sort of partial refund of their monthly fee?

      Oh, wait...

    3. Re:Disclaimer: I am Not an Electrical Engineer by Ingolfke · · Score: 1

      Someone probably hit the big red switch on the wall, the one covered in a plastic case, the one right by doors, the one that turns on the fire supression system, but also looks like it might open the door. Either that or the cleaning crew was vacumming in the DC again.

    4. Re:Disclaimer: I am Not an Electrical Engineer by Anonymous Coward · · Score: 5, Informative

      My friend's company is hosted by internap. Today he messaged me when the power went down. It was only power to the second floor, my friend's servers, while cut off to the internet were still running (on the 3rd floor). Internap has redundancy and backup generators (and enough fuel onsite to run for 30 days without external power). Apparently there was construction occuring on the second floor... my guess is that some dipshit contractor cut through a power cable or 3 and took the whole floor down.

      To all the people accusing LJ of being stupid for not having UPS systems, Internap has 3 fully redundant power systems (yes, I know, didn't help much) so most people probably don't feel the need to run their own ups.

    5. Re:Disclaimer: I am Not an Electrical Engineer by RFC959 · · Score: 1

      I agree with you for the most part, but I think some of the other posters have good points in that it's hard to plan for and test for everything. I heard of a colo that once had a great setup - huge battery backups, diesel generators, days' worth of fuel, etc. And they tested it, too! Once a month or so, they'd switch over to diesel for a few minutes to be sure they could do it in an emergency. Then one day, they suffered a power failure, and switched over to diesel...and the generator ran for about five minutes, then coughed and died. Turned out their fuel had gotten contaminated or their fuel filter had clogged or some such. This is not to say that these are things you couldn't check for, just that it's very hard to be sure you've really thought of everything. (BTW, if anybody remembers where that story comes from, let me know...can't remember where I heard it.)

    6. Re:Disclaimer: I am Not an Electrical Engineer by Repugnant_Shit · · Score: 1

      The LiveJournal outage doesn't surprise, they probably don't have a large redundant power system.

      Now *financial* data centers, that's something to see. One company has the exact same building in three different locations. Half the building is a mirror of the other half. Each half has its own generators, UPS batteries, data center, etc.

      And for all the money they poured into these facilities things *still* aren't right. Although I bet with Internap/LiveJournal they didn't spend too much time and on redundant power systems, and hell, maybe they're batteries were old pieces of crap that failed immediately.

    7. Re:Disclaimer: I am Not an Electrical Engineer by Xoder · · Score: 2, Informative

      Actually, about a year ago, they had some months of bad performance and gave all paid members an additional 2mo (or so, I forget exactly) of paid member-level service, free of charge.

      --
      The previous sig has been removed due to /. protecting your best interests
    8. Re:Disclaimer: I am Not an Electrical Engineer by jgarzik · · Score: 0, Offtopic

      Somebody mod the parent up, please.

      This person actually seems to have real info (shocking, I know) about Internap.

    9. Re:Disclaimer: I am Not an Electrical Engineer by mizalaina · · Score: 2, Informative

      I work at a co-lo facility now. The problem is probably that what people call redundant power often isn't highly available, nor is usage distributed correctly across the primary and redundant circuits. If one half of your power fails and you've mis-used or overloaded your redundant circuit then the redundant circuit is going to fail when it can't take the load that gets switched over to it. This is a result of poor planning.

      Keep in mind that often people have back-up power that's not conditioned, which is what is indicated by LJ's message. If the power were redundant and both sides were through UPSes, there would be no dirty power at all. A lot of co-lo facilities go on the cheap and their back-up power is just another circuit from a different transformer or a different Hydro company. So think about it: if the grid, transformer or power switching infrastructure fails, and you only have one back-up generator that also fails, or your UPS batteries can't take the pressure, or any of two dozen other things, your power has gone bye-bye.

      My prediction (which we are already seeing at my job) is that power and cooling are the Next Big Problems for co-lo. With blade servers demanding 220V, 30A 3-phase power and pulling 8kVA in 6U of space, no data centre as currently designed will be able to handle that on the scale we're going to see develop in the next year or two. People assumed power and cooling were unlimited resources. We were wrong. Oops!

      BTW, if what LJ is saying is true, this has little to do with Six Apart or Danga. It's Internap's fault within that particular data centre. The sales engineers/technical consultants/whatever they're called at Internap should have thought about this and pushed for audits, but they probably didn't. I doubt Danga knew enough about the potential problem to make good decisions about it: they're just a customer and assumed that the power would work. It's an infrastructure thing, and while the customer should educate themselves, they often don't. It's why I bug my customers constantly with power audits and suggestions.

      Just something to think about. :)

    10. Re:Disclaimer: I am Not an Electrical Engineer by elmegil · · Score: 1
      I've seen UPS batteries fry. Nothing like unexpected smoke in the data center when the grid goes out. I've seen Diesel backup generators get water in them and be unstartable. I've heard stories of people who weren't paying attention to the fact that they'd overloaded their UPS capacity (for anything more than 15 minutes). Etc. etc.

      There are lots of plausible reasons why the backup power didn't work. The real question is, unless it was an act of God, why was InterNap not testing their backups regularly under controlled circumstances to mitigate the risks of these sorts of things happening?

      --
      7 November 2006: The day Americans realized corruption and incompetence weren't addressing 11 September 2001
    11. Re:Disclaimer: I am Not an Electrical Engineer by silas_moeckel · · Score: 1

      OK I am and EE and this is just the result of piss poor planing and execution. Redundany power is expensive. It has to go all the way to the computer (yup each one needs multiple supplies) and you need auto transfer switches everywhere. Problem with one RU servers is very very few of them have redundant PSU's.

      Redundancy is easy you just need N+1 of everything and transfer switches at every stage. (And thoustands to pay to eletricians to put it all in) These guys werent highly avalble they just had a piss poor A+B bus setup and the B bus was lacking :)

      --
      No sir I dont like it.
    12. Re:Disclaimer: I am Not an Electrical Engineer by simsong · · Score: 1

      Power and cooling have been issues for colo centers for at least 6 years. Google has engineers whose primary job is to figure out how much power and cooling they can get in a rack. And outside of colo centers, power and cooling have been issues for, well, forever.

      The big issue that colo centers are not dealing with is security between customers --- one customer taking down another customer's equipment. It's much easier than you might imagine.

      If it really is the case that there was construction going on at the colo center, then that would be in line with reports of other failures. Construction is one of the leading causes of downtime. Software upgrades is another one.

      --
      (Yes, I really am Simson Garfinkel)
    13. Re:Disclaimer: I am Not an Electrical Engineer by EvilMagnus · · Score: 1

      I think that may well have been Exodus in Waltham.

      --
      -EvilMagnus
    14. Re:Disclaimer: I am Not an Electrical Engineer by Anonymous Coward · · Score: 0

      I used to work for a company that co-located a fair number of our machines at an Exodus datacenter in Seattle. It had all the wizbang gizmos like lights in the floor and hand scanners and sliding star-trek doors and whatever else.

      We were promised quite a bit of power redundancy but Seattle (or maybe Washington) fire codes required that the center have a switch that could kill ALL power to the building in case of fire, that a firefighter could find in an emergency.

      Unfortunately that switch, encased in glass (like the emergency-stop buttons on escalators) was right next to the exit.

      Everything was great until one day a customer, unescorted for whatever reason, tried to leave the room. Unfortunately the power-off button wasn't clearly marked, nor was the exit button.

      So he hit the button.

      We lost power to the entire datacenter. ALL power. Including backups.

      So it is possible!

    15. Re:Disclaimer: I am Not an Electrical Engineer by sarahemm · · Score: 0

      Actually, IAANM (i am a noc monkey) and all of the major brands (at least IBM, HPaq, Dell, and a lot of the clones) we work with have redundant PSUs in their 1U boxes. The bottom end clone $100 1U cases often don't, but any decent brand, including a lot of the lesser known ones, does.

    16. Re:Disclaimer: I am Not an Electrical Engineer by Megane · · Score: 1
      Problem with one RU servers is very very few of them have redundant PSU's.

      And even when a server or router or whatever does have redundant PSUs, one of them may get weak such that the computer will work fine with all PSUs powered, but die when one of them goes out.

      And then there's those cool three-headed hydra cords I've seen plugging all the redundant PSUs of Dull servers into the same power plug.

      --
      #naabhaprzrag, #sverubfr-000, #agi-fcbafberq, negvpyr[pynff*=' negvpyr-ary-'] { qvfcynl: abar !vzcbegnag; }
    17. Re:Disclaimer: I am Not an Electrical Engineer by Anonymous Coward · · Score: 0

      Happens all the time, it'll either be a fault with the electrical distribution system down stream from UPS, or a problem with the fire systems emergency electrical shutoff.

    18. Re:Disclaimer: I am Not an Electrical Engineer by Anonymous Coward · · Score: 0

      I've toured the PNAP in Seattle (at Fischer Plaza where this event occured) several times. It was built as "Essential Services" facilities and it probably one of if not the finest datacenter in the Seattle downtown core.

      If construction really did knock out power to an entire floor, there's a major design flaw in the physical plant or someone not following procedures (I would bet the former). Whatever the cause, I'm sure a full accounting of events will be known by Monday.

    19. Re:Disclaimer: I am Not an Electrical Engineer by Fazlazen · · Score: 1
      so most people probably don't feel the need to run their own ups.

      Or perhaps they are not allowed to run their own UPS there? I know that the codes of the local municipality of my work do not allow us to have individual UPS systems in the datacenter, the whole thing must run from a mammoth UPS with a single kill switch.

      Something to do with fire code if I recall correctly...

    20. Re:Disclaimer: I am Not an Electrical Engineer by nettdata · · Score: 2, Interesting

      Maybe, but sometimes shit just happens, regardless of design.

      I find this Risk site to be very interesting reading, especially when it talks about some failure issues and scenarios.

      My favourite was about Squirrel that took down the Nasdaq. (I've also heard squirrels/mice/rats etc called "self propelled short circuits", but that's another story)

      Now, I've been involved in systems architecture design, planning, and management for years, and I think that a lot of people drastically underestimate just how fscking complex and dificult proper planning, execution, testing, maintenance, and administration of these systems can be... especially when faced with budgetary restrictions.

      The cost of a system rises almost exponentially as you approach 100% uptime... even 99.999 is freaking expensive to implement and manage. Never mind the complexity and administrative requirements.

      Who knows... maybe dealing with the PR issues of this outtage is still orders of magnitude cheaper in the long run than putting in the systems required to achieve the uptime.

      At the end of the day, what are the business impacts of this outtage? For that matter, they seem to have received more exposure than if they were operating normally.

      A lot of people are aware of the fact that sometimes things break, and we're not landing planes in the fog here. The fact that shit broke and they're bringing it back in an informed and somewhat timely manner may HELP them, in that some people may get a stronger sense of "these guys can deal with problems that hit them".

      --



      $0.02 (CDN)
    21. Re:Disclaimer: I am Not an Electrical Engineer by Cramer · · Score: 1

      Well, in all fairness, with a regenerative power system, battery "UPS" backup is unnecessary. The power is backed up by a generator and the kenetic energy of the flywheel while the generator spins up. In such configurations, complicated transfer switches aren't necessary. Simply disconnecting from utility and connecting to the generator (break-before-make) to feed the flywheel motors is far easier than phase sync'd make-before-break transfers. That's simply a "relay"; a clean power transition is unnecessary as the machines in the colo are only connected to regenerated power.

      Teremark's facility in Miami has regenerated power systems. They were all over it in every presentation. It is much cheaper, and less prone to failures. But the jury is still out on "better".

    22. Re:Disclaimer: I am Not an Electrical Engineer by Anonymous Coward · · Score: 0

      I was just down there for several hours - I have servers colo'd there. Current scuttlebutt is that some @#&%(#@&$#@ deliberately tripped one of the master "all off" switches. Oddly, this happened to me several years back too, in the Exodus data center in Seattle.

    23. Re:Disclaimer: I am Not an Electrical Engineer by Anonymous Coward · · Score: 0

      They did? I didn't get anything.

    24. Re:Disclaimer: I am Not an Electrical Engineer by Anonymous Coward · · Score: 0

      Hmmm, well Authorize.net was dead as the proverbial door nail for at least two hours today; website, cc gateways, everything. So I think the cost was in real dollars to real merchants. Likely Authnet WILL be researching alternatives.

    25. Re:Disclaimer: I am Not an Electrical Engineer by nettdata · · Score: 1

      For sure...

      I also find that too many clients/customers of co-lo's just don't do their homework or due dilligence when it comes to selecting their co-lo provider.

      Needless to say, I'd be very interested to see the SLA's that the clients had, and what exactly Internap provides by way of uptime. Maybe they just blew the rest of the year's allowable downtime.

      --



      $0.02 (CDN)
    26. Re:Disclaimer: I am Not an Electrical Engineer by Tull · · Score: 1

      To all the people accusing LJ of being stupid for not having UPS systems, Internap has 3 fully redundant power systems (yes, I know, didn't help much) so most people probably don't feel the need to run their own ups.

      Indeed. In addition, adding local UPS systems often reduces the mean time before failure when used with suitably protected power systems, because the local UPS isn't as reliable as the building power.

    27. Re:Disclaimer: I am Not an Electrical Engineer by EvilStein · · Score: 1

      That wasn't "coloserve" - was it? I'm looking for colo space in SF. heh.

    28. Re:Disclaimer: I am Not an Electrical Engineer by mizalaina · · Score: 1
      Google has engineers whose primary job is to figure out how much power and cooling they can get in a rack.

      Yes. Google are unusually well-educated consumers. I imagine they're a dream for a data centre to work with.

      The big issue that colo centers are not dealing with is security between customers --- one customer taking down another customer's equipment. It's much easier than you might imagine.

      Oh, believe me, I can imagine. Part of my job is to help people understand that this is one of the many things that differentiates us from most data centres. I think we're on the same page. :)

      (I don't work for Internap, obviously.)

    29. Re:Disclaimer: I am Not an Electrical Engineer by autocracy · · Score: 1

      Yes, but the expense of each server having less uptime is far, far less than the risk of the whole thing falling apart at once. Everything they have is distributed and redundant...

      --
      SIG: HUP
    30. Re:Disclaimer: I am Not an Electrical Engineer by Knightmare · · Score: 1

      Actually alot of times it's not about feel the need, it can actually be against code. The kill switches in raised floor space are supposed to do exactly that. Kill ALL power in the room in the case of an emergency. If you have in rack UPSs that are not hooked into that kill circuit, you have made that emergency off ineffective.

    31. Re:Disclaimer: I am Not an Electrical Engineer by Unknown+Lamer · · Score: 1

      It was announced in the paid members community and you just had to go a page and click a button because some people bitched about how they didn't want to get two months for free (yeah, weird people...something about not wanting to 'freeload' and 'hurt the project'...and now SixApart owns them).

      --

      HAL 7000, fewer features than the HAL 9000, but just as homicidal!
    32. Re:Disclaimer: I am Not an Electrical Engineer by ces · · Score: 1

      Maybe they just blew the rest of the year's allowable downtime.

      I don't have our contract in front of me but I believe InterNAp just blew the next two years allowable unplanned downtime.

      Fortunately for them their SLAs are only year to year or they would still be working off the time a customer hit the BRS thinking it would open the door.

      --
      Happy Fun Ball is for external use only.
    33. Re:Disclaimer: I am Not an Electrical Engineer by Critical+Facilities · · Score: 1

      I should begin by saying that I am not a sys admin. I am, however, a critical facilities engineer who until very recently worked at a Tier I data center for a VERY large Texas based technology company (think Ross Perot). I am in agreement with the assertion that LJ is responsible for their outage by failing to either inspect what their Facilities people are assuring them of, or failing to do it themselves if they're operating their own facility. The data center I worked at also had 2 redundant power feeds from different grids, so if 1 went down, we had another. Each of these fed 2 seperate switchgears at either "end" and were seperated by a Tie-Breaker, so the gear could be fed by either source if the other were to fail. We had 2 redundant UPS systems, each comprised of 4 modules, 1 of which was also redundant. Each module had its own dedicated string of wet cell batteries. This meant that if 1 entire UPS system went down, we could stand on the other one. It also meant that we could migrate the building load to one UPS so we could PM the other one. The redundant module was also significant. If in the unlikely event that we were to lose 1 entire UPS, we could still shift the load around on the remaining UPS modules (while online) in the event of a component failure on one of the modules. The critical load (the servers, tape silos, mainframes, etc) were all fed from PDU's, which were each fed from Static Transfer Switches. These Static switches were "make before break" using SCR's, so in the event of a loss of one source, you could roll to the other source transparenlty with regard to the computers. Also, a lot of the newer server racks are made to be "dual fed". That means, even if the entire system works properly, but the PDU (power distribution unit) from which the server gets its power were to fail, the server is still fed from another PDU (which in turn is fed from the aforementioned redundancies) so failure is EXTREMELY unlikely. And yes, we had 6 diesel generators. 3 Caterpillar (much more reliable) and 3 Cummings. The building could run off of 3 (2 in a real crisis with some load shedding), and each generator had its own 6,000 gallon underground fuel tank. We tested each of these components regularly and meticulously. Once a month, we rolled the entire building load to the generators for 2 hours. We did breaker testing of ALL critical breakers annually. We did Infrafred Scanning of all breakers, swtichgear, tranformers, transfer swtiches, etc annually looking for sources of heat, indicating potential problems. We load banked our generators annually, running them at 100% for 4 hours each. We had regularly scheduled PMs on EVERY piece of critcal gear, and guaranteed 100% uptime. My point is, it is MUCH more complex than a UPS just "blowing up". The many layers of redundancy should be more than adequate to handle a catastrophic event. If any of the components were to blow up, (i.e. Static Switch, UPS module, battery string, Main Breaker) there should be a redundant one in place to assume duty. Additionally, I have to wonder why the generators didn't fire up (I'm assuming they didn't if the site was down for so long). Anyone who would invest the time and money to put in this kind of infrastructure and not do regular testing of each component is an idiot, plain and simple. Sure, there could have been extenuating circumstances, but when you're in Critical Facilities Management, your job is to think of and plan for these things before they happen. No, you can't prevent shit from happening, but you can make it very unlikey that shit will happen to all of your equipment at the same time. Just my 2 cents, thought it might be interesting.

    34. Re:Disclaimer: I am Not an Electrical Engineer by vertinox · · Score: 1

      I think people fail to see something:

      Redundancy means little when failure is intentional.

      Conspiracy theory aside... I've often wondered what it would take to bring down say Everquest servers. Ala walking in and pulling some plugs etc...

      Who knows. Maybe someone had a post they wanted to really get rid of.

      --
      "I am the king of the Romans, and am superior to rules of grammar!"
      -Sigismund, Holy Roman Emperor (1368-1437)
    35. Re:Disclaimer: I am Not an Electrical Engineer by crschmidt · · Score: 1

      Based on past experience, LiveJournal is very generous for awards to users due to something on Livejournal being broken. During a pretty widespread DDoS, an A block was blocked upstream for 24 hours - and all paid users received an extra three days for the trouble.

      I may be slightly misremembering details, but like I said, LiveJournal has always been generous in making sure its paid customers are happy.

      The free users, of course, are shit out of luck.

      --
      -- Christopher Schmidt YouTube Quality of Experience
    36. Re:Disclaimer: I am Not an Electrical Engineer by TheSync · · Score: 1

      Let me describe an outage at a mission-critical technical facility (millions of viewers) that I am faimilar with.

      Local power goes out. UPS comes on seemlessly. Unfortunately, it is so seemless that the operations folks don't even know the power is off. Evidently the UPS doesn't have an audio alarm...

      When the power goes off, the generator should come on. The "signal" to the operations folks that the power is out is the loud sound of the generator starting. But the generator doesn't come on...

      After a while, the UPS drains, and the lights go off. Millions of viewers see black.

      Someone runs out to the generator...the battery to start the generator was dead! Someone had kicked a switch that charges the battery from 12V to 6V. The switch is near the floor level...

      A quick jumpstart from a car gets the big generator going again...

    37. Re:Disclaimer: I am Not an Electrical Engineer by Freeak · · Score: 1

      Dude calling "Bullshit" totaly gives away where you worked can we say "INFLOW"

    38. Re:Disclaimer: I am Not an Electrical Engineer by Freeak · · Score: 1

      It came from Inflow

  46. Re:GOOD! by Anonymous Coward · · Score: 0

    How to Use Your Browser Tip #63:

    When you put your mouse (that's the little box with the long cord and the buttons) over a link (that's a shiny word), it will display the address (that's a bunch of words with slashes and colons and stuff) that it points to in the bar (that's a horizontal strip, probably grey) at the bottom of your browser window (that's the magic box your browser lives in inside your computer). If you see the words "livejournal.com" in that address, it probably means the link (refresher: that's a shiny-looking word) goes to LiveJournal.

    Now, assuming you hate LiveJournal, clicking (pushing the button) on that link (one more time: a shiny-looking word) will make you unhappy, forcing you to search the Internet for child porn to find solace (see How to Use Your Browser Tip #91: "Boffo the Clown Shows You Everything You Ever Wanted to Know About Smut, You Filthy, Filthy Perv!").

  47. STOP SAYING BLOG!!!!!!!! by stratjakt · · Score: 0, Troll

    We're out of combinations of phonetic sounds. When we make up new words, they sound so fucking retarded, like blog. Say it out loud. Tell me you don't feel like you just lost 100 IQ points.

    Why can't we just call them what they are: online diaries/journals?

    --
    I don't need no instructions to know how to rock!!!!
    1. Re:STOP SAYING BLOG!!!!!!!! by Anonymous Coward · · Score: 0

      We're out of combinations of phonetic sounds. When we make up new words, they sound so fucking retarded, like blog. Say it out loud. Tell me you don't feel like you just lost 100 IQ points.

      Why can't we just call them what they are:


      Web logs?

    2. Re:STOP SAYING BLOG!!!!!!!! by Anonymous Coward · · Score: 0

      Why can't we just call them what they are: online diaries/journals?

      Blog.

    3. Re:STOP SAYING BLOG!!!!!!!! by Anonymous Coward · · Score: 0
      Web logs?
      Web journals.
    4. Re:STOP SAYING BLOG!!!!!!!! by tepples · · Score: 1

      We're out of combinations of phonetic sounds.

      That's almost as silly as saying we're out of combinations of musical notes. It's so silly that it just might be true.

    5. Re:STOP SAYING BLOG!!!!!!!! by rolocroz · · Score: 1

      Note that it is called LiveJournal, not LiveBlog.

      --

      I meta-mod all positive moderation Unfair, because it's abuse of the system.

    6. Re:STOP SAYING BLOG!!!!!!!! by Anonymous Coward · · Score: 0

      Hey careful there, some of us don't have a hundred IQ points to spare

    7. Re:STOP SAYING BLOG!!!!!!!! by Anonymous Coward · · Score: 0

      Go blog yourself.

    8. Re:STOP SAYING BLOG!!!!!!!! by Silentnite · · Score: 0

      *Out of combinations of phonetic sounds*.. Huh. Maybe youve already lost 1 IQ point to many. But whatever. Blog.

    9. Re:STOP SAYING BLOG!!!!!!!! by g-doo · · Score: 1

      I find nothing wrong with the word "blog", as do a significant number of other people in this world. Don't act as though your opinion is the only one that matters.

    10. Re:STOP SAYING BLOG!!!!!!!! by EvanED · · Score: 1

      What's wrong with log?

    11. Re:STOP SAYING BLOG!!!!!!!! by lachlan76 · · Score: 0, Flamebait

      When we make up new words, they sound so fucking retarded, like blog. Say it out loud. Tell me you don't feel like you just lost 100 IQ points.

      This is about livejournal remember, it's not like it's a blog site where your IQ is expected to increase ;)

    12. Re:STOP SAYING BLOG!!!!!!!! by Glowing+Fish · · Score: 1

      Seconded.

      I hope this is one word that we refuse to have foisted on us.

      I mean, no one I know says "blog" besides print journalists commenting on those crazy kids with their kooky intarweb.

      --
      Hopefully I didn't put any [] around my words.
    13. Re:STOP SAYING BLOG!!!!!!!! by matthewpilon · · Score: 1

      I agree!

    14. Re:STOP SAYING BLOG!!!!!!!! by ChristianBaekkelund · · Score: 1

      Amen to that...I hate the word blog...and it seems so....artificial?, as well..

  48. Gee, I wonder... by Rie+Beam · · Score: 4, Funny

    Update from the site:

    "Update #1, 7:35 pm PST: we're up on 'dirty' power for now (it works, but it's unreliable)".

    Congrats to LiveJournal for assembly a coal generator in a record time.

    1. Re:Gee, I wonder... by Rie+Beam · · Score: 1

      And yes, before anyone feels intelligent enough to post it:

      -assembly
      +assemble

    2. Re:Gee, I wonder... by Rie+Beam · · Score: 1

      "And yes, before anyone feels intelligent enough to post it:

      -assembly
      +assemble
      "

      Shit, did I just post that?

      -assemble
      +the assembly

    3. Re:Gee, I wonder... by Anonymous Coward · · Score: 0

      Try assembling.

    4. Re:Gee, I wonder... by shigelojoe · · Score: 0, Offtopic

      Congrats to LiveJournal for assembly a coal generator in a record time.

      "And yes, before anyone feels intelligent enough to post it:

      -assembly
      +assemble"

      "Shit, did I just post that?

      -assemble
      +the assembly"


      So what you've got now is "Congrats to LiveJournal for the assembly a coal generator in a record time."

      Still doesn't look quite right... how about
      +of

      so we have "Congrats to LiveJournal for the assembly of a coal generator in a record time." ;) /shigelojoe, shameless provider of unrequested and pedantic grammatical advice since 1984!

    5. Re:Gee, I wonder... by Vampyre_Dark · · Score: 1

      Their old power didn't seem to reliale either, what's the difference? :)

    6. Re:Gee, I wonder... by Rie+Beam · · Score: 2, Informative

      Sir, I will fight your advice until my grave.

    7. Re:Gee, I wonder... by daft_one · · Score: 1

      Until your grave... does what?

    8. Re:Gee, I wonder... by Rie+Beam · · Score: 1

      Spins, most likely.

    9. Re:Gee, I wonder... by Rie+Beam · · Score: 1

      Their old power didn't seem to reliale either, what's the difference? :)

      In this version, the black smoke flying from the machines is normal.

  49. Update by TrevorB · · Score: 3, Interesting

    On the Livejournal main page:

    Update #1, 7:35 pm PST: we're up on 'dirty' power for now (it works, but it's unreliable), and we're working to assess the state of the databases. The worst thing we could do right now is rush the site up in an unreliable state. We're checking all the hardware and data, making sure everything's consistent. Where it's not, we'll be restoring from recent backups and replaying all the changes since that time, to get to the current point in time, but in good shape. We'll be providing more technical details later, for those curious, on the power failure (when we learn more), the database details, and the recovery process. For now, please be patient. We'll be working all weekend on this if we have to.

    Lovely. I just bought another year's subscription for my wife, figuring the change to Six Apart wouldn't change anything for a few months at least. LJ could lose a lot of subscribers with an outage just after the takeover.

    1. Re:Update by Rie+Beam · · Score: 1

      I doubt it. It's not like this is some frequent event - as someone said early, some jackass was probably just messing with the breaker. How often is LiveJournal normally down, I ask you? Besides, how exactly can you blame a power-outage in any way, shape, or form on Six Apart? What exactly could they do to make electricity stop running, and why would they do it?

    2. Re:Update by Anonymous Coward · · Score: 1, Insightful

      sorry, I missed the part where Six Apart actually changed something...

    3. Re:Update by Anonymous Coward · · Score: 1, Insightful

      uh, this is not Six Aparts fault. This has nothing to do with them. And, they won't lose subscribes.

      How do I know this? Outages are common on LJ....

    4. Re:Update by TrevorB · · Score: 1

      True, this isn't Six Apart's fault at all.

      However, considering the level of paranoia on LJ regarding the acquisition, the timing really really sucks.

    5. Re:Update by MegaFur · · Score: 1

      It could be sabotage on the part of someone that doesn't want to see lj succeed and by someone that knows people will be stupid and think that the power outtage is somehow related to the takeover simply because the two events ocurred chronologically close...

      Yeah ok, pretty unlikely. :-)

      --
      Furry cows moo and decompress.
  50. Look at me! Look at me! by Cyburbia · · Score: 4, Funny

    live journal is dark like my soul like my heart a void its link is cut just like i'll be doing to my arm i blame my parents

  51. Michael - STFU by Anonymous Coward · · Score: 0

    Michael, SHUT THE FUCK UP with your stupid editorial snide remark.

    1) How the fuck is this Six Apat's fault when they * just * took over?

    2) Sometimes, no matter what you do, shit happens.

    3) If this has a human fault, it was the previous owners who didnt pay or check their servers would survive a power out.

    4) Just what is your problem with Six apart that you make such comments anyway? Your an editor, it is not your place to make stupid snide cooments that are clearly BS.

  52. Brilliant Idea by Anonymous Coward · · Score: 0

    I'm sure a slashdotting is exactly what LiveJournal needed right now.

    But i feel like this could be turned into some sort of brilliant DDOS attack scheme in the future...

  53. Angst by mr_RR · · Score: 1

    Looks like the angst over at Livejournal is no longer limited to the database.

  54. A great disturbance in the Force... by YowzaTheYuzzum · · Score: 5, Funny

    ... as if millions of teenage girls suddenly cried out in terror and were suddenly silenced.

    1. Re:A great disturbance in the Force... by cHiphead · · Score: 2, Funny

      shouldnt that be "millions of lonely emo teenage boys"?

      hopefully the 'power outage' that took out lj was actually something cool like someone sneaking an emp bomb into the datacenter. and not some dipshit power company employee hooking up something wrong on a transformer outside and melting the lines.

      cheers.

      --

      This is my sig. There are many like it, but this one is mine.
    2. Re:A great disturbance in the Force... by Council · · Score: 2, Informative

      Livejournal is something like 65:35 female:male.

      --
      xkcd.com - a webcomic of mathematics, love, and language.
    3. Re:A great disturbance in the Force... by balthan · · Score: 1

      It's cute to see such naivety still on the internet. Never played any MMORPGs, huh?

    4. Re:A great disturbance in the Force... by drsquare · · Score: 3, Funny

      >>Livejournal is something like 65:35 female:male.
      >It's cute to see such naivety still on the internet. Never played any MMORPGs, huh?


      This is different to MMORPGs, MMORPGs are generally a male domain, with men pretending to be women to get favours. On the other hand, blogs involve things women like doing, i.e. fucking going on and on about shit no-one cares about.

    5. Re:A great disturbance in the Force... by inlandsis · · Score: 1

      Well, I don't think LiveJournal being mainly female is an issue. As for shit no one cares about, you could be surprised.

    6. Re:A great disturbance in the Force... by nike2422 · · Score: 1

      Interesting observation about blogs. I'm a 40 year old female with an LJ and for some strange reason lots of people read my blog. I guess people do care about the shit I go on and on about.

      Check it out sometime.

      --
      What Would Scooby Do?
    7. Re:A great disturbance in the Force... by Anonymous Coward · · Score: 0

      You do go on and on, don't you? I'm getting tired reading this post here.

    8. Re:A great disturbance in the Force... by cHiphead · · Score: 1

      you underestimate the drive of lonely emo teenage boys to create fake 'female' journals so they can add themselves to the friends list and pretend to have a modicum of popularity and skill with the ladies.

      cheers.

      --

      This is my sig. There are many like it, but this one is mine.
    9. Re:A great disturbance in the Force... by Anonymous Coward · · Score: 0

      Gee, and I just thought that comment was funny. You know why? Because it IS funny. But hey, let's all get our panties in a bunch over the male/female ratios. Our better yet, let's all laugh at the clever comment. It's funny.

    10. Re:A great disturbance in the Force... by zonker · · Score: 0

      that and the pictures of a million cats...

    11. Re:A great disturbance in the Force... by Anonymous Coward · · Score: 0

      Apparently, if you do that, you usually also die.

      http://www.powerlineman.com/

      I reckon, given this is California, it's most likely flaky infrastructure + massive power demand = sudden overcurrent failure.

    12. Re:A great disturbance in the Force... by Anonymous Coward · · Score: 0

      I'm not a coward..I just can't be bothered signing up. It smacks of effort. All I wanted to say is that something-million women are "fucking going on and on about shit" on livejournal, instead of fucking going on and on about shit in your ear :D So be grateful they're getting it out of their systems online, instead of to you.

    13. Re:A great disturbance in the Force... by Marxist+Hacker+42 · · Score: 1

      Nope- chances are they were a part of the Corbett Power Outage due to the ice storm that hit the Portland, OR area this weekend.

      --
      SJW: a person who perceives an injustice, and while correcting it, commits a greater injustice.
  55. 2day sukd sooo bahd you wudnt believe it by djfray · · Score: 1

    half of the newest entries.....oh wait, I see someone's already got the emo lj user-base jokes covered. I have so much room to talk because I have an lj too, haha. But, seriously, how can this have anything to do with six apart?

    --
    This sig is o Unfunny o Funny
  56. Re:./ed !!!! Server Reboot Time? by ebooher · · Score: 4, Interesting

    This is another thing that bothers me about this scenario. I can't say that I've ever admined 100 servers, the most I've ever had was about 30, but if we had a power loss of any kind, you'd just repower them and walk away. Most of them were DEC Alpha gear running Tru64. Why would you spec out a box that has to be handheld every reboot? The only time you should have to handhold a server is during an upgrade. A power cycle without proper SIGHUP or term signals should just run fdisk on it's way back up. (K, so it might take an hour for the server to go live again, but still.) I mean, am I missing something here? Maybe since nothing I've admined got the traffic these things do .... I'm just lost. Some one hit me with the clue by four.

    The only thing I can even think of is they have explicit services that must be started manually ..... but why would you want that? If you have a power hiccup in the middle of the night, you want it to come back up, and be live and happy again *before* you even get the first page. I mean sure, if there was a surge, and that destroyed components, and those components have to be replaced ..... but ..... a reboot is a reboot, man. Here, smoke some source. It's the good stuff.

    --
    "Genius may shine aloof and alone, like a star, but goodness is social, and it takes two men and God to make a Brother."
  57. Six Apart is hosting them already? by MattW · · Score: 3, Insightful

    Er, they just announced Six Apart was buying them like days ago. I doubt they transitioned the servers in the first week.

  58. Hallelujah! by Rirath.com · · Score: 1

    That's one 1 down...

    And a massive cheer was heard across the land...

  59. It's Bush's fault. by glrotate · · Score: 1

    Wow. Can't the poor guy do anything right?

    1. Re:It's Bush's fault. by Anonymous Coward · · Score: 0

      Name one thing.

    2. Re:It's Bush's fault. by Anonymous Coward · · Score: 0

      The election (unfortunately).

  60. Dirty Power? by MonoNexo · · Score: 1

    I'm not an electrical engineer, either, but I'm wondering what Dirty Power is? Is that the unfiltered power that tends to anomilate, per the Monster Cable surge protectors advertising? Or am I thinking of something else?

    1. Re:Dirty Power? by Coolpup · · Score: 1

      The simpliest definition of "dirty power" that I can think of is unreliable power. It could have spikes, lulls, outages, or any number of other power issues. This is just information I've taken off of APC posters we have plasterd around our campus. -- Nothing to see here.

    2. Re:Dirty Power? by Anonymous Coward · · Score: 0

      MonoNexo! Watch underaged Dirty Power gyrate and show you EVERYTHING!

    3. Re:Dirty Power? by sarahemm · · Score: 1

      it's likely, in this case, power directly from the grid, without being conditioned or backed up in any way. some colos give you one power feed of 'dirty' and one of 'clean' for devices requiring redundancy, better ones give you two fully conditioned/backed up systems that are completely isolated from building entry point up to your servers themelf. sound like internap probably just got them back online any way they could while they work on getting normal conditioned/backed up power back online.

  61. Not only Livejournal by Anonymous Coward · · Score: 0

    For anyone who uses Authorize.net, you may have noticed a little downtime a few hours ago. They run their stuff through Internap, and went down for a decent while before coming back up. I called the tech guy and apparently there was a huge power failure at their ISP that took them offline. Wasn't entirely sure if that was 100% accurate, but this new info about Livejournal certainly corroborates the guy's story.

  62. Plea to the mods by clotito · · Score: 0, Redundant

    I'd like to request that the links to LiveJournal be removed from the Slashdot post. It's not as if people don't know how to get there, but at least they are less likely to click just for the hell of it or because they didn't think before hand. I doubt that it will get the servers back up quicker, but at least we won't have to be concerned that we're part of the problem. Thanks for hearing me out.

  63. suprise.. by sumsinnow · · Score: 1

    watch it'll come back up as a subscription site and all of your journals are erased if you don't pay..

    --
    Regards, Joseph
  64. Re:./ed !!!! Server Reboot Time? by Anonymous Coward · · Score: 0

    from the updated message they're giving, I'd have to assume that they're going through all the databases, looking for data incongruencies caused by calls that were not completed due to the power loss... especially if there were data that had to propagate its way through the network, it could be quite troublesome if all the data connections were to just drop like they did

  65. Re:Look at me! Look at me! by Anonymous Coward · · Score: 0

    hee hee ho ho ***clunk*** hahahahahahahahahahahahahahaha

    well said mate.

  66. Re:./ed !!!! Server Reboot Time? by bradfitz · · Score: 5, Insightful

    They all came back up when the power came back.

    But we intentionally don't have databases come back up on boot because if there was a blip, we want to do an integrity check first. (we run InnoDB, so it's ACID, but we're paranoid ...)

    We have clusters of 2 identical databases in separate cabinets, separate switches, separate Internap power feeds... so normally losing one database in each cluster doesn't matter: the other one gets used. But when we lose every single database, in all clusters, all at once... that's the time to be paranoid and double check stuff.

  67. Where physically are they? by Anonymous Coward · · Score: 0

    Tonight is the night for power failures. My employer's main data center in Ann Arbor experienced an external power failure just before 5.

  68. Last post before blackout: by jcostantino · · Score: 1

    OMGWTFBBQ!

    --
    Reviews with a twist! http://www.sardonicbastard.com
  69. Re:./ed !!!! Server Reboot Time? by Anonymous Coward · · Score: 1, Funny

    Brad, get back to work! I need my friends page!

  70. "LiveJournal Servers Go Down" by PornMaster · · Score: 4, Funny

    LiveJournal Servers Go Down

    With thousands of teenage girls unable to ponder in an open forum whether or not to blow their boyfriends, thousands of teenage girls go down.

    1. Re:"LiveJournal Servers Go Down" by bluequartz · · Score: 1

      lol - you little joker you!!

    2. Re:"LiveJournal Servers Go Down" by Lord_Dweomer · · Score: 1
      "thousands of teenage girls go down."

      Not that that holds any relevance here...*sigh*

      --
      Buy Steampunk Clothing Online!
    3. Re:"LiveJournal Servers Go Down" by lachlan76 · · Score: 1

      It will if you've got a generator ;)

    4. Re:"LiveJournal Servers Go Down" by Anonymous Coward · · Score: 0

      Just because they all refuse to blow you is no reason to make fun of them.

  71. Where's my irony stick? by gmhowell · · Score: 4, Insightful

    Because michael needs a beating. The site that rolls beta (alpha?) code onto live servers complaining and making jokes because another site goes down through no fault of its own?

    --
    Jesus was all right but his disciples were thick and ordinary. -John Lennon
    1. Re:Where's my irony stick? by elmegil · · Score: 2, Funny

      Oh no. Not a beating. There's a much better use to which we should be putting that irony stick to help Michael out.

      --
      7 November 2006: The day Americans realized corruption and incompetence weren't addressing 11 September 2001
    2. Re:Where's my irony stick? by ces · · Score: 2, Interesting

      Really, talk about the pot calling the kettle black.

      If the datacenter that hosts Slashdot was to have a massive power failure how long would /. be down for?

      That said my company has gear in the same datacenter as LJ, our servers were back up 10 minutes after power was restored. Then again we use Oracle on HP-UX with nice SAN RAID boxes for storage for our database. So our stuff tends to recover from a sudden power loss a little better than a MySQL derivative running on clone hardware.

      --
      Happy Fun Ball is for external use only.
    3. Re:Where's my irony stick? by romcabrera · · Score: 1

      Everything in italics is the exact comment from the submitter. Everything else NOT IN italics, is any of the additional comments by the editors. Haven't you ever noticed that????

    4. Re:Where's my irony stick? by gmhowell · · Score: 2, Interesting

      Everything in italics is the exact comment from the submitter. Everything else NOT IN italics, is any of the additional comments by the editors. Haven't you ever noticed that????

      Long before you showed up here.

      Michael, as an editor, could easily rewrite the summary (and perhaps he did). Or he could choose the most inflammatorily written piece, and pretend to have presented the article without bias.

      Certainly none of these theories are more tin-foil-hattish than 95% of the stories on YRO.

      --
      Jesus was all right but his disciples were thick and ordinary. -John Lennon
  72. OH SNAP! SA FORUMS ARE DOWN TOO!!! by Anonymous Coward · · Score: 0

    Does anyone know why the SomethingAwful forums are down? This is much more dire of an Intarnet catastrophe than ten livejournals!

  73. more like tetanus by tepples · · Score: 1

    lj == lockjaw

    1. Re:more like tetanus by laughingcoyote · · Score: 1

      Don't we wish. Or at least "lockfingers" maybe. That would certainly raise the intelligence level of this thing we call the "net".

      --
      To fight the war on terror, stop being afraid.
  74. My future predicting powers... by Ingolfke · · Score: 1

    tell me that many will blog about how they couldn't blog. Some will complain about the stress of not being able to express themselves, others will question the engineering prowess of LiveJournal and wildly speculate about the cause of the power outage followed by plans to re-engineer the data center, the LiveJournal infrastructure and 93% of the Internet to ensure this never happens again (X.25 over barbed-wire will be suggested).

    Several individuals will join together to file a class action lawsuit against LiveJournal and the data center citing their inability to express themselves due to neglicance and will seek real and punitive damages totalling over $2.5 billion dollars.

  75. No... by EdMcMan · · Score: 4, Insightful

    Perhaps Six Apart wasn't quite prepared for the responsibilities of a website of this size?

    What does Six Apart have to do with Internap? Livejournal has been using - and wanting to switch from - Internap for a long time.

  76. california's power problems by The-Perl-CD-Bookshel · · Score: 0, Troll

    Doesn't California have a long history of power problems? Why burden ones self with the costs of putting a server center there instead of moving to another state with a better infrastructure. Another negative externality of doing business in California.

    --
    I don't keep a lid on my coffee so when I walk around I look busy -me
    1. Re:california's power problems by Anonymous Coward · · Score: 0

      The DC that went down was in Washington state.

      Nice try though.

    2. Re:california's power problems by Anonymous Coward · · Score: 0

      It is standard data canter procedure to have industrial grade UPS equipment + autostart generators (with additional fuel ready).

      California's power is actually quite reliable, the problems that existed were largely due to criminal fruad comitted by the likes of Enron.

      Yah, and as the other poster just mentioned, the data center isn't even in California.

      RTFA, moron.

    3. Re:california's power problems by Anonymous Coward · · Score: 0

      criminal fruad comitted by the likes of Enron

      And just in case people don't know - EDS helped them structure the most profitable of the fraud schemes. No one from EDS is going up the river for bilking California out of billions, but thank god Martha Stewart won't be manipulating the markets any more, that is only OK when it is the financial services industry itself (they don't like the competition).

    4. Re:california's power problems by britneys+9th+husband · · Score: 0

      Not anymore. We got rid of the evil Gray Davis (who of course personally caused the power to go out, my republican friend says he saw Gray Davis flipping a big light switch at PG&E headquarters to the "off" position just before one of the rolling blackouts) and replaced him with the Governator. Arnold won't let the power go out, and if it does, he can go turn the turbines himself with his super terminator strength. Or something like that.

      --
      Hear recorded Slashdot headlines on your phone! New service beta testing. Just call (248) 434-5508
  77. Irony by SELainWhoAmI · · Score: 1

    I found it ridiculously ironic that as soon as I wanted to bitch about livejournal being down in my livejournal..... I couldn't.

    1. Re:Irony by Dev_Nell · · Score: 1

      Followed closely by the irony of people who live to post on LJ being teased unmercifully by people who live to post on Slashdot.

  78. Re:./ed !!!! Server Reboot Time? by TrevorB · · Score: 2, Informative

    For those people who might not know, Brad Fitzpatrick is Livejournal User #1.

    I'd have to agree with the AC, Brad, stop posting to slashdot and hover over that DB rebuild a bit more.

    (Yes, posting to slashdot relieves tension... Whatever it takes, Brad.)

  79. Oh no! It's the end of the world!! by Skudd · · Score: 1

    "like, how m i suppozd 2 tell meh bf bout dat par-t?"

    1. Re:Oh no! It's the end of the world!! by Skudd · · Score: 1

      My condolences go out to the many teenagers who are stricken with depression and are now unable to tell the entire world about it. Although, maybe now they'll learn how to untar a simple archive and use something like Movable Type, b2, or SkuddBlog.

    2. Re:Oh no! It's the end of the world!! by Anonymous Coward · · Score: 0

      Too bad MT is a PAID service. And too bad its comment mechanism sucks like no tomorrow.

  80. Bush supporter too dumb to understand datacenters? by alienmole · · Score: 2, Interesting
    It says nothing of Bush or Internap. It says everything about cheapskate blog admins who think they can run servers without paying for battery backup.

    The LiveJournal status page claims "Our data center (Internap) lost all its power, including redundant backup power". This is nothing to do with "cheapskate blog admins" and everything to do with a serious and quite likely unacceptable problem at Internap.

    Of course, that's why Anonymous Cowards start out with zero points. Guilty of idiocy until proven innocent.

  81. Re:./ed !!!! Server Reboot Time? by sendai2ci · · Score: 1

    If other LiveJournal users ever found out you post here Brad, /. might end up with 5000+ replies to everyone of your posts :p

    Other than that, cheers and keep up the good work :)

  82. We're missing this by Anonymous Coward · · Score: 0

    "YESTERDAY MY MOM GOT A FONE KALL ON HER CELL FONE FROM MY MATH TEACHER SAYiNG THAT i DO BAD iN CLASS AND ALL THiS. SO iM NOT REALLY ALOUD ON THE COMPUTER WHEN SHE'S HERE CUZ i GOTTA BRiNG UP MY GRADES AND ALL THAT. THANKS ALL FOR THE PROPS. iLL BE ON HERE TOMORROW TO UPDATE iF SHE GOES TO WORK. iLL STiLL BE UPDATiNG ON THiS JUST WHEN SHE AiNT HERE. HAHAH iM BAD! OH AND ON ANOTHER NOTE. MY DiET iSNT GOiNG THAT GOOD. i SORTA BROKE iT A FEW TiMES. BUT iM COMMiTED NOW AND STARTiNG TOMORROW MORNiNG NOTHiNG OVER 100 CALORiES, NON FAT, OR LOW FAT FOOD. GOiNG OUT TO DiNNER TONiGHT SO iMMA GO CLEAN UP A BiT AND GET READY"

    A direct quote, sadly. Is this a good thing?

  83. Buzzwords by Ghostgate · · Score: 1

    Yeah, but they left out "paradigm" and "synergy" - upper management will never take them seriously without those!

  84. I know... by Anonymous Coward · · Score: 0

    this is redundant but... PWNED!!!

  85. And now from news from a midget in a bikini.... by testednegative · · Score: 1

    "Ted, it seems that we the LiveJournal outage has caused a massive wave of young emo writing singers who just want to be heard."

  86. Other power backup stories... by ari_j · · Score: 1

    In Minneapolis, Unisys has I believe two or three large diesel generators. One time when their part of the city lost power, they fired them up and had a lot of juice left over. Northwest Airlines bought some of their power and they still had electricity to spare, and ended up powering thousands of homes in the southeastern suburbs, if I remember the story right.

    A friend who worked for Exxon once told me about their power backups...I think it was almost cheaper for them to run on diesel than on the local grid. ;)

  87. Suggested correction to Thinkgeek's shirt: by jcuervo · · Score: 1

    "I'm blogging this locally, and will post it when the servers come back up"

    --
    Assume I was drunk when I posted this.
  88. Re:./ed !!!! Server Reboot Time? by SuperDJ · · Score: 1

    Yeah, just like on LiveJournal.com. Thanks for the heads-up though! =)

    --
    RTJKJAS
  89. Internap is the best carrier I've ever dealt with. by Anonymous Coward · · Score: 0

    I've had a lot of experience with Internap, and I can tell you, the quality of their service is nothing but great. It's possible this could turn out to be one man's mistake, but come on. Half of the United States lost power once, because of an unforseen chain reaction.
    I can give nothing but the highest praise for Internap. Everytime I ever upgraded the IOS on the core router, or took BGP down for testing (Ensuring our failover worked properly. The only real test, is a live test.) I received e-mails in advance about every maintenance, even though 99% of the time, I wouldn't lose connectivity during the maintenance, a testament to their engineering capabilities.
    I was employed, at the time, with a wholesale VoIP company. This means, to provide top knotch quality, we could have no jitter (variation in the RTT of ping times.) Internap monitors their routes for quality, and has customized software to take alnternate paths if problems are recognized are dedicated through one carrier.
    I've never worked for Internap, but I've worked with a hell of a lot of ISP's over the years. They are, far and away, without doubt, the best carrier I've ever dealt with. When I was having problems at one point, which was affecting only their BGP session, they had a team of 4 engineers helping me look at the debug information. A reload solved the problem. Heh...Cisco never did find out what happened. Anyway, it sucks to see them slandered with the strength of their service, and I just wanted to give the community one engineers humble input.

  90. Re:./ed !!!! Server Reboot Time? by bradfitz · · Score: 5, Informative

    At this point all my whiteboards are full of boxes of each database cluster, the machines in that cluster, which have passed their checksum tests. (innodb checksums each 16k page), which replayed their replay/undo logs, where in binlogs each was writing/reading/executing etc...

    So lots of waiting now on the checksum validators. I don't want to put a machine back in and find out in a week there was a database page that was corrupt because the battery-backed write-back cache on the RAID card didn't work as advertised. (which happens on about 95% of RAID cards, in my experience, because they're mostly crap, even the most expensive ones...)

    Also whenever there's any doubt about something's integrity, we backup or snapshot the potentially corrupt version before operating on it. That operation can take time too.

    It's going to be a fun night.

  91. Re:GOOD! by Anonymous Coward · · Score: 0

    Yeah, keeping a journal or diary is only for the sad and lonely. Get a clue.

    Let people do what they want; we don't need others to put them down just because it makes them not feel as insignificant. Go do something constructive, and actually try to better society, and you'll be more significant in the lives of others, which makes you feel better about yourself. Everyone benefitting is better than being a parasite, isn't it?

    One could say, "At least they're not sad or lonely enough to have a slashdot account", and it would sound more "realistic", thanks to the "closet geek" stereotype. But then, wouldn't you say, "That isn't true. Besides, if you don't like it, you don't have to visit. You can just totally ignore this part of the internet."? What's different with Livejournal? Or anything else that's "uncool" in your eyes?

    But I disgress, since you're so much better than everyone else and can decide what's universally "cool" and what's not, and especially since I overlooked the point that the Livejournal admins hired thugs to force you to read teen girl journals for hours every day...

  92. Re:./ed !!!! Server Reboot Time? by Anonymous Coward · · Score: 0

    *hugs* Good luck man!

  93. OMGOMGOMG by damacus · · Score: 1

    What am I gonna do, now that I can't update my blog!

    1. Re:OMGOMGOMG by Anonymous Coward · · Score: 0

      Troll slashdot of course!

    2. Re:OMGOMGOMG by damacus · · Score: 1

      c1, boredom must be overcome in some fashion. Fear the CS reference.

  94. funny by TLouden · · Score: 1

    I just went there for some information before coming here. Didn't think much about it but appearently somebody else did.

    --
    -Tim Louden
  95. Wikipedia is down too by Anonymous Coward · · Score: 0

    I, for one, welcome our website-killing overlords.

  96. Re:./ed !!!! Server Reboot Time? by simsong · · Score: 1

    Hey, good luck with this whole thing. I hate it when it happens. Did you have both of your clusters in the same center?

    --
    (Yes, I really am Simson Garfinkel)
  97. Re:./ed !!!! Server Reboot Time? by TrevorB · · Score: 1

    A long night indeedy.

    Is there some sort of load threshold you're willing to live with? Perhaps 50% or 80% of all servers up before starting a cluster? You know your system load distribution based on time of day better than anyone though...

    An update #2 on the status page might be called for at this point. People might appreciate some reflection of how many checkboxes are checked off on that whiteboard. It would also give the impression you're busting ass for LJ, which would go over well after the panic some users had over SixApart. "As long as Brad is still around, we're in good hands", that sort of thing.

    Good Luck tonight. SysAdmin crises nights suck, but they do actually pass.

  98. thank god by wobblie · · Score: 1

    I only hope it lasts forever.

  99. nt by njyoder · · Score: 0
    The website is currently displaying this: Our data center (Internap) lost all its power, including redundant backup power, for some unknown reason. (unknown to me, at least) We're currently dealing with bringing our 100+ servers back online. Not fun. We're not happy about this. Sorry... :-/ More details later.

    Update #1, 7:35 pm PST: we're up on 'dirty' power for now (it works, but it's unreliable), and we're working to assess the state of the databases. The worst thing we could do right now is rush the site up in an unreliable state. We're checking all the hardware and data, making sure everything's consistent. Where it's not, we'll be restoring from recent backups and replaying all the changes since that time, to get to the current point in time, but in good shape. We'll be providing more technical details later, for those curious, on the power failure (when we learn more), the database details, and the recovery process. For now, please be patient. We'll be working all weekend on this if we have to.

  100. Value of Livejournal - "Open Source Philosophy" by DemonWeeping · · Score: 5, Interesting

    For those who don't know what's so hot about it and for those who think Livejournal is just a bunch of teenage girls whining.... Livejournal has just about four years of my life documented. The ease of use and the ability to "vent" is comforting, but the real value comes in the interaction. My friends see my life at their convenience and I see theirs at mine. We can choose to ignore the whining of others or we can choose to relate and comment on our own experience. Think of it this way: Open-source philosophy, emotion, and life. I put my own out there and others add to it. I add mine to others. Granted ... those quiz/meme things HAVE TO GO. I do not want to read about "what frog best resembles me" or "which 80's hair band song is me." Grrr.

    1. Re:Value of Livejournal - "Open Source Philosophy" by killerface · · Score: 1

      I agree, i use livejournal also, just for the convience, i can open up logjam, put my ideas in and it's on. The Emo kids do have to go though

    2. Re:Value of Livejournal - "Open Source Philosophy" by killerface · · Score: 1

      and if you don't believe me look at how i spelled convenience

    3. Re:Value of Livejournal - "Open Source Philosophy" by Anonymous Coward · · Score: 0

      HEAR HEAR! All these people thinking they're just too cosmopolitan for LJ can go on using their ugly other community-less blogs. I'm going to sing a dirge for LJ.

    4. Re:Value of Livejournal - "Open Source Philosophy" by Antonymous+Flower · · Score: 1

      Good points, but I can't resist: A proponent of Livejournal named "DemonWeeping." How stereotypical :)

    5. Re:Value of Livejournal - "Open Source Philosophy" by DemonWeeping · · Score: 1

      You know, I was kinda waiting for that since I posted
      I was "Pyromancer" for a good long time, but gave it up on the 1st because I haven't done fire performing in some time. I picked up DemonWeeping because it was available in livejournal/AIM/Yahoo/gmail/MSN/Slashdot/.com all at once. It wasn't until after I deployed the site did I realize how gothy it sounded. How tragic.

      *stapling hand to forehead*

    6. Re:Value of Livejournal - "Open Source Philosophy" by ambienceman · · Score: 1

      Also, to the geeky people who don't have lady friends: You can potentially meet some great women on the site who are generally interested in your writings (personality) rather than simply looks. I met some cool people in general (and dated a girl I met, even though it didn't work out too well) using Livejournal. You can make it something worthwile to use. It's not only about the teenage females athough some of you may like that too.

    7. Re:Value of Livejournal - "Open Source Philosophy" by jfengel · · Score: 1

      The quizzes themselves I can live with. It's when they are accompanied by 1024-pixel-wide illustrations that make the rest of the page unreadable that I get irritated.

    8. Re:Value of Livejournal - "Open Source Philosophy" by TiggsPanther · · Score: 2, Insightful

      Oh yes. If I ever feel the need to post any of those quiz-things I make good use of the <lj-cut> tag. So if anyone on my Friends list (or a random person finding my Journal) doesn't want to see the results they don't have to.

      Actually one of the more useful LJ Features i know of is one that allows you to screen out images over a set size from your Friends list. So you need to view the entry in question to see the image, which is good for your bandwidth and/or narrow page layout.

      --
      Tiggs
      "120 chars should be enough for everyone..."
    9. Re:Value of Livejournal - "Open Source Philosophy" by jfengel · · Score: 1

      Firefox has a "remove this object" add-on that I use for the same purpose. Which is great, except when 20 friends all do it.

    10. Re:Value of Livejournal - "Open Source Philosophy" by Shadesong · · Score: 1

      A very simple "amen to that"!

    11. Re:Value of Livejournal - "Open Source Philosophy" by ChristianBaekkelund · · Score: 1

      I don't want my life Open Sourced. I don't want many people all starting to stick things into me to make me better. I liked my life closed sourced and with me as the only person with access to the codebase, thank you very much...

  101. Conspiracy by schnits0r · · Score: 1

    The livejournal servers are provided by Internap. Anyways, Bush just appointed Internap's CEO to his National Infrastructure Advisory Council (http://www.tmcnet.com/usubmit/2005/Jan/1104954.ht m), and i'm worndering, if this is some sort of terrorist attack, and then I thought, that is what they want you to think. Rather, it's jsut another step towards the republican squashing of the independant media. Perhaps, the republicans are following me. or you. They know that without live journal, the teenage adolecent girls will surely flock to forums, such as this one and post. The sheer amount of posts will crash another server, and therefore, we have a domino effect (also a technique used by the RIAA to crush peer to peer services, such as bit trorent, by causing more strain on a website not meant to handle it). So with livejounral going down like a Korean hooker, and Slashdot in hot pursuit due to the flock of teengirls, we will be unable to communicate our ideas. And without communication, the left wing majority of this website will be unable to unite and thus ensuring the republicans remain in power and control of the "free" world.

    1. Re:Conspiracy by LXE · · Score: 1

      Similar conspicacy theories circulate in Russia (blaming Putin).

  102. What! Katz is still here? by infonography · · Score: 1

    Thank the gods for user settings.

    --
    Sorry about the writing. Robot fingers, you know? Cliff Steele in DOOM PATROL #23
  103. Before you get all down on LJ... by Bloodlent · · Score: 4, Informative

    Just remember it's not ALL obnoxious, over-emotional teen-angst teenage girls. I use mine to showcase (non-depressing)poetry and make intelligent comments about intelligent topics. Basically, if someone makes an LJ about their own life, it sucks. If you can manage to write an LJ and make it about things that matter to more people than just you(ie, "Why Bush's Iraqi war is unjust" vs. "Why this babe I know should bang me"), and at the same time make it funny and enjoyable to read, then you have a good LJ. Most LJs DO suck, but there are some diamonds in the rough.

    1. Re:Before you get all down on LJ... by Anonymous Coward · · Score: 2, Interesting

      Agreed, there are plenty of useful uses for a LiveJournal. LiveJournals are also a great outlet for creative content. For example, my friend and I started a satirical blog on LiveJournal dubbed The Rhubarb. (Of course the link won't work until LJ is back up).

    2. Re:Before you get all down on LJ... by Anonymous Coward · · Score: 0

      *shrugs* The true advantage of LJ is the community feel. You write for the people on your friends' list, and you can expect them to be interested in what you post. Essays on the Iraqi war _and_ details of personal life can both be interesting, banal, fascinating, boring.

      That said, posts like "my life sucks :(", which generally get tonnes of comments, annoy me. But it doesn't hurt (much) to give someone that whiny and angsty a venue for people who care about him to see his thoughts.

      *bows*

  104. LiveJournal down? I better.. by Anonymous Coward · · Score: 0

    .. post this on my Live Journal blog... wait.. it's not loading?

  105. Blog, blog, blog, blog... by alienmole · · Score: 2, Funny

    Blog blog blog blog.
    Lovely blog!
    Wonderful blog!
    Blog blo-o-o-o-o-og blog blo-o-o-o-o-og blog.
    Lovely blog! Lovely blog!
    Lovely blog! Lovely blog!
    Lovely blog!
    Blog blog blog blog!

    -- The Viking Blog Song

    1. Re:Blog, blog, blog, blog... by ajlitt · · Score: 1

      It's bloooog! Bloooog!
      It's big, it's heavy, it's wood

      It's bloooog! Bloooog!
      It's better than bad, it's good

    2. Re:Blog, blog, blog, blog... by Anonymous Coward · · Score: 0

      But I *DON'T* like BLOGS!

  106. Who did it? by Megane · · Score: 1

    All right, who did it? Who pressed the shiiiny, candy-like history eras... I mean emergency stop button?

    --
    #naabhaprzrag, #sverubfr-000, #agi-fcbafberq, negvpyr[pynff*=' negvpyr-ary-'] { qvfcynl: abar !vzcbegnag; }
  107. Re:./ed !!!! Server Reboot Time? by TrevorB · · Score: 1

    ROFL on the hugs...

    That is so LJ.

  108. 503's of '05 by strredwolf · · Score: 1

    I'm getting those too now. And they're not all that nice.

    --

    --
    # Canmephians for a better Linux Kernel
    $Stalag99{"URL"}="http://stalag99.net";
    1. Re:503's of '05 by DaRiachu · · Score: 1

      One would think, that with the immense brainpower of the slashdot community, and the people themselves, they could come up with a more interesting, and perhaps amusing 503 page.

  109. Oh come ON by bigberk · · Score: 1

    For god sakes people, it's a Friday night! If Google went down I could see people panicking, but LiveJournal? Whatever... I'm going out.

  110. Slashdot blogging for a fix by Fractal+Dice · · Score: 1

    The last time I lost my journal, it was accompanied by a loud popping sound and the smell of ozone coming from the power supply of the linux box in the corner of my living room.

    As much as I like having an easy interface to the online writings of all my friends, I miss the days of having my own pet web server, of being able to do something myself other than twiddle my thumbs and wait when something breaks.

    (sorry ... I know ... random meanderings ... I feel this sudden urge to post a few memes, links to personality tests and a few "what happened at work today!" comments).

    1. Re:Slashdot blogging for a fix by Buran · · Score: 2, Interesting

      I know the feeling. I have an LJ (for friends to read) in which I relay news, ramble about things that interest me, and write mini-essays from time to time. I don't whine about my parents or people at school or whatever (well, I do, but it's grumbling about idiots at work, since I work at a university) and the people I know are generally much the same. But I can't stand those typically teen idiotic ramblings either.

      But I too find it irritating that a service I use, that is supposed to be backed up (my account was bouncing up and down numerous times in the past week, too). For a paid service, I'd have expected there to be a lot more backups to make it more difficult for power problems to wipe out the entire site. If the hosting facility doesn't have a UPS, why wasn't one installed?

  111. Not related to Six Apart by wersh · · Score: 3, Insightful

    From the article write-up (and reflecting the thoughts of quite a few of the comments I just read):

    Perhaps Six Apart wasn't quite prepared for the responsibilities of a website of this size?

    I'd love to know what makes you think this has anything to do with Six Apart. The very first line at http://www.livejournal.com states:

    Our data center (Internap, the same one we've been at for many years)...

    They've been with Internap for years, predating Six Apart's takeover. Unless LJ staff is lying, the fault here sounds like it lies entirely with Internap.

    And as far as I can tell, Six Apart didn't ditch the LJ team when they bought them out, so you probably have the exact same people working on bringing the site back up now as you would have if Six Apart had never got involved.

  112. Re:Internap is the best carrier I've ever dealt wi by Anonymous Coward · · Score: 0

    Slandered?

    Excuse me, but they LOST POWER TO CUSTOMER'S SERVERS. Oh, did I slander them by telling the truth?

  113. Re:./ed !!!! Server Reboot Time? by Anonymous Coward · · Score: 0

    Yeah, and I don't even have an LJ! Oh, the irony. I never really read much into the infrastructure behind LiveJournal - does anyone have details on it?

  114. Six-apart? What are you smoking?? by meshko · · Score: 1

    Dear editors, please don't let morons submit news. It's bad.
    LiveJournal has not changed any of its hosting or any other infrastructure yet. This has nothing, nothing, nothing to do with the recent acquisition.

    Sorry, I'm just cranky because of the withdrawal.

    --
    I passed the Turing test.
  115. NOTHING to do with Six Apart. by Anonymous Coward · · Score: 0

    This outage has nothing to do with Six Apart. Nothing. Nada. Zero.

    It has everything to do with Internap, where LiveJournal has been hosted for years.

    Something obviously fucked up at Internap's facility when they have two independent power sources plus backup and there's no power at all.

    As usual, you can thank Michael for his posting of braindead and incorrect opinion pieces submitted by readers. Were it not for his leaving in the baseless opinion of the reader, you'd be making your comments on facts.

    1. Re:NOTHING to do with Six Apart. by njyoder · · Score: 0

      That's not going to prevent paranoid morons from blaming it on Six Apart though...

  116. Irony. by Grendel+Drago · · Score: 1

    I just think it's ironic (this is irony, right? I remember that all the things in that Alanis song were in fact not irony) that Wikipedia has been experiencing some rather major server issues, recently resolved but not really explained to anyone outside of the server maintenance IRC channel.

    While it was down, the OpenFacts status page was the place for immediate info, but the log of activity was kept on the 'wikitech' account on, you guessed it, LiveJournal.

    Sweet, sweet irony.

    --grendel drago

    --
    Laws do not persuade just because they threaten. --Seneca
  117. What loc?? by gamekeeper · · Score: 1

    I am curious to know what location went down?What janitor/sanitation engineer plugged the Buffer into what wrong socket? What electrician was fired over this? Who maintains all of the back-up systems ? Who was too drunk to find the Actual Light Switch? Who is getting fired for this? Who are they hiring in his place? Do they need tallented replacements if so reply with your e-mail address and I will send you my resume, or if you need a good person and are not related to this please, still reply.. Last but not least, who got stuck in the elevator (doing what), and who got stuck in the bathroom and how they made it out.? If none of the above applies I am curious to know what the ROOT cause analysis is for this situation.. gk

  118. Re:Internap is the best carrier I've ever dealt wi by Anonymous Coward · · Score: 0

    they notify everyone to almost excessive levels, however. i'm not sure that downloading a config off a router via TFTP really necessitates notifying all their customers :) much, much better than most providers that don't notify you when they *are* doing something big, but it gets a bit like crying wolf eventually, and you learn to just quickly glance over anything from internap...

  119. Re:Internap is the best carrier I've ever dealt wi by Anonymous Coward · · Score: 0

    No, but I've seen plenty of people say they are a crap carrier, or shoddy, because of one incident. Work at an ISP, you'll learn, that despite your best efforts, s**t happens.

  120. New licensing scheme? by Grendel+Drago · · Score: 1

    Wait, what new licensing scheme? I didn't even think they'd rolled out the new lawyer-friendly TOS yet.

    --grendel drago

    --
    Laws do not persuade just because they threaten. --Seneca
    1. Re:New licensing scheme? by Anonymous Coward · · Score: 0

      The 'new' licensing scheme isn't really that new. Rather, there were major changes in the license when MovableType, Six Apart's flagship product, moved from version 2.x to version 3.x, which garnered them quite a bit of ill-will in the blogging community.

  121. Internap by missing000 · · Score: 1

    I use an ISP that peers with internap for upstream connectivity to it's tier 1 ip network.

    I noticed a few unreachable hosts earlier, as well as DNS delays. Didn't think much of it, but now those sites are all back up. They are all in the LA area, and I suspect you are as well.

    Is this the case? (I can't tell if it's LA or not from current traces, we appear to be using Level(3) to get there at the moment however)
    Also, what about redundant power? Internap is huge, they must have redundant systems. in place.

    1. Re:Internap by This+Is+Ridiculous · · Score: 1
      They're in Portland. And the Internap facility does have redundant power, but apparently it failed somehow, which is always lots of fun.

      Internap's power has failed before, but they thought it was a fluke:
      We're going to be buying a bunch of rack-mount UPS units on Monday so this doesn't happen again. In the past we've always trusted Internap's insanely redundant power and UPS systems, but now that this has happened to us twice, we realize the first time wasn't a total freak coincidence. C'est la vie.
      (See livejournal.com/powerloss/ for more info.)
      --
      Hey, you try to find an open nick these days!
    2. Re:Internap by missing000 · · Score: 1

      I don't see how a customer owned UPS will provide value. If the datacenter looses power, that's lights out on Internap's routers too.

      Unless they are jacked directly into a copper frame or a telco fiber dmarc, but that would be a very odd use of a datacenter.

      It's just really hard to see Internap having these issues. Most places regularly schedule intentional fail-overs to test systems. I work in the financial sector, and this kind of loss would be unimaginable at any of our (comparatively small) datacenters.

    3. Re:Internap by This+Is+Ridiculous · · Score: 1

      If nothing else, installing UPSes will allow them to shut down their systems cleanly, so they don't have to spend 12+ hours running integrity checks and replaying database logs.

      --
      Hey, you try to find an open nick these days!
  122. animated site status by golden+spud · · Score: 1
  123. Re:./ed !!!! Server Reboot Time? by pronobozo · · Score: 1

    "At this point all my whiteboards are full of boxes of each database cluster, the machines in that cluster, which have passed their checksum tests. (innodb checksums each 16k page), which replayed their replay/undo logs, where in binlogs each was writing/reading/executing etc..." But at least there is still time to read and post on slashdot. :-)

    --
    ------
    insert sig here,here, and here
  124. The Big Red Switch by rah1420 · · Score: 3, Funny

    Someone probably hit the big red switch on the wall, the one covered in a plastic case

    That does happen. I remember working at Purolator Courier's data center in NJ back in -- oh, geez, mid-80s some time. I was a third shift print operator, helped out with the mag tape library too. One night the trouble alarm went off on the fire suppression panel. We'd been having trouble with it all week, and the alarm guy was due in in the morning. One of the newbie operators -- the only one at the console at the time, the others being on a smoke break or asleep in the tape library -- panicked and went over to the annunciator panel. He opened it as I watched him from the console area. I think he thought the halon was about to dump because he reached around the panel and instead of hitting the halon dump abort, he hit the emergency power cutoff.

    BLAM! It was as if a firecracker went off as all the breakers tripped and the fans came to a sighing halt. Both on this floor -- the one with the console and the tape drives -- and the floor above, with the CPU and the disk farms. Dead as a doornail.

    Now, this was Purolator COURIER. We had AIRPLANES coming in to land at Indy center and as of this moment, no way to tell the crews which gate to go to, where to unload their stuff, or how to sort it.

    Not only that, but this was an IBM mainframe shop -- S/390, the Big Iron, with 3380 disk drives. You don't just flip the power switch back on. An emergency power cutoff blows breakers in the power supplies on those DASD strings. The IBM Field Engineer was duly dispatched and arrived with cases of breakers the next morning. But we were still dark when I got off shift the following morning.

    The next night a brand new plexiglass cover was mounted over the Big Red Switch.

    --
    Mit der Dummheit kämpfen Götter selbst vergebens.
    1. Re:The Big Red Switch by Alrescha · · Score: 1

      " S/390, the Big Iron, with 3380 disk drives. You don't just flip the power switch back on. An emergency power cutoff blows breakers in the power supplies on those DASD strings."

      Maybe in *yours* fella, but this was not a general feature of IBM 3380s (single, double, or triple-density versions).

      No device really likes a sudden loss of power, and sometimes you might trip a breaker. I don't think I've ever had to replace a 3380 breaker.

      A.

      --
      ...bringing you cynical quips since 1998
    2. Re:The Big Red Switch by rah1420 · · Score: 1

      Well, that's what the guys told me. (shrug) Hey, I was a print operator. What do I know?

      All I remember was FEs standing around open DASD cabinets playing with power supplies.

      --
      Mit der Dummheit kämpfen Götter selbst vergebens.
  125. First hand account by Anonymous Coward · · Score: 0

    The company I work for also colocates at Internap, and I just got back from their data center after bringing our own servers back up. I was talking to the techs and they said it was a cascade failure that started outside the building and took out a UPS. There were so many people there bringing their servers back to life that there was an hour wait for crash carts (:

    Internap is supposed to be one of the best in this area also, they certainly are the most expensive.

  126. bigger explination by moosesocks · · Score: 4, Insightful

    I'm surprised to see that Internap's main servers are back up. It's pretty irresponsible to bring up your corporate servers before those of your clients.

    That being said, LJ's servers are back up now, but they're making sure that the databases are all in sync -- LiveJournal has one of the most massive distributed MySQL clusters in existance along with a complete caching system.

    They need to make sure that the database is all synchronized before bringing it back up -- chances are they're going to rebuild the cache too. If they didn't, the initial strain on the DB servers would probably bring the site down again.

    This does however, bring up some questions about LiveJournal's network infrastructure. Danga (the creaters of LJ, recently purchased by Six Apart) are heavy users of Perl and MySQL. Needless to say, they have made numerous contributions to both projects and have developed an innovative memory caching system for linux.

    The questions raised however, come from Perl and MySQL. Both are questionable in terms of scalability. Although I'm not qualified to comment on this, I belive that the general concensus is that MySQL is one of the least efficent databases today. Livejournal has 100+ servers. I honestly don't think that a system the size of LiveJournal should require a server cluster that big. It seems that they are trying to solve their performance/reliability problems by blindly throwing hardware at it.

    Of course, I love livejournal. It's simple, easy to use, and is a great tool for building communities. Just as it is simple, it can also be incredibly nerdy (there's actually a command prompt!). They're also completely open source.

    Hopefully, Six Apart can make their network infrastructure more 'professional' while still maintianing the community spirit that has made it so successful.

    --
    -- If you try to fail and succeed, which have you done? - Uli's moose
    1. Re:bigger explination by hwyguy2 · · Score: 1

      Actually, if I recall Brad's recent articles (which, alas, are up on LiveJournal), they've moved away from MySQL. If you read earlier, Brad noted they've moved to InnoDB. He also noted "We have clusters of 2 identical databases in separate cabinets, separate switches, separate Internap power feeds... so normally losing one database in each cluster doesn't matter: the other one gets used. But when we lose every single database, in all clusters, all at once... that's the time to be paranoid and double check stuff."

      I"ve been very impressed with their network infrastructure. In fact, I'm so impressed I (ummm) study it quite a bit every day.

      D (cahwyguy.livejournal.com)

    2. Re:bigger explination by andfarm · · Score: 2, Informative

      InnoDB *is* MySQL.

      --

      TANSTAAFI: There Ain't No Such Thing As A Free iPod.

    3. Re:bigger explination by hwyguy2 · · Score: 1

      Oh. Nevermind.

      Pay no attention to that man behind the curtain

      (Quickly he scurries off to bed. It's been a long day)

    4. Re:bigger explination by Doppleganger · · Score: 1

      It's pretty irresponsible to bring up your corporate servers before those of your clients.

      1) It's quite possible that the corporate servers didn't need special attention to bring up... or, didn't need large amounts of attention.

      2) Where are customers going to go for updates, tickets, etc? Most likely, the corporate servers. Given the choice between the phone ringing off the hook while trying to concentrate on disaster recovery and having other forms of communication flashing around on the network, I'd pick the second any time. Unless there are large numbers of people sitting around to answer phones who would be absolutely useless in the recovery efforts, it's far more effecient to rely on text communication (and much easier to sort the useful people out from those who just want an ETA every five minutes). There's nothing worse for productivity than being forced to stop every five minutes to spend ten minutes convincing someone that you're working on problems as quickly as possible (and no, there isn't an eta, because it's likely something else is broken that won't be visible until the current thing you're working on is fixed..)

      Oh, and I love how you say you're not qualified to comment, and then proceed anyways...

    5. Re:bigger explination by Kyrrin · · Score: 4, Insightful

      As we've said a bunch of times in the past, moving away from MySQL would be prohibitive. By now we know how to make it work for us; switching away from MySQL would not only involve massive rewriting of stuff and alterations on the existing DB, it'd take the next five years before we got as comfortable with the flaws and advantages of another DB package.

      Sure, MySQL has its flaws -- some of them pretty big -- but we can work around them.

      As for the "not needing a server cluster that big" -- do you have any clue how much data we push in an average day? We maintain so many DB clusters to improve reliability, and we maintain so many web nodes because we push a screaming shitload of traffic.

    6. Re:bigger explination by Anonymous Coward · · Score: 0

      No, how much data do you push? A few hundred megabits IIRC, but that doesn't say much.

      How many pageviews? How many entries posted/updated?

    7. Re:bigger explination by mikeplokta · · Score: 1

      The website I manage is one of the busiest UK websites, but our traffic is a fraction of LJ's. And we have forty-two web servers, handling around 170 million pages per month. I would be surprised if LJ didn't have hundreds of servers.

      Of course, we also have three separate colocated data centres, with data replicated between sites, so if one centre loses power, blows up or is eaten by mighty Cthulhu, we can stay up.

    8. Re:bigger explination by elemental23 · · Score: 1

      I'm surprised to see that Internap's main servers are back up. It's pretty irresponsible to bring up your corporate servers before those of your clients.

      Not really. Internap has multiple data centers, so it's likely that the two sites are hosted in two different locations. LJ is based in Seattle, so I assume their servers are hosted there, while Internap's corporate HQ is in Atlanta, where they have another facility.

      Compare traceroutes to www.livejournal.com and www.internap.com. They're on opposite ends of the country.

      --
      I like my women like my coffee... pale and bitter.
    9. Re:bigger explination by Bluecoat93 · · Score: 1

      The questions raised however, come from Perl and MySQL. Both are questionable in terms of scalability. Although I'm not qualified to comment on this, I belive that the general concensus is that MySQL is one of the least efficent databases today.

      Well, you certainly proved your statement that you're not qualified to comment on this. Why should that stop you, though?

      You do realize that Slashdot is built on MySQL/Perl (well, DB2 now for political reasons, but MySQL for a very long time), as is Movable Type and Typepad from Six Apart? I'm not sure how a network infrastructure can be "professional", or how LiveJournal's architecture doesn't qualify. Then again, I'm not sure how you would be qualified to just what is and is not "professional".

    10. Re:bigger explination by IO+ERROR · · Score: 1
      I belive that the general concensus is that MySQL is one of the least efficent databases today.

      And as I discovered this morning, Google uses it. I clicked on a link in an email message sent to me, and...

      ERROR OCCURED
      Description: Error connecting to MySQL server: #1045 - Access denied for user: 'root@localhost' (Using password: YES)
      Module: /mailer/link.php
      Time occured: Sat, 15 Jan 2005 07:42:27 -0800

      So they use it for Gmail, but I don't know offhand if MySQL is what they're using for the index. To their credit, they are using PHP rather than Perl.

      --
      How am I supposed to fit a pithy, relevant quote into 120 characters?
    11. Re:bigger explination by silpol · · Score: 1
      your comment sounds like LJ admins are actually hostages of poor, eclectic design of LJ architecture, don't you think so? the fact thatr LJ needs manual intervention after power loss like this means that noone ever had been thinking on design, rather using duct tape to close coimng gaps in performance.

      and, yes, having all data in one datacenter - it is major design flaw.

      --
      this field has been intentionally left blank ;)
    12. Re:bigger explination by tf23 · · Score: 1

      do you have any clue how much data we push in an average day?

      No idea. Please tell us.

      How many mysql clusters do you have? I'm assuming one cluster is the db-writer, and there'd be a few read-only's slaving from that master?

      How many machines are running memcached?

      How many webheads? Are you using NFS for file dispersal to all the webheads ala slashdot?

    13. Re:bigger explination by Anonymous Coward · · Score: 0

      I recently read something that the average data throughput at LJ averaged around 450 Mbps, and averaged some order of magnitude more pageviews/unit time than /.

    14. Re:bigger explination by AllUsernamesAreGone · · Score: 1

      your comment sounds like LJ admins are actually hostages of poor, eclectic design of LJ architecture, don't you think so?
      So unlike slashdot of course. And pretty much every other major system I've run into over the past 10 years. Every one of them paragons of design, capable of changing a major backend component without a single problem.

      the fact thatr LJ needs manual intervention after power loss like this means that noone ever had been thinking on design
      Have you actually ever maintained a significant, database-backed system and seen what happens when some twat of a contractor cuts through a power cable or something kills the power to everything on the spot? That is not something you want to power back up and let users right back onto it until you have made damn sure the system, especially the database, is sane - and I've seen even oracle boxes screw up after unexpected power loss, let alone postgres or mysql ones. Without a UPS and enough time to cleanly shut down - things you'd expect from a datacentre - any complex system is going to need checking before being put back in production otherwise any number of problems could crop up.

    15. Re:bigger explination by crschmidt · · Score: 1

      300 Mbps, 2000 dynamic page views per second, 300 journal updates a minute, probably 10 times that in comments posted.

      That's at a slow time, of course, but that's the numbers that I pull from on-site statistics and slides from Brad's various talks.

      --
      -- Christopher Schmidt YouTube Quality of Experience
    16. Re:bigger explination by crschmidt · · Score: 1

      My cluster knowledge has gone out of date after the recent switch to much of the data being switched to 64 bit hardware, but here's a guess, which is probably low: 8 user clusters, each with a "Master" and 2 slaves. Most machines running dual masters in case one fails. Every machine runs memcached in its spare ram: 6 months ago, this amounted to 40 gig, but now it's way bigger, although I don't have estimates. About 60 diskless web slaves. 40 backend database server. Approximately 300-400 MBps, although ,that's again, a guess. I don't work there: these numbers are relatively well informed guesses, but may be off. Do not take my word as gospel: I am an uninformed user.

      --
      -- Christopher Schmidt YouTube Quality of Experience
    17. Re:bigger explination by crschmidt · · Score: 1

      LiveJournal The Company is based in Portland, (although I think the office address is still a PO Box in Beaverton) and will soon be moving to San Francisco. However, their data is indeed at the Seattle center.

      --
      -- Christopher Schmidt YouTube Quality of Experience
    18. Re:bigger explination by elemental23 · · Score: 1

      Seattle, Portland... I knew it was somewhere up there. It's all Pacific Northwest to me. I think I knew it was Portland, but had forgotten or gotten the two cities confused somewhow.

      I wonder if they're going to relocate their server farm down here in SF or just manage everything from here. Either way sounds like a major project. I assume they'll move everything to a SF datacenter piecemeal over time. That's not a operation I'd look forward to.

      On the other hand, I wonder if they'll be hiring...

      --
      I like my women like my coffee... pale and bitter.
    19. Re:bigger explination by crschmidt · · Score: 1

      I doubt that they'll move, but that's personal opinion only. They've already got two remote sysadmins in Seattle: Barring another incident like this, there's not really much that can't be done remotely, I don't think.

      --
      -- Christopher Schmidt YouTube Quality of Experience
    20. Re:bigger explination by Anonymous Coward · · Score: 0

      Yes. As the other person noted, InnoDB is MySQL. The default type of table MySQL uses is very fast, but is not ACID, and it's a bit of a stretch to call it a relational database. The InnoDB table type is slow, but it has transaction, foreign keys, and all that good stuff.

      I'm surprised they've gone this long without making the jump. If you've ever used a real relational database, using the default MySQL tables can be very frustrating.

    21. Re:bigger explination by Hartree · · Score: 1

      Been there. When you've got years of history with a system, it's awfully tough to move away from it.

      That said, (and noting that my DB experience is with other systems than MySQL), given that innodb supports replication would it be possible to distribute your servers over a couple of different colos? I know that you'd have a lot of update data passing between them. It might well be expensive, but this sort of outage is expensive too now that you're a commercial service. There's also a mention in the updates that this sort of thing has happened before.

      A lot of sites do multi-location DB systems, so it can be done (I just don't know if it's feasible for you guys.)

      In any case, good luck from an LJ user and a veteran of some ugly multiday sleep deprived data center recoveries. I feel your pain.

  127. Perhaps... by brutus_007 · · Score: 0

    LiveJournal should offer those who are willing to put in some sweat equity, charging UPSes or powering servers by pedaling/running in a hamster wheel/other old school electricity generation technique, the chance to update their journals, so they can let the world know how they felt about not being able to access livejournal... to be read once the site comes up.

    Colour me redundant, but there must be at LEAST a hundred or so scremo/emo/angst filled poetry, song and interpretative dance about not being able to let the world know how you feel about livejournal being down.

    --
    I have 1 million monkeys on a million year contract to make me a better sig.
  128. Somewhere... by joeszilagyi · · Score: 1

    A cam whore gently weeps.

    --
    Dude, where's my packet?
  129. A giant Molly-Guard? by Anonymous Coward · · Score: 0

    "The next night a brand new plexiglass cover was mounted over the Big Red Switch."

    Hehehe, it's not just 3 year old girls named Molly that hit the Big Red Switches by mistake.

  130. Thank God! by Okthnxbye · · Score: 1

    Finally a break from the never ending angsty teenage bullshit.

    --
    This space is powered by Google Ad-nauseam.
  131. Looking forward to the post-recovery analysis by davidwr · · Score: 1

    I hope they do a look-back analysis on this and publish the results.

    It will be interesting to see what caused the failure, what they didn't do that could've mitigated the failure, and whether such mitigation makes economic sense.

    Should make interesting reading.

    --
    Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
  132. SPOF by Anonymous Coward · · Score: 0

    Single point of failure.

    The systems all pass through a single cable at some point. That means at your UPS or later.

    If a circuit at the UPS goes out, or god forbid they decide to test the UPS and it fails the test, everything goes down.

    It isn't impossible. It does happen.

  133. Opportunistic by Anonymous Coward · · Score: 0

    Earlier status.livejournal.com pages have been mostly (or completely) plain text. Now that /. has linked to it? What's that I see? A banner ad for warped.com, the ones who host the status.livejournal page? Could they be trying to say they don't suck like Internap, so pretty please buy services from them? I love it.

    1. Re:Opportunistic by brutus_007 · · Score: 0

      Uhmmm, Warped has *always* had, at the very least, a logo on that page. Don't believe it? Try The Wayback Machine, going as far back as it can, back to September 23rd, 2001.

      Granted, the banner is bigger now, but that may be traffic driven (over x MB/month(day?) = larger banner) since it's a page that isn't used often. Then again, Warped.com may just be saying "We're not teh sux0rz".

      --
      I have 1 million monkeys on a million year contract to make me a better sig.
  134. This will be bad PR... by Sparks23 · · Score: 2

    There were already lots of LiveJournal users who were upset and confused and unhappy with the idea that LJ and Danga (the company which made LJ) had been bought by SixApart. No doubt, as there have been no downtimes of this magnitude at LJ before, doomsayers will be claiming that it's SixApart's fault.

    Never mind common sense; it won't matter that if SixApart can be held responsible for failures at InterNAP's colocation facilities, they're a much bigger -- and more powerful -- company than most people have ever given them credit for...

    --
    --Rachel
    1. Re:This will be bad PR... by brutus_007 · · Score: 0

      Of course, the sad part is that with the latest update, stating they'll be buying rack-mounted UPSes for their systems (that's a bunch of them for 100+ servers), certainly aided by the finances the SixApart deal provides, but most LJ users won't see it that way... or at all.

      --
      I have 1 million monkeys on a million year contract to make me a better sig.
  135. Re:Look at me! Look at me! by taernim · · Score: 1

    hey, you sound cool. can I add you to my journal? kthx. ;-)

    --
    "PC Load Letter? What the $@#% does that mean?!"
  136. Slashdot Article by tonsofpcs · · Score: 1

    This makes me wonder if we will see a slashdot article if slashdot ever goes down for more than 2 mintes.

  137. Update 2 by KinkifyTheNation · · Score: 2, Informative

    Update #2, 10:11 pm: So far so good. Things are checking out, but we're being paranoid. A few annoying issues, but nothing that's not fixable. We're going to be buying a bunch of rack-mount UPS units on Monday so this doesn't happen again. In the past we've always trusted Internap's insanely redundant power and UPS systems, but now that this has happened to us twice, we realize the first time wasn't a total freak coincidence. C'est la vie.

  138. The cause of the outage? by supersat · · Score: 2, Informative

    According to some LiveJournal employees, a massive UPS exploded. From IRC:

    <rahaeli> As far as we can tell, a UPS exploded.

    Their site now says that they're buying their own UPSes, because this is the second time that the entire data center has lost power. Details on the first outage can be found here (a Google cache since LJ is down).

    For the paranoid: This has nothing to do Six Apart buying LJ. They're still in the same "world-class" data center they've been in for years.

  139. Re:./ed !!!! Server Reboot Time? by cHiphead · · Score: 3, Funny

    you want beer and pizza? email me an address/zipcode at the sig email and ill do my part to support restoring lj.

    if my wife cant post this weekend, im gonna hear about it. and not even be able to post my lj about getting yelled it about lj being down as if i caused the power outage myself. ;)

    not really.

    well maybe.

    Cheers.

    --

    This is my sig. There are many like it, but this one is mine.
  140. Shutting down the protection grid? by Anonymous Coward · · Score: 0

    "He wants to shut down the LiveJournal grid, Peter."

    "You shut that thing down and we are not going to be held responsible for whatever happens."

    1. Re:Shutting down the protection grid? by locokamil · · Score: 1

      Are they saying that someone deliberately sabotaged LJ?

    2. Re:Shutting down the protection grid? by Anonymous Coward · · Score: 0

      Gah, no. Just some Ghostbusters humour.

  141. Re:./ed !!!! Server Reboot Time? by Anonymous Coward · · Score: 0

    Hey, thanks for all you're doing.

    I might be impatient for lj to get back up, but I'd rather it be up and running right then it all being screwed up because you didn't wait and check everything.

  142. It's strange by AndroidCat · · Score: 3, Funny

    Remember when teenagers were happy when people couldn't read all the personal details in their diary?

    --
    One line blog. I hear that they're called Twitters now.
    1. Re:It's strange by Anonymous Coward · · Score: 0

      I hope the parent gets modded up. (Although this AC reply doesn't help much.)

      This is a huge social change, one of those total turnarounds in the big zig-zag movement forward... but the discussions are all about interfaces and the web technology and other completely trivial details.

      What happened? What contributed to it? Don't these kids want or believe in privacy any more? Is this bad or good? Need to put on ol' thinking cap... or better, maybe just go and ask around. :)

    2. Re:It's strange by mdwh2 · · Score: 1

      Wanting privacy hasn't really changed though. Some people I know with a LiveJournal still do private journalling, either with private entries on LiveJournal, or something offline. There's no reason to suspect that the number of people keeping private journals is less than in the past.

      Also there are different levels of privacy - many people have some entries set so only people they trust can read them, and some people do their entire journal this way.

      Discussing what you did, or your feelings, no matter how personal, to your friends is nothing new. What's new is the way that places like LiveJournal allow this to be done in a far more efficient way, and on a greater scale.

  143. Re:./ed !!!! Server Reboot Time? by Anonymous Coward · · Score: 0

    Haha, yeah right. Using mysql, you're probably fucked. Switch to a real database - Postgresql.

  144. what Obi Wan would say if he were here by insomnyuk · · Score: 2, Funny

    "I have felt a great disturbance in the force; as if a million voices suddenly cried out in terror."

    Those poor, poor children.

  145. Kremlin shuts down Live Journal by goga_russian · · Score: 0

    there is an article on few russian sites stating that the reason it was shut down because of russian 'separatist' student group that wanted to get free transportation and assiciated with anti-Putin statements. can anyone find that in english?

    --
    Dont Judge The situation by the Misfortunate. Goga.
  146. russia in shadows by schweller · · Score: 2, Interesting

    i won't exaggerate if i tell that in recent years most of "social life" in .ru zone moved to livejournal. it's 10 a.m. in russia now, and most of russian lj-addicts still don't know about apocalypse in lj. i hope everything will be turned up in the nearest future. brad, we believe in you! :)

    1. Re:russia in shadows by Yurka · · Score: 1

      True. However, most of the people use it from work (as do we with /.), and on weekends the volume falls aproximately fivefold.

      --
      I can assure you, the best way to get rid of dragons is to have one of your own.
  147. I have many servers at that Internap site by Dejohn · · Score: 1

    It was down for about a half hour, maybe a little longer. Most obnoxious for the colo facility that *is never supposed to go down*

  148. Re:./ed !!!! Server Reboot Time? by MarkRose · · Score: 1

    If you look at his userpage here, you'll see he only posts a couple times a year.

    --
    Be relentless!
  149. btw, by schweller · · Score: 1

    there was info in some russian online media, that this turnout was organized by russian officials who thacked down opposition in internet. conspirology rules :)))))

  150. Or maybe a giant Dee-Dee guard? by MsGeek · · Score: 1

    Or, as in the immortal cartoon "Dexter's Laboratory"...

    Dee-Dee: OOOOOH! What does THIS button do?!?!?!?

    Dexter: GET OUT AUF MY LAH-BOR-AH-TOR-EEE!!!!!

    --
    Knowledge is power. Knowledge shared is power multiplied.
  151. Live Jounal? Hah... by wcdw · · Score: 1

    Authorize.net (a fairly popular credit card gateway) is also an Internap client - I wonder how many sites (like ours) potentially lost revenue as a result of this outage.

    http://www.theboyz.biz/

    --
    If you're not living on the edge, you're just taking up space!
  152. Re:./ed !!!! Server Reboot Time? by nike2422 · · Score: 1

    What's so unusual about Brad posting here, I thought everyone posted here.

    --
    What Would Scooby Do?
  153. YOU ARE RITE by Anonymous Coward · · Score: 0

    i rly do think so this isnt a troll

  154. Re:./ed !!!! Server Reboot Time? by TrevorB · · Score: 1

    Sure, go to www.livejournal.com when it's back up. It's fairly self explanitory.

  155. Re:Conspiracy - I don't think so, there are by Anonymous Coward · · Score: 0

    There are literally tons of conservative communities at LJ, it's one of the bigger conservative communities out there.

  156. I call bull on all this by Anonymous Coward · · Score: 3, Insightful
    There seems to be a lot of latent hostility towards teenage girls. WTF? Your outlet is geeking out on Slashdot. Theirs is LJ. And how do you all know so much about the content of LJ anyway?

    1. Re:I call bull on all this by Anonymous Coward · · Score: 0

      Exactly... There seems to be a bunch of twenty-something geeks on here, continually posting about those who use a different service. Of course, never mind that many of the offending posters are probably paying good money to watch teenage girls in another window; That would be beside the point...

    2. Re:I call bull on all this by inlandsis · · Score: 1

      Oh, thanks for this post. I think people here read a couple of very nutty journals and made hasty generalisations. True, there is an awful load of stupid journals, but there is also many extremely brainy and coherent users.

    3. Re:I call bull on all this by Crackferret · · Score: 1

      The obvious conclusion is both Livejournal and Slashdot would have much more value if they included webcam functionality, preferably of teenage girls.

  157. Bringing servers back is hard why? by satch89450 · · Score: 1

    When $DAYJOB had a present from Sierra Pacific Power of a two-hour blackout, and we discovered there were major problems with our generator, the poor APC UPS batteries weren't able to hold up the 150 servers I run.

    When the power came back on, we had 143 servers back on-line in ten minutes. We had 149 on line in fifteen minutes. We had two servers (leased dedicateds) that requires some file system repairs before they would come back on-line, but that task was finished 30 minutes after power restoration.

    What's so hard about that?

    (With the addition of a three-phase power transformer, our generator is working properly.)

    Customers kept calling asking us why it took us so long!

    1. Re:Bringing servers back is hard why? by Lew+Payne · · Score: 2, Funny

      | ...the poor APC UPS batteries weren't able to hold up the 150 servers I run.
      |
      | When the power came back on, we had 143 servers back on-line in ten minutes.
      | We had 149 on line in fifteen minutes. We had two servers (leased dedicateds)
      | that requires some file system repairs before they would come back on-line, but
      | that task was finished 30 minutes after power restoration.
      |
      | What's so hard about that?

      What's so hard about that? Well... not everyone who has 150 servers can get 151 of them back online in 30 minutes.

  158. [OT] Your sig by ibentmywookie · · Score: 1

    Damn you fuckers move quick.

    --
    -- The doctor said I wouldn't get so many nose bleeds if I just kept my finger out of there!
  159. [OT] Sig by HeghmoH · · Score: 1

    Given the fact that a pyramid scheme is guaranteed to leave the vast majority of the people who get sucked into it with absolutely nothing, do you actually expect you have a good chance to get your free Mac Mini? What makes you luckier than the next guy?

    --
    Mod down posts with a "Free Mac Mini/iPod" sig, they're spam!
  160. This happened to me a while ago... by edunbar93 · · Score: 1

    I mean having the uber-redundant, diesel-powered backup power in the server room fail.

    Except the power didn't fail outside the server room, just inside it. There was a faulty breaker that died unexpectedly. Now, we had 100+ servers go down as a result of that, but we were pissed just the same, right along with about 20 other companies.

    I'd post a link to the livejournal entry about the incident, but...

    --
    "No problem. I have the capacity to do infinite work so long as you don't mind that my quality approaches zero."-Dilbert
  161. Re:./ed !!!! Server Reboot Time? by Tet · · Score: 1
    We have clusters of 2 identical databases in separate cabinets, separate switches, separate Internap power feeds...

    What bothers me is that you don't have separate data centres. I run a reasonably large web site, but it's nowhere near the size of LJ. Yet we have multiple geographic sites, so even if the (N+1) power fails completely in one hosting centre, we're only down on capacity, not out completely. I can't believe a site the size of LJ doesn't do the same...

    --
    "The invisible and the non-existent look very much alike." -- Delos B. McKown
  162. Re:Bush supporter too dumb to understand datacente by RollingThunder · · Score: 2, Interesting

    Unless it means that the "cheapskate blog admins" were too cheapskate to buy proper dual-power supply boxes so that they can have dual power paths right to the servers.

    You can have all the great redundant mains and backups you want, and it's for shit if you only have one power line to the system and that power bus loses juice.

  163. Re:Look at me! Look at me! by bronaugh · · Score: 1

    Yeah yeah, it's funny and all, but it's pretty fucking uncool on a number of levels. People cutting themselves is really bad news; please don't make fun of it.

  164. Is it going to ever come back up? by Anonymous Coward · · Score: 0

    I've been waiting almost twelve hours to change one freaking word of the post, I added to my journal right before it went down. One f'ing word is all I want to change in an otherwise wonderful treatise, but because of the depth of the subject, the lesser word might detract. Is the damn thing ever going to come back? I'd like to go to bed!

    LJ User for all of 45 days...

  165. Re:./ed !!!! Server Reboot Time? by Anonymous Coward · · Score: 0

    Ever heard of "dynamic content"?

    Ever heard of "high update to read ratio"?

  166. Um... No? by Pliny · · Score: 1

    Do not pass go, do not collect $200.

    Look, Perl rubs me the wrong way. I loathe it, and it makes me wanna hurl. More than that - it's Postgres that rocks my DB world. But personally, I think I'd at least read up on LJ's infrastructure before bashing it.

    I mean they've got what? 2.5 million active users?

    And how many hits are DB-backed?

    Sweet fuck, man. How many servers do you think they're wasting? Assuming no redundancy (ha!), right now they're sitting at an approximate ratio of about 25,000 users per server! What morons they must be to not be squeezing more out of them. (And yes I know that I'm way oversimplifing, but... really?)

    --
    What does this button d$#%* NO CARRIER
    1. Re:Um... No? by EvilStein · · Score: 1

      Hrm. Dunno,actually. The company I used to work for hosted (incubated) a very active social networking site, hi5.com, which has several million users already. it's a Postgres site, pushes an absolute shitload of traffic, and is operating with WAAAAAAAAAY less than 100+ servers.

      I guess it's just how you set it all up. *shrug*

    2. Re:Um... No? by MarkRose · · Score: 1

      Just about every single hit is DB backed. At peak times, they get over 1000 hits per second. Read about the infrastructure. It's interesting. http://www.danga.com/words/2004_oscon/oscon2004.pd f.

      --
      Be relentless!
    3. Re:Um... No? by silpol · · Score: 1

      well, infrastructure docs are way outdated, and they do NOT have documenting sync often ;)

      --
      this field has been intentionally left blank ;)
    4. Re:Um... No? by Anonymous Coward · · Score: 0

      Several million active users of LiveJournal = several million people posting crap into blogs, many of them several times per day. Add to that all the anonymous visitors like me surfing through journals of people they've never met.

      Several million users of hi5 = several million people fill out a web form once, most likely because their most annoying friend sent them an email demanding that they do so.

      Basically you're talking about something that, although it obviously impressed you, is very much small potatoes on the web, and certainly not on the same scale as LiveJournal.

    5. Re:Um... No? by EvilStein · · Score: 1

      Well, AC, you're completely wrong. It's not "filling out a web form once" - it has an entire blog system, photo uploads, messaging system... other stuff as well.

      It's definately not "small potatoes on the web" - such a statement is pretty damn stupid.

  167. Re:./ed !!!! Server Reboot Time? by c_g_hills · · Score: 1

    Hopefully as lambda switching becomes more common, it will be perfectly feasible to run a SAN spreading across 2 or more datacentres.

  168. New Update! by Anonymous Coward · · Score: 0

    Update #3: 2:42 am: We're starting to get tired, but things are almost done. Unfortunately a couple machines had lying hardware that didn't commit to disk when asked, so InnoDB's durability wasn't so durable (though no fault of InnoDB). We restored those machines from a recent backup and are replaying the binlogs (database changes) from the point of backup to present. That will take a couple hours to run. We'll also be replacing that hardware very shortly, or at least seeing if we can find/fix the reason it misbehaved. The four of us have been at this almost 12 hours, so we're going to take a bit of a break while the binlogs replay... Again, our apologies for the downtime. This has definitely been an experience.

    I'm replying to myself and going to bed

  169. Makes me wanna laugh by Jesus+IS+the+Devil · · Score: 2, Interesting

    It's funny how I was just met with some Internap sales people a few months ago. They were bragging about how their network infrastructure was superior to most others, since it intelligently routes traffic to the path of shortest response (not hops).

    They even bragged to me how their network uptime SLA is 100%! I mean good god, now I find out this is the SECOND time it's happened (from the livejournal update site)???

    I'm glad I didn't go with them...

    --

    eTrade SUCKS
    1. Re:Makes me wanna laugh by Anonymous Coward · · Score: 0

      Internap is a bunch of packet punters. They have no network infrastructure. They set up a POP and get large pipes with BGP feeds from several top ISPs. Then they perform their 'supah sekret bullshit path optimization procees' on the various pipes. Certain routes that they deem better performing than what normal BGP judges are used in place of the normal BGP routes. Without even delving into the merits of if the 'supah sekret bullshit path optimization process' even works, there is a subtle and non obvious caveat if it does work as advertised. It only works for transmitted traffic; which also guarantees you have a different path on the recieve because the rest of the Internet is using the normal BGP routes.

  170. Like, what's wrong with you, people? by 21mhz · · Score: 2, Interesting

    The comments seem to be full of contempt for teenage -angst inane ramblings that are common on LJ. Come on. It's not like you are forced to read through this stuff.
    I have a few "friends" there at LJ, some of them net.celebs, and I like their posts. It's the matter of whose writings do you find interesting, and you are free to be completely unaware of the rest. Why all the vitriol?

    --
    My exception safety is -fno-exceptions.
    1. Re:Like, what's wrong with you, people? by Anonymous Coward · · Score: 0

      Because some on here who really just need to change their Kotex but decide to inflict themselves upon us and act like whiney little bitches who need to take it up the ass more often.

    2. Re:Like, what's wrong with you, people? by Anonymous Coward · · Score: 0

      The little boys are afraid of girls...

  171. Re:Live Jounal? Hah... by KeLp · · Score: 1

    Thats why you have redundancy in your payment gateways. Use two or more. Use anet and plugnpay.

    Anet was hosed a few months ago due to DoS attacks. But all was good because we had a backup provider.

  172. Re:Look at me! Look at me! by Anonymous Coward · · Score: 0

    Speaking as a former teenage self-harmer, they only do it because they take it seriously and believe others will. Less therapy and more perspective would have helped me loads, even if some of that perspective came from strangers' shitty jokes :)

  173. obligatory... by slappyjack · · Score: 1

    In South Korea, only Old People blog on LiveJournal

  174. Almost appropriate.. by EvilStein · · Score: 1

    http://www.cafepress.com/blogwhine

    Can't whine about it in your blog if you blog isn't there for you to whine about it in.

    I can just imagine the huge pile of traffic that LiveJournal is going to get hit with once everything *does* come back up online.
    Hrm. Ss there any way that they can blame this on Microsoft? :)

  175. disk writes by Spazmania · · Score: 1

    Unfortunately a couple machines had lying hardware that didn't commit to disk when asked, so InnoDB's durability wasn't so durable (though no fault of InnoDB).

    Um, yeah. That happens when you configure the raid cards for write-back instead of write-through but forget to buy the cards with batteries.

    --
    Moderating "-1, Disagree" is simple censorship. Have the guts to post your opinion.
    1. Re:disk writes by TheSunborn · · Score: 1

      Or when you buy (ide) harddisks and don't configure them to flush write cache before they acknowledge write. The raidcard might flush data to the harddisk cache insted of to the harddisk

  176. Touché by WebCrapper · · Score: 1

    I have more arguments, but I'll let it drop. No sense in arguing over something that neither of us can prove ;-)

    I'll admit though, LJ is a major undertaking and they have produced some nice code for the community at large to use.

    1. Re:Touché by Kiryat+Malachi · · Score: 1

      Last I knew, LJ was handling ~2 posts per second on average, and who knows how many more lookups into their DBs, plus the more generally intensive nature of their site (images, lots more distinct entries than Slashdot, so caching into memory is somewhat less effective).

      I honestly don't think the traffic is even comparable.

      --

      ---
      Mod me down, you fucking twits. Go ahead. I dare you.
      (I read with sigs off.)
  177. Internap Sucks by Nurgled · · Score: 3, Interesting

    I seem to remember that a few years back they had a similar problem (Internap lost all power) and it turned out that some idiot had hit the big red "shut down all power to the entire datacenter" emergency button. This isn't the first time this has happened, and last time it wasn't under Six Apart's management.

    I'd say it's Internap's incompetence that caused this problem. If they can't keep their datacenter running even though they have multiple redundant power supplies then something is very wrong. I see from the outage page that LJ people are now planning to buy their own UPS so that they don't have to trust Internap anymore.

    For power outages, my house has a better record than Internap right now, and I don't even own a UPS!

    1. Re:Internap Sucks by Punk+Walrus · · Score: 1
      I see from the outage page that LJ people are now planning to buy their own UPS so that they don't have to trust Internap anymore

      Maybe I don't see how this is run in the big picture, but a data center is more than just power to servers, the power affects switches and routers as well, which I assume you don't own - Internap does. I know in the data centers we have (all are site-redundant, what we have on the North side of town is ALWAYS synced to a redundant on the South side), if you put a UPS to a rack of servers, it won't do you a damn bit of good if the whole power grid goes kaflooey because... well, your servers may be running, but the logs will be filled with "cannot reach network" errors. You'd need UPS's for the servers, then for every switch and router in the chain all the way to the telco outside.

      Right?

    2. Re:Internap Sucks by Huogo · · Score: 1

      The problem comes from the fact that the computer just lost power suddenly, thus weaking havoc on the databases. All the power is back up now, but the LJ servers cannot just resume normal function, because there is a chance the databases are fscked. If the servers stayed up, but the internal network went down, the databases would be perfectly fine, and as soon as Internap is back online, you can bring your servers back very shorly thereafter. Right now, Internap is back, but the databases are taking some work due to the sudden server shutdown, so LJ isn't up.

    3. Re:Internap Sucks by Scott+Laird · · Score: 2, Informative

      A couple points. First, there's *nothing* that you can do about the "idiot hit the big red button" problem--you're required by law to have the button, because it's a safety issue. It has to be accessible--you can't lock it in a closet. And everyone knows that if you put a big red button on a wall, sooner or later someone's going to hit it.

      I don't know what happened this time, but the ~2002 Internap Seattle outage was caused by an idiot Speakeasy tech who couldn't figure out how to use the exit door, so hit hit the Big Red Button instead.

      I worked for Internap at the time, and I spent weeks stuck inside that colo facility. It was basically the only "dot-com" grade thing that Internap built (they were usually somewhat thrifty, at least pre-2001). It sparkled. Everything was over-engineered. You had to go through multiple rounds of security to get access to anything.

      The last I heard last night, no one quite knew what'd happened yet. Apparently, multiple redundant power systems all failed at the same time. This facility was designed by a company that already had ~5 years experience running high-end colo facilities, and it was designed as the flagship facility for showing off to potential customers. This isn't a hole-in-the-wall hosting place, it's more of a bunker hiding in the shadow of the Space Needle. So, frankly, it'll be very interesting to see what happened, because no money was spared to keep this sort of thing from happening.

      (Disclaimer: I haven't worked for Internap since 2002. I still own a bit of stock, because it's not worth the hassle of selling it for what little it's worth. It's not really the same company now that it was when I started in '98, and only a handful of my former coworkers are still with the company. I'm not even going to *start* with my opinion of the current management.)

    4. Re:Internap Sucks by Gleef · · Score: 1

      if you put a UPS to a rack of servers, it won't do you a damn bit of good if the whole power grid goes kaflooey because... well, your servers may be running, but the logs will be filled with "cannot reach network" errors.

      When power gets restored to the switch however, you can come right back up. Without the extra UPS for the server, you have to worry about whether hardware got damaged, the database was properly committed to the hard drive, etc. Extra UPS for the server would mean they would have been back online at 7:35pm last night rather than still down this morning.

      On the other hand, this looks like a case of "always solving the last crisis".

      --

      ----
      Open mind, insert foot.
    5. Re:Internap Sucks by MegaFur · · Score: 1

      I work at a medium-to-large wastewater treatment plant in Knasas City, MO.

      While it's probably true that you can't /lock away/ the main breaker feeding electricity into the plant, AFAIK, there's no reason or need to put it out in the open in an area where any idiot can just throw it on accident. You can put it somewhere outside in a power cabinet with a whole bunch of warning signs on it. That plus the large humming noise coming from it really ought to deter all but the dumbest of idiots.

      Or are we talking about different things?

      --
      Furry cows moo and decompress.
    6. Re:Internap Sucks by ces · · Score: 1

      Well more to the point the problem LJ is having is due to a crappy database and not using decent storage systems.

      We were able to bring our Oracle boxes back up quickly after power was restored.

      --
      Happy Fun Ball is for external use only.
    7. Re:Internap Sucks by ces · · Score: 1

      Datacenters have to have a BRS by the exit doors both in case some tech shorts himself across a power feed and so the fire department can kill all of the power to the room as killing the power feed to the building won't do it.

      --
      Happy Fun Ball is for external use only.
    8. Re:Internap Sucks by Scott+Laird · · Score: 1

      Yeah, but it's not just for the fire department--by the time they got there, the tech'd be dead. As I understand it, it's really for local staff so they can kill the power if there's an imminent danger to another tech's life. The canonical example is an IBM tech with his tie stuck in a herking huge printer--either hit the EPO button or watch him get line-fed through the thing. While this exact example my not apply in colo spaces (I've never seen any big mechanical devices in any of the ones that I've been in), it's still important for electrical fires. Presumably, that's the main reason that we're required to put the damn things in.

    9. Re:Internap Sucks by silvwolf · · Score: 1

      While this exact example my not apply in colo spaces (I've never seen any big mechanical devices in any of the ones that I've been in)

      I got a tour of my college's datacenter during one of my classes. Pretty big school, about 18,000 students. Seemed to be a pretty standard datacenter w/ racks of servers, tape robot, raised floors, fire suppression system, etc. Also in the room was the printer that perforated & sealed the student account statements and other mass mailings. I found it rather odd that the machine was in there..

    10. Re:Internap Sucks by ces · · Score: 1

      Sorry I mean it was for saving the tech or for the fire department. Basicly any time you want to make sure everything is off RIGHT NOW.

      --
      Happy Fun Ball is for external use only.
    11. Re:Internap Sucks by Nurgled · · Score: 1

      The reason LJ was down for so long was because their database servers weren't cleanly shutdown and they had to carefully check all of the data to ensure it was written to disk correctly. With UPS in the racks for these servers they would have been taken down cleanly and the site would have been up again within minutes of the power coming back on.

  178. Geocaching by Pirogoeth · · Score: 1

    I wonder if this was the same outage that took down Geocaching.com? Talk about your worst case scenarios...

    --
    Happiness is like peeing yourself. Everybody can see it but only you can feel its warmth.
  179. Re:./ed !!!! Server Reboot Time? by Nurgled · · Score: 1

    You can read Brad's presentations on LiveJournal's setup. The LISA one is the most recent, I think.

  180. Hmn... by Shrubbman · · Score: 1

    Ironically, I had just finally got around to aquiring a consumer grade UPS for my own system, installed it, and posted on LJ about it shortly before all of this happened. Go figure.

  181. They're out on the streets! by Anonymous Coward · · Score: 0

    This frightens me. Now they will have to actually live a life rather than write about one. I like the way LJ & AOL keeps them online. Or maybe, hopefully, they are crying & hitting refresh...

  182. Re:./ed !!!! Server Reboot Time? by niteice · · Score: 1

    It looks like you're running on Linux...what filesystem are you using? Reiser(4(?)) would be a big help here.

    --
    ROMANES EUNT DOMUS
  183. Evil Empire strikes back by Anonymous Coward · · Score: 0

    According to the Russian-Israeli news service, MigNews http://www.mignews.com/news/technology/world/15010 5_54115_61571.html/ , a student group "Going without Putin" advocating the preservation of the existing public transportation discounts and army draft delays for the students got their website http://www.idushiespb.narod.ru/ busted. When they moved their discussions to the livejournal, LJ got its UPS exposion. This might have been a paranoia if LJ was not a major meeting point for the students during the recent Orange revolution in Ukraine that got Kremlin blackeyed. However this outage would have never happen, KGB or not if Brad has been using some mature high available suppliers like Stratus that AOL used to use, or Tandem. One should not be using unix hacks clustered or not for the system that requires high availability. Hire me Brad, I'll fix it for you!

    1. Re:Evil Empire strikes back by Anonymous Coward · · Score: 0

      Well, you've shown me the light. We'll see you in the office on Monday.

      --Brad

  184. Report the news, not your opinion by Anonymous Coward · · Score: 0

    People posting news reports to /. should refrain from posting kneejerk assumptions/opinions such as this one:

    "Perhaps Six Apart wasn't quite prepared for the responsibilities of a website of this size?"

    Unless you've researched the cause of the downtime and researched what happened, how it was resolved, etc, etc, leave your comments out.

  185. Re:./ed !!!! Server Reboot Time? by PopeFelix · · Score: 1

    I don't post here. Oh, wait...

    --

    Pope Felix the Scurrilous.
    Computer Geek by day, religious Icon by night.

  186. Not Six Apart's Fault by DarkKnightRadick · · Score: 1

    Internap has been hosting LJ since long before Six Apart thought about taking over LJ. If you bother to read you'll know this.

    --
    "There is a way that seems right to a man, but its end is the way of death." Proverbs 16:25 (NKJV)
  187. Who's been affected by sono · · Score: 1

    Anyone know of other sites that have been affected by the outage?

  188. Re:./ed !!!! Server Reboot Time? by Anonymous Coward · · Score: 0

    OMG I LOVE BRAD!!!!

  189. Re:./ed !!!! Server Reboot Time? by Anonymous Coward · · Score: 0

    If you get it, can you pass it on? I've done a whip-round for a gift basket and we're uncertain of the address we've found...

    copperbadge@gmail.com (lj user copperbadge)

  190. This is why I use Blogger.... by Patrick+Mannion · · Score: 1
    1. Hasn't never gone down due to a "power outage"

    2. Not filled with teenage girls and emo boys. Depsite the fact I like emo...a bit...

    3. OK, so maybe there are teenage girls on Blogger but they don't go "OMFG LYK i GoT kIsSeD bY jOhN tOdAy!!!111"

    4. You get a subdomain, not some crappy /users/nickname/

    5. It can be exported to sites.

    6. It never went throught a temporary phase in which you had to buy a damn "invite code" or one from a fucking friend.

    Also according to Wikipedia:
    As of December 2004, nearly five and a half million accounts had been created, of which approximately 1.4 million had been updated at some point in the last 30 days [1] (http://www.livejournal.com/stats.bml). Of those users who provided their date of birth, the vast majority were in the 15-23 age group. Of those who specified a gender, more than two thirds were female.
    --
    In America, you spam computers In Soviet Russia, computers spam you!
    1. Re:This is why I use Blogger.... by This+Is+Ridiculous · · Score: 2, Interesting

      Personally, I'll trade a subdomain for the elegant simplicity of the friends system, post security, threaded comments, communities, user images, easy and powerful customization, an open-source backend with some seriously useful software contributed to the community, clients, and a site that, during the 99% of the time it's running properly, is ridiculously fast.

      Actually, I won't trade a subdomain for all that. I'm a paid user, so I get one anyway.

      (And there's a simple solution to the emo teens: ignore them.)

      --
      Hey, you try to find an open nick these days!
  191. Jesus Christ... by Anonymous Coward · · Score: 0

    Does every frickin' LiveJournal thread on Slashdot have to devolve into a bunch of losers insisting blogs are a contribution to society?

    It's always the same defense, too. "LiveJournal lets me keep everyone apprised of my life, without sending mass emails." Like that's new. When I was a kid -- before email, jackasses -- there were people like you. They mailed "family updates" every Christmas, long letters about what they named their dog's puppies and who their cousin saw in Orlando. We mocked those people. No one took them seriously, and their "updates" always ended up in the trash.

    No one cares about your life. When you graduate college, send us a letter. When you get married, send us an invitation. Those are landmark events. But no one -- NO ONE, even your family members who pretend to read your LiveJournal -- wants daily, weekly, or even monthly updates on your life. Not via snail mail, not via email...and not even via one, convenient website.

    1. Re:Jesus Christ... by Anonymous Coward · · Score: 0

      No one cares about your life. When you graduate college, send us a letter. When you get married, send us an invitation. Those are landmark events. But no one -- NO ONE, even your family members who pretend to read your LiveJournal -- wants daily, weekly, or even monthly updates on your life. Not via snail mail, not via email...and not even via one, convenient website.

      I love it when people make blanket statements like that, especially in light of the fact that Livejournal thrives on the very idea that people DO care. I mean, it's like when people say that the beatles (or any other talanted musician/artists) have no talent. It's a valid thing to think, but you're, you know, SO OBVIOUSLY WRONG, so what's the point of even saying what you think? It's not going to change reality.

      I read my friends page every day, and there are probabaly 10 people whose lives I want to read about every day (they don't always satiate this need by updating, though). And guess what, I'm not the only one. That's kind of the point of Livejournal.

    2. Re:Jesus Christ... by Anonymous Coward · · Score: 0
      Livejournal thrives on the very idea that people DO care.
      No. LiveJournal thrives on the idea that teenagers want people to care. LiveJournal thrives on writers, not readers -- which is why the former group vastly outnumbers the latter (which is entirely composed of the former, anyway!).
    3. Re:Jesus Christ... by mdwh2 · · Score: 1

      When I was a kid -- before email, jackasses -- there were people like you. They mailed "family updates" every Christmas, long letters about what they named their dog's puppies and who their cousin saw in Orlando. We mocked those people. No one took them seriously, and their "updates" always ended up in the trash.

      Then you have completely missed one of the points of something like LiveJournal. With email, the sender decides whether the recipient gets sent the information. With a webpage, the user decides whether to read it (and I'm sorry that no one cares about you, but some of us have these things called "friends").

      Btw, why did you post this comment? After, if no one cares about what you have to say..

  192. Clueless editorial note on article by jridley · · Score: 1

    Yeah, the hosting center that they've been at for years has a power failure 3 days after a new place buys LJ. Must be Six Apart's fault, they're not "ready to handle this." Or, maybe it's a coincidence?

  193. That's why you'll be alone tonight... by Anonymous Coward · · Score: 0

    That's funny, I see subdomains on LJ.

    And you're making it seem like the fact that it's mostly female is a bad thing for an online community, eh? You'll fit in quite well at /., killer. I hope it keeps you warm at night, loser.

    1. Re:That's why you'll be alone tonight... by Patrick+Mannion · · Score: 1

      Oh, whatever. What I'm saying is you get the subdomains for free on Blogger... on LiveJournal you have to pay to get them I'm sure.

      --
      In America, you spam computers In Soviet Russia, computers spam you!
    2. Re:That's why you'll be alone tonight... by Anonymous Coward · · Score: 0

      You're 5'9" and 204#. No wonder you stay away from women, they're probably pretty hostile towards you, lardass.

    3. Re:That's why you'll be alone tonight... by Patrick+Mannion · · Score: 1

      Oh fuck you.

      --
      In America, you spam computers In Soviet Russia, computers spam you!
  194. Frap by LooseChanj · · Score: 1

    I just wanna rant about the doofus who followed me around my mom's condo building muttering "9/11", "you just don't get it", "do you live here", etc.

    Ok, there, whew.

    --
    Mix the failings of Usenet with the shortcomings of the World Wide Web and the result is slashdot.
  195. fools on their pc's by Anonymous Coward · · Score: 0

    Didn't they think after it first went down with Internap that they should switch. Honestly, servers these days...

  196. Knew it! by wuice · · Score: 1

    I KNEW that this story would be on slashdot, and I knew someone would make a crack at Six Apart (who are brand new to the scene) and imply that somehow magically the power wouldn't have gone out if it weren't for Six Apart owning Livejournal.

    1. Re:Knew it! by dswensen · · Score: 1

      Amen.

  197. Clues Left By Brad? (Re: "smart migrator") by jlwn111 · · Score: 1

    Hi. Perhaps it's more than just the power failure? Perhaps BradFitz's last cached LJ entries may hold some additional clues as to why it's taking so long after the power failure to bring LJ back up? Yes, perhaps they do. Enjoy: Jan. 12th, 2005 @ 02:31 am *yawn* It's one of those database nights. Watching 3 sets of progress bars move way too slowly. Wish there were something on Tivo at least to kill some time. Jan. 11th, 2005 @ 09:15 pm smart migrator I just wrote a database migration script which, after each chunk of data moved, asks the load balancer (Perlbal) the free user queue depth. If more than 10 (less is just noise), it sleeps a second and asks again. Only once the queues are empty does it migrate more data. End result: it moves data as fast as it can, without affecting page response times. I've been meaning to make a generic wrapper for any utility, where the wrapper parent watches the load balancers, and the child does its work at full speed, but the parent will occasionally SIGSTOP/SIGCONT it...... haven't got around to that yet. The generic wrapper could even have another pluggable-child as its rate-limit determiner, so anybody could use it. Jan. 11th, 2005 @ 01:24 pm Parallel compression Are there any multi-threaded compression algorithms, or at least wrappers/formats for interleaved compression, with variable interleave size? It'd be nice to take advantage of multiple processors when gzipping 380 GB, while still doing sequential reads, even if the resultant file was non-standard.

  198. Re:./ed !!!! Server Reboot Time? by This+Is+Ridiculous · · Score: 1

    LJ has always been a somewhat cash-starved operation; they make a significant amount of money from their paid users, but they also have a lot of expenses--full-time employees, an ever-expanding user base on a technology that isn't easy on hardware, bandwidth use...

    As it is, most (all?) of their employees are in Portland, so they keep all their servers there, where they can quickly get at them if something happens. Having a second datacenter would be hard on their employees, hard on their budget, and hard on their architecture--for a site that, in the end, isn't critical to have running 24/7.

    --
    Hey, you try to find an open nick these days!
  199. Re:Look at me! Look at me! by identity0 · · Score: 1

    Cut the poor bastards some slack, at least they have the excuse of "teen hormones".

    Nothing, on the other hand, can excuse Taco's lame blog:

    Why is it that my personal value as a human being is always tied 100% to the status of my server. Since last week the box has been cranky (a blown power supply, resulted in the harddrive being happily moved to a machine with 128 megs less RAM, which means the whole thing is just sluggish as hell today. And suddenly I feel like shit. I feel tired unhealthy, and burnt out. A few weeks ago, I was on top of the world: the machine was stable, kicking out 640,000 pages in one day, and performing snappy for everyone. And I was cheerful. Its really strange that a chunk of steel and silicon 3 time zones away defines my mood.

    The airline lost my luggage... it contained 4 pairs of boxers.

    So ya know that annoying ad with the damn taco bell dog and the cops that keep saying 'Drop the Chalupa' over and over again? I hate that ad.


    Good gawd... Taco can put most teenage girls to shame when it comes to lame personal details publicized.

  200. Eejits by Anonymous Coward · · Score: 0

    Wow too many LiveJournal haters here. People, you're wasting your time. What's so wrong with LJ, anyways? There's all types of people that use it, not just 13 year old girls that can't spell, so don't be so goddamned ignorant. And At least it's not as slow as this lame site.

  201. Re:./ed !!!! Server Reboot Time? by ertdredge · · Score: 1

    Personally, I'm finally switching from postgres to mysql after 8 years of happy use of the former because it's finally let me down.

  202. Spoke with Sandy (Brad's Mom) about 30 minutes ago by mandelion · · Score: 1

    They are hoping to have limited capacity on the site in a few hours. They have not slept very much and called Six Apart the minute everything went to hell. Plans for the holiday weekend for the Fitzpatricks went to hell and no one has had much rest. They are testing everything and all the rumors out there are just that - rumors. Mena and her crew were notified the minute it went down. So the people calling up LJ's customer service threatening to slit their throats, saying Six Apart got punk'd, and everything else - can you be more emo? Come on now, if you need to journal that bad - head over to GJ just get your fix. Otherwise, give them time. This has only happened one time before and that was confirmed by Sandy when she called me back (I called asking for an interview and more information as I am doing a story on blogging and this outage for a site I write for.) My god, you would think someone is killing kittens or something the way people are crying in chat rooms. http://www.mandelion.com

  203. Get your T-Shirt here! by Striver · · Score: 1

    OK...I just couldn't resist doing this...In keeping with the American tradition of compassionate response to disaster...

    Get your souvenir T-shirts and coffee cups HERE!

    --
    this is loaner...my sig is in the shop
  204. We see all, we know all... by HPTiPete · · Score: 1

    Those LiveJournal users are everywhere!

    Yes, we do want our fix.

    Go, Team LJ, Go!

  205. Re:./ed !!!! Server Reboot Time? by Anonymous Coward · · Score: 0

    Most of the employees are in Portland, but there are at least three who work remotely from other parts of the US. The servers are physically hosted in Seattle, and the two sysadmins (Lisa and Matthew, iirc) live and work there.

    LJ user Emmavescence, too lazy to sign up...

  206. Authorize.Net by rayvd · · Score: 1

    A-ha! Our credit card processor -- Authorize.Net also went down hard yesterday. I did a quick tracroute just now and see that they're also located at Internap.

    Whoops :)

  207. Great... by greyspot · · Score: 1

    Now somebody's going to start a ribbon campaign.

  208. LJ is back up... except my cluster. by vcutag · · Score: 1

    It's back up. Unless you're on Filet MIgnon or Madcow (like myself.)

    1. Re:LJ is back up... except my cluster. by kben · · Score: 1

      Actually, none of the clusters seem to be up and running as of yet.

    2. Re:LJ is back up... except my cluster. by vcutag · · Score: 1

      I've managed to post to mine, it's on roastbeef, so I was wrong.

    3. Re:LJ is back up... except my cluster. by kben · · Score: 1

      :\ I was pretty sure that I was on Green... but maybe I'm on Madcow. *kicks it*

  209. Re:./ed !!!! Server Reboot Time? by Anonymous Coward · · Score: 0

    you are such an ass

  210. Re:./ed !!!! Server Reboot Time? by Anonymous Coward · · Score: 0

    Quite possibly, the "ass" has never managed much of anything and he may have cost us, further updates. Plus, he insulted a good guy. Perhaps, "ass" isn't right word and he's more of a punk...

  211. ljpr0n by Anonymous Coward · · Score: 0


    http://www.livejournal.com/community/artinnudity /

  212. Yay! by Goodwill_Michael · · Score: 1

    Been a bad 24 hours without LJ :( Its back now though, yay! :D

  213. Re:Look at me! Look at me! by mdwh2 · · Score: 1

    Speaking as a former teenage self-harmer, they only do it because they take it seriously and believe others will.

    Though perhaps not everyone is the same as you. In my experience, many if not most self-harmers (both teenage and older) do not do it because of what they think others will think of it.

  214. Re:./ed !!!! Server Reboot Time? by Anonymous Coward · · Score: 0

    OMGWTFLOL!!!1!1!!! me2. :-p

  215. WoW 16+ hours downtime - coincidence? by Macgrrl · · Score: 1

    Bizzarely enough we had major downtime for both WoW and LJ this week.

    Maybe it's a conspiracy to see if they can force all the computer geeks ut of the house and into the sunshine for a few hours this month?

    --
    Sara
    Designer, Gamer, Macgrrl in an XP World
  216. Re:One man by jabber01 · · Score: 1

    True, one man can't control it. One man with a backhoe, however, is a different story entirely. ;)

    He who can destroy a thing, controls a thing.

    --

    The REAL jabber has the user id: 13196
    What you do today will cost you a day of your life

  217. Glass houses... by cyranoVR · · Score: 1

    Perhaps Six Apart wasn't quite prepared for the responsibilities of a website of this size?

    Last time I checked, LiveJournal wasn't experiencing 503 Service Not Available error downage for 1 - 2 hours every day (unlike certain other web sites).

  218. worst thing for LJ (this article) by Aeron65432 · · Score: 0

    can we say slashdot effect?

  219. Faulty Mobo's? Faulty thinking. by sparkz · · Score: 1
    Bullshit.
    Speed/Duplex negotiation is an OS configuration issue, not a hardware NIC issue.
    If the OS can't configure the negotiation, that's still the OS, not the hardware. It just means that the driver isn't capable of properly configuring the NIC. Just because your workaround was in hardware, does not mean that that is the cause of the problem.

    As for adding your own UPSes which ignore the EPO, is surely that defeats the object of the EPO. I don't know USAian requirements, but if, as you say, the EPO is required, is it legal to bypass it with your own UPSes?

    LJ clearly have not heard of DR; although a true DR configuration is probably overkill for this type of site, this report gives the strong impression that basic sysadmin competencies were not followed when there was time available - during design and deployment, and then later during normal running. These problems had apparently not occured to anyone until it happened. Isn't "what's the worst-case scenario" a common-enough question? Wouldn't "total power failure" be one of those answers?
    Even with write-though caches, a small battery in the array can flush data to disk after a power failure. This isn't rocket science - buy the right kit for the job, understand what you're buying, and how to configure it. If you don't understand what it is, what on earth made you decide to buy it?!! You've got dual-powered systems, but didn't use that feature - why did you buy it then? It wasn't a conscious decision to take the risk, and it wasn't a conscious decision to get dual-powered hardware for resilience. No thought was made about power. Most colo's provide dual-sourced power supplies for this type of problem - power from seperate grids, so even if the grid providing power to the datacentre goes down, the alternate grid continues running.

    Sigh. I often have customers nearly as daft as this, though I don't think I've come across such a poorly considered deployment for a long, long time.

    --
    Author, Shell Scripting : Expert Re
  220. What's most amazing about that link... by Paradox · · Score: 1

    It isn't that someone did a breakfast journal. I could see someone doing that as a joke. It doesn't shatter my worldview.

    What gets me is the sheer volume of comments! Not just from a small group of people, but if you look, you see lots of different people doing it.

    That DOES shatter my worldview.

    --
    Slashdot. It's Not For Common Sense
  221. Re:The Big Red Switch -- nice simple technology... by GuruBob · · Score: 0

    oh the nostalgia...
    the ohonosecond as the 50hz and 400hz supply goes the lights, then a short pause as the brakes on the 3380 HDA's belt drives kicked in literally clank clank clank right through the disk farm..

    the aircon isolation switch was next to the EPO's and our site sparky was due to do Liebert maintenance...

    We had some nice simple technology (which of course was only useful if you lost external power....) a room full of hundreds of car batteries in series.. as a last ditch backup in case gennys and thycon both failed.

    --
    Facebook is a woodpecker tapping on the skull of Humanity, Forever.