LiveJournal Servers Go Down

← Back to Stories (view on slashdot.org)

Posted by ryuzaki0 on Friday January 14, 2005 @03:30PM from the only-mostly-dead dept.

Wind writes "According to any journal hosted off of LiveJournal.com, the LiveJournal data center Internap has suffered a critical power failure, leaving all of LiveJournal and its content temporarily offline and requiring the revival of 100+ servers. Perhaps Six Apart wasn't quite prepared for the responsibilities of a website of this size? Updated information is posted here."

14 of 596 comments (clear)

Lights out by r_glen · 2005-01-14 15:31 · Score: 5, Funny

Sounds like someone was taking a nap over at Internap
The Pain ... by webfiend · 2005-01-14 15:31 · Score: 5, Funny

You can't imagine the withdrawals I'm going through. It's like the great Slashdot brownouts of '98.

I need my fix, man!
1. Re:The Pain ... by DrEldarion · 2005-01-14 15:59 · Score: 5, Funny
  
  Honestly! Now we have to wait a day or so to find out what MelissaMinx492 ate for breakfast today!
In other news... by Anonymous Coward · 2005-01-14 15:32 · Score: 5, Funny

...the collective IQ of the internet has raised about 20 points.
slashdot has repeated 503 errors, by Anonymous Coward · 2005-01-14 15:33 · Score: 5, Insightful

and search.pl is constantly being trashed by distributed xanga botnets. perhaps michael wasn't quite prepared to be an editor of slashdot?
1. Re:slashdot has repeated 503 errors, by stupidfoo · 2005-01-14 15:41 · Score: 5, Insightful
  
  How is this a troll? It's funny that an "editor" at site with as many problems as slashdot has feels that it isn't amazingly hypocritical to mock another site that is currently having problems. People in glass houses indeed.
  
  Slashdot has semi-major problems almost every day. 503 errors, "nothing for you to see here" annoyances, and a search engine that goes down more than a Thai hooker.
Internap is *down*? by MightyTribble · 2005-01-14 15:33 · Score: 5, Informative

Internap *down*?
Bush just appointed Internap's CEO to his National Infrastructure Advisory Council, yet the man can't keep a co-lo facility switched on.
I'm not sure what that says of Bush or of Interap. And it certainly doesn't seem to have anything to do with SixApart.
What a cock by realdpk · 2005-01-14 15:34 · Score: 5, Insightful

"Perhaps Six Apart wasn't quite prepared for the responsibilities of a website of this size?"

Perhaps shit happens, and a blog service doesn't warrant the necessary investment to survive whatever caused this outage?
Disclaimer: I am Not an Electrical Engineer by ebooher · 2005-01-14 15:53 · Score: 5, Informative

I know nothing of how InterNap is set up. I just want to throw that out there ahead of time. Now, it's time for my patent pending "Bull Shit Theory of the Day."

Ok, here is the rant. I used to work for a Colocation facility. Nothing special, small by Telco terms. The whole facility only had about 1500 cabinets. (Though I hear they are now full, and going to be expanding.)

We had a main power draw off of the local grid. We had a backup power draw off of the *next* cities power grid. (ie, when all the offices around us went dark, we still had power.) And you don't even want to know the kind of red tape we had to go through for *that* pull. I'm still not sure how they did it. We had fly wheel kinetic electricity storage systems, battery backups, and a diesel engine from a train so large it had it's own building.

We used to joke that if we lost power, we had more important things to worry about. And again, we were small time compared to some of the massiveness that is out there. *cough*AADS Chicago*cough*

So I'm kind of in agreement with the statement currently on LiveJournal. It's unknown to me how any self respecting colo facility can say "We've had a power outage that also took our redundant systems."

I have to call bullshit on that entire train of thought. If that's true then they don't *have* any redundant systems, and I'd be looking for a new provider. The most likely thing (at least in my mind) is that someone, somewhere got mad at something specific and decided to make a point by popping the main breaker to their portion of the facility.

Oh, that was another thing, each room had several "main" breakers. It took a hell of a power surge to pop all of them, and the Liebert systems had power filters of some kind, really really big capacitors or something I think, so a surge really never made it to the other side anyway, it got stored in the cap and then trickled out like the rest of the power.

But I was a UNIX admin, not the EE that was planning the power generation aspects of the facility. So take some of it with grains of what ever white powdered spice you prefer.

--
"Genius may shine aloof and alone, like a star, but goodness is social, and it takes two men and God to make a Brother."
1. Re:Disclaimer: I am Not an Electrical Engineer by Anonymous Coward · 2005-01-14 16:24 · Score: 5, Informative
  
  My friend's company is hosted by internap. Today he messaged me when the power went down. It was only power to the second floor, my friend's servers, while cut off to the internet were still running (on the 3rd floor). Internap has redundancy and backup generators (and enough fuel onsite to run for 30 days without external power). Apparently there was construction occuring on the second floor... my guess is that some dipshit contractor cut through a power cable or 3 and took the whole floor down.
  
  To all the people accusing LJ of being stupid for not having UPS systems, Internap has 3 fully redundant power systems (yes, I know, didn't help much) so most people probably don't feel the need to run their own ups.
A great disturbance in the Force... by YowzaTheYuzzum · 2005-01-14 15:59 · Score: 5, Funny

... as if millions of teenage girls suddenly cried out in terror and were suddenly silenced.
Re:./ed !!!! Server Reboot Time? by bradfitz · 2005-01-14 16:08 · Score: 5, Insightful

They all came back up when the power came back.

But we intentionally don't have databases come back up on boot because if there was a blip, we want to do an integrity check first. (we run InnoDB, so it's ACID, but we're paranoid ...)

We have clusters of 2 identical databases in separate cabinets, separate switches, separate Internap power feeds... so normally losing one database in each cluster doesn't matter: the other one gets used. But when we lose every single database, in all clusters, all at once... that's the time to be paranoid and double check stuff.
Re:./ed !!!! Server Reboot Time? by bradfitz · 2005-01-14 16:39 · Score: 5, Informative

At this point all my whiteboards are full of boxes of each database cluster, the machines in that cluster, which have passed their checksum tests. (innodb checksums each 16k page), which replayed their replay/undo logs, where in binlogs each was writing/reading/executing etc...

So lots of waiting now on the checksum validators. I don't want to put a machine back in and find out in a week there was a database page that was corrupt because the battery-backed write-back cache on the RAID card didn't work as advertised. (which happens on about 95% of RAID cards, in my experience, because they're mostly crap, even the most expensive ones...)

Also whenever there's any doubt about something's integrity, we backup or snapshot the potentially corrupt version before operating on it. That operation can take time too.

It's going to be a fun night.
Value of Livejournal - "Open Source Philosophy" by DemonWeeping · 2005-01-14 16:49 · Score: 5, Interesting

For those who don't know what's so hot about it and for those who think Livejournal is just a bunch of teenage girls whining.... Livejournal has just about four years of my life documented. The ease of use and the ability to "vent" is comforting, but the real value comes in the interaction. My friends see my life at their convenience and I see theirs at mine. We can choose to ignore the whining of others or we can choose to relate and comment on our own experience. Think of it this way: Open-source philosophy, emotion, and life. I put my own out there and others add to it. I add mine to others. Granted ... those quiz/meme things HAVE TO GO. I do not want to read about "what frog best resembles me" or "which 80's hair band song is me." Grrr.