Slashdot Mirror


How the Leap Second Bug Led Facebook To Build DCIM Tools

miller60 writes "On July 1, 2012 the leap second time-handling bug caused many Linux servers to get stuck in a loop. Large data centers saw power usage spike, sometimes by megawatts. The resulting "server storm" prompted Facebook to develop new software for data center infrastructure management (DCIM) to manage its infrastructure, providing real-time data on everything from the servers to the generators. The incident also offered insights into the value of flexible power design in its server farmss, which kept the status updates flowing as the company nearly maxed out its power capacity."

1 of 46 comments (clear)

  1. Re:DCIM by atom1c · · Score: -1, Offtopic

    Since the two realms of specialty (data centres vs. digital cameras) do not overlap -- except that, in this fringe case, Facebook might give a crap about one of them -- I, unfortunately, fail to see the relevance of your comment.