Facebook Experimenting With Blu-ray As a Storage Medium
s122604 links to CNN's explanation of what may be the future of cold (or at least lukewarm) storage at Facebook, which is experimenting with massive arrays of Blu-Ray discs for seldom-accessed user files. Says the report: The discs are held in groups of 12 in locked cartridges and are extracted by a robotic arm whenever they're needed.
One rack contains 10,000 discs, and is capable of storing a petabyte of data, or one million gigabytes.
Blu-ray discs offer a number of advantages versus hard drives. For one thing, the discs are more resilient: they're water- and dust-resistant, and better able to withstand temperature swings. Their data can be restored more quickly, and they're easier to transport.
Most important, though, is cost. Because the Blu-ray system doesn't need to be powered when the discs aren't in use, it uses 80% less power than the hard-drive arrangement, cutting overall costs in half.
... those drives offline or come up with a system to power up the drives via custom SAN hardware when you want to access them? With Facebook's cash it should be doable.
If they only keep one copy, how do they detect and recover from bitrot?
Or is the stuff already not important enough to keep more than one copy around?
My stupid past shared on Facebook can now last even longer!
I know that enterprise-grade hard drives are made to spin for years without failing, but there are hard drives that are made to be spun down and essentially powered off when idling: laptop drives. Again, they are not made for enterprise storage, but neither is Blu-ray, so I find it curious that this would be the USP of this solution.
- Henrik
- when the Shadows descend -
BD also cost more per-GB than a HDD.
I hope it gets those cartridges faster than RedBox.
Can I ask Facebook to delete my stuff from one of those (assuming I had a Facebook account in the first place)
Let's add up those bytes:
12 x 50GB (calculating with DL discs) gives 600GB per BR cartridge, or about the storage of a physically smaller LTO3 tape with some compression. (LTO4 gives 800GB uncompressed.) This gives 0.47PB of storage per rack.
LTO can be rewritten if needed. Of those you can pack 1320 tapes into one IBM TS3500-S54 storage frame, for 3.2PB of uncompressed data using LTO-6 tapes.
The BR discs can be a bit faster when retrieving many small files, yet I still wonder about the logic here...
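The arithmetic in the comment above can be checked with a short sketch. The disc and tape counts come from the thread; the constant names are made up for illustration, and the assumption that "PB" here means the binary unit (dividing by 1024 rather than 1000) is a guess that happens to reproduce the 0.47 and 3.2 figures.

```python
# Figures quoted in the thread; constant names are illustrative only.
DISCS_PER_CARTRIDGE = 12
DISCS_PER_RACK = 10_000
DL_DISC_GB = 50            # dual-layer Blu-ray disc
TAPES_PER_FRAME = 1320
LTO6_NATIVE_TB = 2.5       # uncompressed capacity per LTO-6 tape

cartridge_gb = DISCS_PER_CARTRIDGE * DL_DISC_GB
rack_gb = DISCS_PER_RACK * DL_DISC_GB

# Assumption: the commenter divided by 1024 at each step (binary units).
rack_pb_binary = rack_gb / 1024**2
rack_pb_decimal = rack_gb / 1000**2

frame_tb = TAPES_PER_FRAME * LTO6_NATIVE_TB
frame_pb_binary = frame_tb / 1024

print(f"cartridge: {cartridge_gb} GB")
print(f"Blu-ray rack: {rack_pb_binary:.2f} PB binary, {rack_pb_decimal:.2f} PB decimal")
print(f"tape frame: {frame_tb:.0f} TB, {frame_pb_binary:.2f} PB binary")
```

With 100GB discs instead of 50GB ones, the same rack works out to the one petabyte quoted in the report.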
Enterprises have been doing this with tape for 30 years.
In fact, modern tape technology probably has a higher "volumetric" density than BD.
"I don't know, therefore Aliens" Wafflebox1
>"Their data can be restored more quickly"
Than a hard drive? I think not.
> "the Blu-ray system doesn't need to be powered when the discs aren't in use, it uses 80% less power than the hard-drive arrangement, cutting overall costs in half."
Say what? When my backup hard drives are not being used, they also use zero power because they are not plugged in. And when they ARE plugged in, they "power down" after a few min of no usage, which I think is like 1% of normal power.
The density of storage for bluray is also not better than hard drives, and the writing is much slower. I also don't see how transport is so much better than laptop hard drives. Bluray MIGHT be cheaper, depending on how you value your criteria... and the discs are more rugged (if that even matters).
When you first access this data, you have to sit through 42 previews before you get to it.
It was a joke! When you give me that look it was a joke.
I read TFA. They're not using them as "storage" in the sense of active, accessible storage. It's a backup system.
What they're trying is, instead of storing redundant copies of everything on multiple drives (for resilience and geolocality), they're keeping one copy live and keeping backups on blu-ray.
So there's never a latency of minutes while it loads data from Blu-Ray, you just might be routed to Siberia or something to get the one active copy. If that copy's bad, error (restore from backup during next nightly batch or something).
Why not just spin the harddrives down when not in use?
> "BD also cost more per-GB than a HDD."
Fifty "25GB 4x BD-R Hard Coating" discs for $35 is about 1250GB, or about 36GB per dollar.
A 3TB drive (I would say the sweet spot) would be $100, or about 30GB per dollar.
So Blu-ray is slightly cheaper per GB for me. I suspect in bulk the differences are bigger.
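For anyone who wants to plug in their own prices, here is a minimal sketch of that comparison. The prices are the ones quoted in the comment; the function name is invented for illustration, and the cost of the drive or burner is ignored.

```python
def gb_per_dollar(units: int, gb_per_unit: float, price_usd: float) -> float:
    """Return how many gigabytes one dollar buys for a given pack of media."""
    return units * gb_per_unit / price_usd

# Prices as quoted in the comment above (retail, not bulk).
bd_r = gb_per_dollar(units=50, gb_per_unit=25, price_usd=35)
hdd = gb_per_dollar(units=1, gb_per_unit=3000, price_usd=100)

print(f"BD-R spindle: {bd_r:.1f} GB per dollar")
print(f"3TB HDD:      {hdd:.1f} GB per dollar")
```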
they had cold-storage CD jukeboxes at (well-known HVAC) back that far for old catalog crep. heck, they had rooms full of videotape carts in TV stations back that far... take your pick, VHS pro or Beta Pro. robotic storage is way old, just the medium changes, depending on what you are used to in your industry.
if this is supposed to be a new economy, how come they still want my old fashioned money?
How is this different from the last time the topic was on the front-page of /.?
http://hardware.slashdot.org/s...
Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
Nope, you can't quit. Facebook has had the mindset from the beginning 'you have no privacy'. I originally thought, 'Facebook is storing abandoned accounts on blu ray.'
Shame LinkedIn went down the privacy-destroying road of Facebook. LinkedIn could have been really great for the professional.
"Those data demands will only increase with time, particularly as personal cameras and smartphones become capable of capturing higher-quality images."
From Facebook: "We automatically take care of resizing and formatting your photos for you when you add them to Facebook."
It little behooves the best of us to comment on the rest of us.
They ought to try bees. It's good enough for HEX.
If they go with the blu, it could actually drive the price of this dog down enough so that non-billionaires could afford it. $2 per disk is (ungodly expensive). I am more accustomed to 100 disks for about $20 (20 cents per disk). I know blu-ray holds more, but the format just seems too expensive. I've seen side-by-side comparisons between dvd and blu-ray films, and yes the vertical and horizontal resolution doubles and so (very) fine details that are lost on DVD you can see with blu-ray. Other than that, the differences are for the most part minimal. As a data storage medium, I'm not compelled to move because I already have installed equipment, and while the storage is a 5 fold increase, the cost is also a 5 fold increase. No economies of scale have yet applied to blu-ray.
n/t
I'm sure there's other tape libraries with similar densities, but the IBM TS4500 (http://www-01.ibm.com/common/ssi/printableversion.wss?docURL=/common/ssi/rep_ca/2/897/ENUS114-072/index.html) high capacity frame (storage only) can hold 1320 LTO 6 tapes each with a 2.5 TB native capacity.
What does that TS4500 cost? I'm curious how it compares to a stack of dumb 16-bay SAS enclosures at $300 each.
http://www.ebay.com/itm/like/1...
A general-purpose FreeBSD or Linux system with four RAID cards can control 1024 drives mounted in such enclosures, so about $2 per drive for the intelligent bit.
I dunno. I've never been pleased with the performance of optical media. I'd think being in a data center, heating up and cooling down from usage and storage is going to have very bad effects on recordable optical discs (CDs, DVDs, Blurays). Not to mention, it's always a pretty well known fact, consumer recorded media (the ones with dyes and stuff) aren't terribly reliable in the long term. My personal experience with recordable optical media is poor at best, I have very very few discs that've remained readable and error free after just five years of relatively decent care and storage. And this is not even using them every day, heating them up and cooling them down, just stored in a dark cool place.
Seems... overhyped. I simply can't come to believe this is an actual viable storage medium for any kind of large scale operation. But enh, if it works for them, good deal. Seems like you'd get more bang for your buck using high capacity tapes which hold up much better to heating up and cooling down.
The power saving claim also seems silly. This could easily be done with standard hard drives in the kind of cartridge system they say they're using, powering down unused drives and putting them into a storage position (though I think it'd be much smarter to make the connector the moving part and just plug into the right bank of HDs, instead of moving HDs around in a cartridge).
The more I think about this operation, the less intelligent and efficient it seems to be.
Not that I had any trust in them anyway.
Blu-Ray, and indeed any modern optical storage, is very short-lived precisely because it's designed to be cheap. The laser disks used to store the Domesday Project in Britain were still readable after 20 years. Modern optical storage typically decays within 5. Less, as the density goes up. And failures take out far larger percentages of the storage.
Magnetic tape is still the only trusted long-term backup medium. I wouldn't suggest it for something like Facebook purely because of seek times, but it's hard to think of any viable alternative.
With Blu-Ray, to guarantee to avoid complete disk loss, you'd have to be re-archiving the entire archive annually. That adds an enormous invisible cost to the project. They're not going to do that. Which means there's guaranteed loss of backups. How much depends on the exact storage conditions but it won't be pretty.
As for better ability to withstand conditions, it again comes down to the nature of the storage. Optical disks are highly vulnerable to a lot of things that hard drives are not. Overall, optical storage usually performs very badly in comparison, as the things hard drives are vulnerable to are cheaply avoided but the things optical storage can be attacked by are usually a lot harder to deal with.
I'm sure you're aware that none of the above formats (tape included) are considered "archival quality" - they just don't have the sort of durability required by that categorization. No known digital format does and there's nothing you can do to stabilize them. It's a big research area. For now, tape is considered the only method that is economic and durable, with the lowest loss of data per failure.
It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
Waiting for j-43289.ar-298.bluray.facebook.com...
FaceBook is EVIL.
FB should not be keeping user data in the first place without the users explicit and continued request to do so.
Users are dumb and forgetful. FB should be asking them regularly at least every six months, "hey we have these pictures and walls, do you still use them and want us to keep them?".
Otherwise FB is keeping and using and mining stuff without consent/asking... basically confirming they're EVIL.
Your entire life, online, and on EVIL FB no less... how stupid can you be.
On my systems I build what is called RAWPOD for backups.
Like an external drive but safer.
RAWPOD=Redundant Array With Powered Off Disks (kind of like RAID by analogy)
I mount regular SATA disks in my towers and make a special power harness. I use a DPDT key switch that opens the +5V and +12V power to the drive.
As a result my backups are a minute away in case of primary drive failure, but are safe, powered off 99% of the time against power problems, virus, accidental deletions, mechanical shocks. This is layer one in the back-up strategy next to the off site back-ups.
I use Syncback for free backups with my RAWPOD disks.
Okay, so we need disc 101 from tray 1010101 and the robot arm is busy, three other fetches already in the queue. After 30,000ms client Javascript times out and substitutes a "retrieving data, re-try for a few minutes" place holder, sets a longer camp-on timeout and releases the request.
The reason the robotic arm is busy is that despite random assignment to storage pools with some localized album grouping, web crawler activity for public albums, and bulk pre-fetch requests for semi-private albums by browser plugins run by logged-in users (which became more popular as access time increased) ... the lukewarm storage facilities are running hot and queues are full most of the time.
Despite the polished and smoothly functioning presentation that encourages the users to "just wait a bit" ... a dark rumor grows deep in the hearts of many that the data is not merely delayed, they must brush off dust and cobwebs, or root for it because it had been haphazardly tossed into a pile of rubbish somewhere, relegated to the digital Basement. Facebook does not think your photograph is of sufficient merit. Grandmother has long passed and you had not wished to look at her last week, so... why should you be interested now?
The effects are complex, but the cause is clear: the Internet is perverse. It re-routes around any attempt to take immediate access data off-line by degrees, accomplishing this through a series of countermeasures such as unwelcome crawlers depleting your cache, hitting your 'public' cold data systematically and regularly, then finally bankrupting your company as users migrate to another service whose superior performance does not arise from superior engineering -- merely the fact that fewer users are using it.
So the moral of the story is, if you are Facebook and wish to remain so, you will either strive to find a way to keep the random access time for everything down below 2000ms -- or die.
And also, Facebook would be wise to heed the following:
once / forgotten by tourists / a bicycle joined a herd of mountain goats /// with its splendidly turned horns / it became / their leader /// with its bell / it warned them / of danger /// with them / it partook / in romps / on the snow covered / glade /// the bicycle / gazed from above / on people walking; / with the goats /// it fought / over a goat, / with a bearded buck /// it reared up at eagles / enraged / on its back wheel /// it was happy / though it never / nibbled at grass /// or drank from a stream /// until once / a poacher / shot it /// tempted / by the silver trophy / of its horns /// and then / above the Tatras was seen / against the sparkling / January sky /// the angel of death erect / slowly / riding to heaven / holding the bicycle's / dead horns //////~Jerzy Harasymowicz
<blink>down the rabbit hole</blink>
How big a stack do you need to match a 1320-tape library? Even using 4TB disks you're talking 825 disks, which means 52 enclosures. And then four racks to hold those enclosures. And enough floor space to hold those racks. And enough circuits to power those racks.
At that level of scale, tape is simply a better option for archival storage.
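A quick sketch of that sizing, using only numbers from the thread (1320 LTO-6 tapes at 2.5TB native, 4TB disks, 16-bay enclosures at $300 each). The variable names are invented for illustration, and the cost line covers the bare enclosures only, not disks, racks, or power. Note that a partly filled enclosure still has to be bought, so the count is rounded up.

```python
import math

TAPES = 1320
TAPE_TB = 2.5              # LTO-6 native capacity
DISK_TB = 4
BAYS_PER_ENCLOSURE = 16
ENCLOSURE_PRICE_USD = 300  # eBay price quoted earlier in the thread

library_tb = TAPES * TAPE_TB
disks_needed = math.ceil(library_tb / DISK_TB)
enclosures_needed = math.ceil(disks_needed / BAYS_PER_ENCLOSURE)
enclosure_cost = enclosures_needed * ENCLOSURE_PRICE_USD

print(f"library capacity: {library_tb:.0f} TB")
print(f"disks needed: {disks_needed}")
print(f"enclosures needed: {enclosures_needed} (${enclosure_cost} before disks)")
```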
That should make the right to be forgotten super simple to implement. They just need a little box where a tiny mallet comes out and breaks the disc when someone wants to delete a photo.
Facebook has just reinvented the robotic tape silo that mainframes have been using for decades...!
" For one thing, the discs are more resilient: they're water- and dust-resistant, and better able to withstand temperature swings."
Moisture from storage will cause the discs to corrupt. Typical life span is 3-5 years on a shelf. Temperature swings can shorten that more. The dye in the discs (whether CD, DVD, or BR) wears out.