Petabyte Storage Array
knight13 writes "Engadet is reporting that EMC is rolling out a petabyte RAID array. From the article, "And if you're ready for that level of storage, there's now someplace to get it: EMC has launched its first petabyte array, a version of the company's flagship Symmetrix DMX-3 system that includes nine room-filling cabinets of drives." The price? A mere $4 million."
JPEGS would that be?
If you mod me down, I *will* introduce you to my sister!
This is pretty interesting in that it's yet another item that we all wish we had just for overkill purposes.
However, I doubt they'll sell many of these. The only places I can think of that would benefit from this are supercomputing institutes, but they often build their own redudant RAID systems and/or NAS systems.
It's nice and all, but seriously people, who's the audience?
Interesting calculation: If you live 80 years, that's 435.5 KB per second -- enough for a TV-quality video of your entire life.
Be relentless!
I was just thinking about how 4 years ago you could build a terabyte array for about $5-10,000 down from many millions 8 years ago. Today, you can get a terabyte for less than $500. In a few years, a petabyte is only going to cost $5,000. If you just buying space for future growth, it seems like a total waste of money.
I love the last line -- if you just want bragging rights -- these days that'll last 3 days.
The views expressed are mine own and do not express the views of my employer.
When will we have support for hardware raid in the 2.6 kernel?
(Then I can just make..my...own...*)
*Scale: 1:1000
--
http://wi-fizzle.com Ruby on Rails and stuff
Censorship is obscene. Patriotism is bigotry. Faith is a vice. Slashdot 2.0 sucks.
http://www.archive.org/web/petabox.php
By those who truly care about the human tradition, and spreading the music of the Grateful Dead and other freely available media.
Is this another slashvertisement?
Funny, the poster just before you said that this was
"enough for a TV-quality video of your entire life".
Conversely, this is enough for twice the pr0n you could
possibly watch (assuming you sleep and eat).
Let's assume for a moment that the average lifetime of one hard disk in this petabyte array is 6.5 years. Since there are 2,400 hard drives, that means that once this thing has been running for a while, you will be replacing, on average, one broken hard drive per day, for the entire lifetime of the array. That's about $350 per day in replacement parts alone!
"enough for a TV-quality video of your entire life".
;)
I watch my pr0n in HD
redundant array of independent disks array! redundant indeed. can you use this if you have a nic card? oh, and don't forget to keep your pin number and ssn number safe!
if i'm not immortal, what's the point of living?
...te?
this is enough for twice the pr0n you could possibly watch (assuming you sleep and eat)
Never heard of TV dinners?
1 Petabyte is enough pr0n?
:)
For who?
You're new here, aren't you?
Pretty Pictures!
I know some people need a lot of storage, but c'mon:
4 -million-petabyte-array/
http://www.engadget.com/2006/01/30/emc-rolls-out-
Four million petabytes?
Anyway, anyone else remember Arthur C. Clarke's "3001 The Final Odyssey"? By the time you start talking about petabytes, we could think about storing not just great works of art, but also the artist who created them. Let's see, some math is in order:
Fat artist: 75kg = 75000 g. Since we're 70% water, we'll say the average molar mass is 20g, so that's 3750 moles of molecules. Multiply by Avogadro's number and we get 2.2575 x 10^27 molecules. One petabyte is 10^15 bytes, so we need another 12 orders of magnitude before we can even devote 1 byte per molecule.
Even millions of petabytes leaves us short. Yawn, wake me up when we have 4 million zettabyte drives. Of course, by then the drive manufacturers will have redefined the prefixes again...
I pity you.
It's amazing how quickly storage increases and prices go down. On the other hand, it's interesting to keep in mind that as amazing as an iPod nano would be in 1985, the invention of paper was the single biggest leap in storage density we've ever seen.
"[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz
Each of your eyes has about 8Kx6K retinal detector cells, signalling at 40Hz. Nyquist sampling means we'd need 16Kx12K 24bit pixels to fill them at 40fps, or 368Gbps. Two ears at even 1Mbps falls into the rounding error, but 10:1 compression is a reasonable minimum expectation: 3.7Gbps. 100 years is 1450PB . The average American earns about $40K:y for 40 years, or $1600K. That means when we get $1:PB, we'll be able to afford to store everything we see and hear in our entire lives.
25 years ago, 10MB cost $5K to store, or $1:2KB. Today $1 stores 1GB, a 500K-fold increase. The 1500-fold increase to $1:PB is relatively around the corner.
--
make install -not war
We will never need one petabyte of storage.
and then 10 years down the lane
We will never need one exabyte of storage...
"Cue the 'finally enough storage space for my pr0n' jokes."
Yeah, THIS is the comment in this whole thread that should be modded redundant. Oh well, at least 'overlords' hasn't been modded up yet.
"I like to lick butts!" by MobileTatsu-NJG (#32700246) (Score:5, Informative)
What's funniest about this comment: It was modded redundant.
Honesty may be the best policy, but by process of elimination, dishonesty is the second best policy.
The likelihood of failure on any given day is not independent of any other day. Chance a drive will fail on the second day is pretty low (first day is pretty high, installation mess-ups and the like). Chances increase over time, so maintenance will follow a pattern of exponential increase. And in 6.5 years, I can have a 1PB raid under my desk at home.
This sig has not been evaluated by the FDA. It is not designed to diagnose, treat, prevent, or cure any disease.
The thing is built around 2,400 500GB hard drives.
I wonder when (if) the average consumer can get 1PB harddrives?
I don't know if Moores law applies historically to harddrives, but if doubling of capacity occured every 18 months and figuring 500GB is the limit size now and the doubling continues into the future:
500GB - Now
1TB - 18 months
2 - 36
4 - 54
8 - 72
16 - 90
32 - 108
64 - 126
128 - 144
256 - 162
512 - 180
1024TB = 1PB - 198months which is 16.5 years.
We were JUST suppose to have gotten to petabyte storage.... that is 1000 years from now... at this rate, by the end of the century we will have 100 petabyte drives (which in reality are only 1000 terabytes rather than 1024 terabytes because drive makers cheat you)
try your hand at cuniform, written with mud and a stick, or etch some runes with a bit of rock
Damn I really need to create my own business in the storage market. I am not exactly sure about what EMC provides in this $4 million package (servers, 24x7 contract, maintenance, hard disk replament, etc) but I KNOW how to create a 1 PB storage device for less than half the price ($2 million instead of $4 million). And I am pretty sure about my numbers...
I am sick of the current state of the storage market. Vendors are either designing unnecessarily expensive solutions, or are having HUGE margins...
A petabyte ought to be enough for anybody. And I mean it this time.
Fascism trolls keeping me up every night. When I starts a preachin', he HITS ME WITH HIS REICH!
I talked to someone who set up a Petabyte array many years ago (he had a *lot* of funding). It seems that when you put tens of thousands of disks together, your total mean time between failures becomes, well, for all those disks, every few minutes. They had someone full time whos job was to go around replacing disks all day.
The faster a disk spins, the more disk surface is exposed to the magnetic field used to write to the drive, so the less storage you have. Disk rigidity is important for two reasons - it limits how close the read head can get and it limits how precisely you can know how much disk surface has been visible. The faster you can either read magnetic fields or generate them, the less disk you need to write to, thus increasing storage. The distance of the read head determines the surface area exposed to the magnetic field on writing, so determines how far apart your data must be to not overlap.
A trivial question might be: Using a standard, existing hard disk (but modifying the controller as necessary) increase the capacity of a hard drive? The answer is "probably".
One way to do it would be to add enough RAM such that a fairly substantial portion of the disk can be held in ramdisk on the controller. Because you are then not reading and writing to the disk directly, but going through ramdisk, the speed of the drive becomes much less important. If you slow the drive down substantially, whilst writing to it at the same speed, the data won't be smeared over the disk as much, so you should be able to increase the density.
In practice, as disk manufacturers don't design their disks with that kind of mod in mind, you are very likely to run into significant problems with defects on the surface that simply aren't visible at 7200 or 15000 RPM. Other problems, such as stability (drives depend a lot on gyroscopic effects and aren't built to go slow), may also limit how much you can cheat on the density.
Another option would be to seriously cool the read/write head, so that you could flip the magnetic state faster. Again, you're limited. Mechanical devices don't like being freeze-dried - even when they ARE dry. However, you may be able to get some improvement that way.
If you're just looking for ANY increase in capacity, then that's trivial and requires no engineering (but some programming). Modern computers are very fast, compared to modern hard drives. If you have one physical sector per physical track, then break down the structure entirely in memory, you eliminate the need for inter-sector gaps, physical sector headers, etc. You might be able to squeeze out another 10%-15% by this method, which isn't a whole lot but isn't bad for the effort it would take.
There are very likely other mods that hard disk manufacturers could use, but which would be totally beyond anyone doing homebrew stuff. The platters probably aren't using the absolute ideal materials - let's face it, they're in business to make money and there are far more home buyers wanting cheap drives than there are perfectionists wanting perfect drives. I suspect there are other areas they could improve on, using existing technology, but won't because it's not economic.
That's probably why you see bursts of improvement. When there's a massive enough need for the extra storage, it can be achieved. When there isn't, it's not worth the extra investment.
It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
do you work for dreamhost? i just checked them out and am considering hosting with them, they seem pretty sweet.
They allready can, things like this will just save them money as they need to buy less of these than the old style things. $4million is really small money when it comes to important goverment functions.
I disagree. I bet you could fully encode 4 Japanese men, although you probably wouldn't have enough space for all their clothing.
Of course, 3001 was assuming that it would take a long time for a filesystem to be developed that could handle the 1 petabyte drives. This timeframe has since been sped up due to the fact Microsoft has announced that Vista will ship soon and has promised WinFS to be released by 3001.
Navicula hydraulica plena anguilarum est. Omnes castelli tuus nostri sunt. Ed elli avea del cul fatto trombetta.
Paper is better than stone or clay tablets, but is really not that much more dense (information-wise) than parchment or papyrus that came before it. I don't really think it's such a leap, unless you are talking about the cost/volume of information. Certainly once paper became mass-produced it was cheaper than anything else, but that was well after its invention.
dom
After the experiences that I have had with EMC gear in several different companies around the world I am suprised that anyone would still use EMC junk. Ohhh. but I guess EMC would support someone who is paying $4million for their storage gear rather than someone that only paid $0.5million... Since the company has started to use alternative vendors for storage we have had fewer problems, with higher performance and better support at a much cheaper price. And what is more interesting is that the EMC story I am talking about is a very common story from what I have seen. I would be interested to hear other peoples storage issues with EMC if they are not a Top100/500 company?
With a beast like this that fills up a whole room, anything else becomes a peripheral....
I recently began working at a TV Station (although not directly for them), and from what I've heard (not confirmed by an engineer though), they have at least a petabyte of storage.
;-)
It's not that surprising if you think about it... digital video takes a lot of space, and this is broadcast quality, some of it even high-definition.
Of course, it has been nicknamed "The Peta-file"
Is this really a cheap solution? I could get 145 7TB XServe RAIDs plus XServe G5 cluster nodes with XSan for about $2.3 million. Throw in a couple hundred $K for racks, etc. and I'm at around $3 million. What does this get me over an Apple (or other) solution?
Find me in ~/.sig
Chip.
On.
Shoulder.
But interesting none-the-less. Too bad it got modded into oblivion. Some of these "losers" could have learned a thing or two about industry. But then, you had to get all offensive.
You have to be _subtle_ with your barbs, so it flies over the heads of the moderators with PMS.
THIS THING CAN TURN ON A DIME, MACROSSZERO STYLE ALSO FUCK BETA, ~NYORON
You forgot the ^_^, bone smuggler.
I was just thinking about how 4 years ago you could build a terabyte array for about $5-10,000 down from many millions 8 years ago. Today, you can get a terabyte for less than $500. In a few years, a petabyte is only going to cost $5,000.
Law of Accelerating Returns
"I am the king of the Romans, and am superior to rules of grammar!"
-Sigismund, Holy Roman Emperor (1368-1437)
Just don't name it the Peta-file. :)
Rumor has it that Sony tried to use such a name for a tape system several years ago, until the North American team heard it. Not sure it that's true, even though it came from a friend that worked in Sony marketing in Canada at the time.
MadCow
I used to have a sig, but I set it free and it never came back.
That it says you can get 480TB for $250K, so two of those would give you 0.96PB for $500K rather than spending 8 times that to get the other 0.04PB?
Any sufficiently advanced man is indistinguishable from God
The score is already -1, so I don't need to mod it, but there isn't an option for "bad pun" anyway.
I keep a lot more than 50 days worth on line. And I get effectively more than 90% compression. And individual users can do their own restores from their own desktops.
Look at how dirvish works. Or rsnap, or rsync-incr, or rsnapshot, or ribs-backup, or indeed any tool based on Mike Rubel's basic idea.
I use a homebrew variation that is suited to my employer's unique needs and infrastructure. You may find it expedient to do the same. I don't save any metadata other than the snapshot date for each tree, and I use data mining techniques (well, actually I use find and gawk from command line) if I want to determine what's going on or how the system is doing.
It has run for years with no maintenance other than periodic OS patches. It is not our primary backup system because it does not support off-site archival, but it's well worth the investment for rapid restore of user-deleted files. I'll consider this array (I'm currently using linux soft raid 1+0 on two physically separate busses) when I need more disk eventually.
They should call it the "Petafile". that is a nice, innocuous brand-name, isn't it?
The Internet Archive Project http://www.archive.org/ is running on the PetaBox http://petabox.com/ rack system, which was commercialized by Capricorn Tech http://www.capricorn-tech.com/ more than a year ago.
a beast.html from nexsan http://nexsan.com/ which manages to pack 42 500 GB SATA drives into a single 4U rackmount box. With multiple RAID5 volumes and shared hot spare drives, this results in about 17-18 TB of usable file system space.
This system uses absolutely no board/controller lever redundancy, instead they use a separate file system on every disk, then mirror pairs of 1U units, and finally mirror the entire (mirrored) rack to a geographically distant location.
I am currently testing a much denser solution, the SATABeast http://nexsan.com/products/products/satabeast/sat
According to the nexsan engineer I spoke with today, they do so much burn-in testing of the Hitachi Deskstar drives they ship, that over the 15-18 month period they've used these drives, the total error rate has been just 0.4%.
Even if these numbers are somewhat skewed due to many systems (i.e. drives) being relatively recently installed, it is still very impressive.
For our setup we plan to use multiple full boxes, each connected to a separate NFS server. Each server has multiple FC host adapters, so if a server crashes, the corresponding box can be connected to one of the other servers.
We will also use rsync to mirror all data across the country to a secondary site.
Terje
"almost all programming can be viewed as an exercise in caching"
Very funny :-)
But the parent point still stands, all because of a pesky little OS called Solaris and that danged ZFS it has...
Where we work, the IT department has switched to EMC for all the major storage hooked to our UNIX systems, and we are talking hundreds of terabytes. They brag about how EMC is redundant, and doesn't fail. Every 6 months or so we have serious problem with these "redundant never fail" systems that causes loss of work for a day or more for large portions of the company. We pay big bucks for these EMC systems, and removed other brands that never gave us any problems. Have other folks had these problems with EMC?
Imagine how much the RIAA would want if you filled it with MP3's!!!!!!!
this should be enough to store hashes of all possible 8-charachters password for 92 keys. More or less, I mean.
Man, thank god that windows has 256-chars password length
(it's all fiction, I made the numbers up. but I'm pretty sure about the size of the hashes db..)
-- There are two kind of sysadmins: Paranoids and Losers. (adapted from D. Bach)
4000 bucks per terabyte sounds a little pricey. Whatever happened to economy of scale? On the other hand I get $3900 bucks for the price after 10 generations of splitting the price by 2. So figure in 10-15 years you'll have that petabyte, but by then you'll be drooling over the sextabyte or whatever it's called. (insert puns here)