Slashdot Mirror


640gb PCIe Solid-State Drive Demonstrated

Lisandro writes "TG Daily reports that the company Fusion io has presented a massively fast, massively large solid-state flash hard drive on a PCIe card at the Demofall 07 conference in San Diego. Fusion is promising sustained data rates of 800Mb/sec for reading and 600Mb/sec for writing. The company plans to start releasing the cards at 80 GB and will scale to 320 and 640 GB. '[Fusion io's CTO David Flynn] set the benchmark for the worst case scenario by using small 4K blocks and then streaming eight simultaneous 1 GB reads and writes. In that test, the ioDrive clocked in at 100,000 operations per second. "That would have just thrashed a regular hard drive," said Flynn. The company plans on releasing the first cards in December 2007 and will follow up with higher capacity versions later.'"

49 of 324 comments (clear)

  1. Oblig. by Anonymous Coward · · Score: 5, Funny

    640gb ought to be enough for anybody.

    1. Re:Oblig. by GodfatherofSoul · · Score: 2, Funny

      Maybe for your pr0n collection ;)

      --
      I swear to God...I swear to God! That is NOT how you treat your human!
    2. Re:Oblig. by ady1 · · Score: 5, Insightful

      Its not the size of the harddrive which is amazing. Its the read/write speed.
      Even if you get a 32GB model, you can install windows on it and use the regular SATA2 HDD for movies/music storage. Think of the booting time.

    3. Re:Oblig. by zippthorne · · Score: 2, Interesting

      Which brings to mind an interesting point..

      Why even have swap files? Shouldn't caching decisions be done a bit more intelligently at the application level? I have 10 times more RAM in my current PC than all of the memory (including the HDD) on my first PC. At some point, can't we drop swap?

      --
      Can you be Even More Awesome?!
    4. Re:Oblig. by jamie · · Score: 2, Informative

      Once you get above $500 desktop computers, it doesn't much matter. A properly tuned system will only use swap, if at all, to drop a few MB from RAM to disk because it's just never accessed. A server that swaps during use is just not set up properly.

    5. Re:Oblig. by walt-sjc · · Score: 2, Insightful

      True, you have a lot more RAM now than you used to... BUT, you are also not running DOS anymore. Hell, MS Office alone take 256M just to load Word (I KID...) But seriously, modern apps are huge. Because memory is cheap, apps are not efficient. We also use our systems differently, working with massive files (images, movies, music) and tend to run lots of apps at the same time, including virtual machines. Yes, you can still run windows in 512M, but it seems that even 2G isn't enough at times.

      So we still need swap.

    6. Re:Oblig. by norton_I · · Score: 5, Informative

      You can tell if flash is bad, if worst comes to worst, by reading after writing. Or reading after erasing, and looking for stuck 1's.

      'worn out' flash doesn't spontaneously change state. Bits just get stuck and don't erase correctly.

      I don't know how flash drives actually handle this, but it isn't magic or impossible to fix.

      Also, the lifetime of modern flash is long enough that it is hardly an issue any more, even for normal desktop use. Maybe you don't want to use it for swap *IF* you swap a lot, but given the cost is in the same ballpark as RAM, you could just buy more RAM.

    7. Re:Oblig. by Lisandro · · Score: 3, Informative

      our childish "uh oh" introduction, your completely un-cited "a lot of issues" comment, and your vague "I recall" interruption reveal the fact that you're spouting off some crap on a subject you have no direct, real-world experience with. Find another subject with which to stroke your ego kid, because you're looking like a pompous dumbass on this one.

      The specs for a 256Mb NAND flash memory chip by Samsung (which is by far the biggest NAND flash manufacturer today) quotes 100k millon write/erase cycles, and this is for an IC commonly used in USB pendrives. The figure usually tends to get worse with increased memory sizes since the memory "element" (float gate) becomes smaller. For example, Modern 16Mb chips, which are the ones i have experience with, usualy quote 1 million W/E cycles endurance.

      But, it felt good stroking my ego a bit more :) Thanks!

    8. Re:Oblig. by Anonymous Coward · · Score: 3, Informative

      You can also tell if FLASH is verging toward failure on any particular block by timing the erase cycle. As the part ages out, the erase cycle takes longer. You can also use that information to enhance wear leveling.

    9. Re:Oblig. by networkBoy · · Score: 2, Informative

      No you would boot directly from it. GP doesn't understand BIOS very well. This could hook into the boot process much the same as any RAID controller does.
      -nB

      --
      whois gawk date unzip strip find touch finger mount join nice man top fsck grep eject more yes exit umount sleep dump
    10. Re:Oblig. by AK+Marc · · Score: 2, Informative

      Not in Windows. It has a swap file by default. If you disable swap, it still swaps. If you have swap enabled and have something running in the background and come back hours later, it's so slow it's painful, since everything gets swapped to disk except the one thing running. "Tuning" may be possible, but well outside the realm of the average user, and even completely disabling swap in all settings, you still have swapping happening.

    11. Re:Oblig. by Eccles · · Score: 3, Informative

      Oh yeah, how does a solid state drive handle fragmentation? I have heard that they don't fragment, but not from reliable sources, and I just don't see how that is possible unless there is some built in mechanism to close gaps on the fly or something.

      Hard disks have a fragmentation issue because sequential accesses are much faster than random ones with a spinning disk. Each time the next sector to get isn't right after the previous one, the head must seek to the start of the next track. Solid state "disks" have true random access, where accessing blocks in random order costs no more than sequential accesses. So while solid state will fragment, it doesn't matter for performance or reliability.

      --
      Ooh, a sarcasm detector. Oh, that's a real useful invention.
    12. Re:Oblig. by Cantinflas · · Score: 5, Interesting

      I don't know where you got your info, but 'modern' flash has horrible life-expectancy in terms or write-cycles.

      Explanation:

      Most flash vendors have moved to MLC (Multi-Level Cell) flash. It's cheaper and denser, but the bit-error rate goes up because you have more bits per cell. The typical life expectancy for MLC is somewhere in the range of 10,000 writes using single-bit error correction. This is compared to 'older' SLC flash which has a write endurance of 100,000 to 300,000 writes.

      Now, most vendors making media out of flash take varying degrees of a combination of two approaches (in addition to standard wear-leveling approaches). The first approach is to assume that the majority of the users will only ever store audio or video data so the occasional uncorrectable error won't have much impact as long as it doesn't corrupt the filesystem. The second approach is to use more advanced error correcting algorithms to compensate for the higher bit-error rate.

      Using more advanced algorithms, it's possible to get more than 300,000 writes out of a MLC flash-block before the errors become uncorrectable.

      P.S. I may be wrong, but I believe flash can have some really odd error conditions. For example, it's possible to disturb a bit in a block just by reading it. I believe it's also possible to disturb a bit in a different block on the same matrix when writing. That's why some form of error correction is always required with NAND flash.

    13. Re:Oblig. by darkpixel2k · · Score: 3, Funny

      So while solid state will fragment, it doesn't matter for performance or reliability.

      Yes, but it *needs* to be defragged because I *hate* seeing red in my disk analysis...

      --
      There's no place like ::1 (I've completed my transition to IPv6)
    14. Re:Oblig. by TheLink · · Score: 2, Informative

      "Solid state "disks" have true random access, where accessing blocks in random order costs no more than sequential accesses"

      AFAIK for flash, sequential access is still faster than random access, even for NOR flash.

      It's quite amazing how slow some flash is (esp NAND flash). Some are 1ms for random access, and others are even 7ms.

      For comparison a 15krpm drive has a random access time of about 5-6ms. So if you have RAID10, you might get similar or faster speeds, than a flash drive 10x the total price of a RAID10 system with similar capacity.

      --
  2. Uhh, Price? by mosel-saar-ruwer · · Score: 4, Interesting


    Who, what, when, where, why?

    Price would seem to be a pretty important detail...

    1. Re:Uhh, Price? by thegnu · · Score: 5, Funny

      I think it's safe to assume you won't be buying one soon.

      --
      Please stop stalking me, bro.
    2. Re:Uhh, Price? by morgan_greywolf · · Score: 5, Informative
      FTFA:

      So how much will these cards cost? Flynn told us that the company is aiming to beat $30 dollars a GB, something that should seem very cheap to large corporations, adding "You can drop ship or Fedex this card and be up and running in a few minutes... you can't do that with a storage area network." So, let's say they get to $29 a GB, a reasonable price for NAND flash-based memory devices. 640*30==$19,200. Sorry, but that doesn't seem to beat an inexpensive SAN in price. I recently priced out a 12TB iSCSI SAN for a little bit more than that, and even 1-2 TB fibre SAN from IBM should be around the same price.

    3. Re:Uhh, Price? by CastrTroy · · Score: 2, Insightful

      I could buy a 10 GB drive for most of my OS and software, and just keep my media on a traditional hard drive. You don't need a super fast drive for your MP3s and Videos, but it would be nice to increase boot times as well as application start up times.

      --

      Anthropic principle: We see the universe the way it is because if it were different we would not be here to see it.
    4. Re:Uhh, Price? by walt-sjc · · Score: 4, Informative

      Um, we were using large RAM disks (the kind that hooked up to SCSI, had a built in UPS and disk to dump RAM contents) many years ago (8 now?) to speed up databases. That was limited by the SCSI bus, but access time and latency were near zero (which was awesome.)
      Of course, large back then meant 4G, and the average hard disk was 9G. This is evolutionary, not revolutionary.

    5. Re:Uhh, Price? by walt-sjc · · Score: 2, Insightful

      Um dude? He is quoting the FA. It's pretty clear that this drive will release at prices closer to $30/G. Maybe next year they can get down to $8...

    6. Re:Uhh, Price? by nuzak · · Score: 2, Interesting

      > Of course, I don't personally know if PCIe is hot swappable

      It is. But so is PCI -- it's just a matter of whether the physical and electrical connections on your hardware allow it, and whether your OS is set up to handle it. I wouldn't recommend yanking out your video card while the PC is on.

      --
      Done with slashdot, done with nerds, getting a life.
  3. And another question. by AltGrendel · · Score: 4, Insightful

    What's the MTBF?

    --
    The simple truth is that interstellar distances will not fit into the human imagination

    - Douglas Adams

    1. Re:And another question. by LiquidCoooled · · Score: 4, Informative

      Hitachi are saying that they have solved the overwrite problem (at least mitigated it by a factor of 100)

      They appear to want to use normal DRAM memory for the running of the drive but then write it permanently to the NAND flash at shutdown/memory full time.
      I would assume this involves charging of a small battery and dumping the data later on.

      http://www.theinquirer.net/gb/inquirer/news/2007/09/26/hitachi-reckons-solid-state

      --
      liqbase :: faster than paper
  4. Wow by thatskinnyguy · · Score: 3, Interesting

    I could imagine using this as an OS drive. No sooner do you let your finger off the power button than the login screen appears.

    --
    The game.
    1. Re:Wow by FiveLights · · Score: 4, Funny

      You are standing in a field, facing East. A river runs East and West to the North of you and a path runs to the South and East.

    2. Re:Wow by HarvardAce · · Score: 3, Insightful

      You are likely to be eaten by a gnome.

      --
      Note to self: Stop putting jokes in my insightful comments so I can get something other than +1 Funny!
    3. Re:Wow by east+coast · · Score: 4, Funny

      Yeah? How does WoW play on your C64?

      It plays very nice.

      [turns to the other Commodore users] I told him it plays very nice [chuckling from users]

      Now! Go away or I shall taunt you a second time.

      --
      Dedicated Cthulhu Cultist since 4523 BC.
    4. Re:Wow by zlogic · · Score: 5, Interesting

      My PDA boots in about 30 seconds, and it doesn't have a harddrive. Booting isn't just loading stuff from a drive, it's hundreds of tests like
      - hardware changes
      - hardware initialization (e.g. loading firmware)
      - searching for drivers
      - applications acquiring and releasing resources and checking for stuff like library versions, user names etc.
      That's why BIOS initialization often takes time, and yet it works even if the system has no drives.

      The only way this would work is hibernating, but hardware would still need to be initialized.

    5. Re:Wow by eatont9999 · · Score: 2, Informative

      I hope you guys know that a maximum data transfer rate of 800Mb/s is 100MB/s. We don't know if it is sustained or peak so we are not even guaranteed that. Maybe it does not even matter. A few years ago they came out with this thing called SCSI U320. It is an interface that can support drive speeds up to 320MB/s per channel. If you have a decent controller and a RAID 5, you can get very close to 320MB/s sustained data transfer rates. Don't even get me started on SAS arrays! Sure it may be a little too enterprise level for some people, but before spending the money on unproven hardware, I would choose a proven robust interface. Now, if the author made a mistake and these things run at 800MB/s, then we have a new issue in which we can start talking about 4-10Gb/s Fibre SAN arrays, but I'll leave that for another time. Sorry to shit in anyone's hat, but those are the facts man.

  5. on behalf of all of slashdot, i would like to say by circletimessquare · · Score: 4, Funny

    (drool!)

    --
    intellectual property law is philosophically incoherent. it is your moral duty to ignore it or sabotage it
  6. $30 per gb, ouch by dnamaners · · Score: 2, Insightful

    Its fast, but not as fast as I would have hopped with parallel access. They better get the speed up or the cost down to hit it big. Right now I'd take either direction, as they both have decent applications. Good progress though, time will tell.

  7. Re:Still Expensive by Jeff+DeMaagd · · Score: 4, Insightful

    I hope this means that laptops and large capacate media players with extremely long battery life are not too far away.

    I think people expect too much from SSDs. The hard drive is far from the dominant power consumption component in a notebook. The CPU, chipset, GPU and display panel each consume more power than a notebook hard drive does. If you follow a modified version of Amdahl's law (not a law, but whatever), you want to fix the biggest problem first, and that is either the display or CPU. An LED backlit display can save some power, and running a lower power rating CPU saves power too. Compared to that, the savings of swapping HDD for SSD is negligible. On a standard notebook, I think you might add 15 minutes to battery life, which is still far from "extremely long battery life".

    In media players, doubling in capacity every year is a reasonable expectation.

  8. $30 bucks a gig by niola · · Score: 2, Informative

    According to the article, they are looking at pricing to be 30 dollars a gig. That is pretty pricey.

    That means their low-end 80GB drive will be around $2400+ or so US dollars depending on tax, shipping, retail prices etc.

  9. Re:Flash Memory == Vanished Data by Anonymous Coward · · Score: 3, Funny

    should its battery fall out

    The fuck?

    by egg troll (515396)

    Oh, carry on.

  10. Misleading benchmark by Chris_Jefferson · · Score: 4, Interesting
    "'[Fusion io's CTO David Flynn] set the benchmark for the worst case scenario.."

    By which he means, set up a completely unrealistic benchmark which shows his flash drive in the best possible light, and a traditional drive in the worst possible light.

    I still want one of these, but that benchmark is nothing to be proud of.

    --
    Combination - fun iPhone puzzling
    1. Re:Misleading benchmark by stonecypher · · Score: 2, Insightful

      "[Fusion io's CTO David Flynn] set the benchmark for the worst case scenario.."
      By which he means, set up a completely unrealistic benchmark which shows his flash drive in the best possible light, and a traditional drive in the worst possible light.
      No, the scenario he set up is a classic worst case scenario for drives, one which is well known to large disk usage corporations: near-random access by hundreds-or-thousands of concurrent users. This is what SANs are built to address, this is what parallel RAID is so good at, and yes, it so happens that this new drive is also good for it, because there's no seek time.

      Just because they've successfully approached a real problem doesn't mean that describing the real problem is flimsy propoganda. Knee-jerk reactions don't happen because you're a knee.
      --
      StoneCypher is Full of BS
  11. Wow, $19,200 for 640gb by Acecoolco · · Score: 2, Informative

    Damn expensive, $19,200 for 640GB......... I want it but cant afford it.. Josh

    --
    Just because it works, Doesn't make it right. - JTM
  12. Just as expensive as RAM by mathx · · Score: 2, Informative

    considering that large quad-socket boards have space for 8 Dimms per CPU we're looking at 128GB+ per machine and soon to increase. 640Gb isnt that much bigger. Since it's on the memory bus and not a PCI-* bus, its going to be faster than these drives, and it's more expensive right now. By the time they're at $30/GB ram will be alot less than that. The automatic persistence (without scheduling to back the memory to disk, like RAM would need) is the only advantage, so you're putting a high price on that ability - a database with constant accesses that need to go to permanent non-loss storage with a power out.

    If you really have such an environment, I would think that fixing your HA setup would be a priority first - duplicating your servers so they can take eachother's jobs over and providing redundant power. I dont even want to know how many xactions/second a properly memory-stored database can do (once you get rid of the filesystem and driver layers, which this thing would require), Im sure many many more. While disk wont take as many xactions/sec, you can always back dirty ram to disk in huge chunks (1 meg blocks or more) to avoid having to need 100K IOops.

    I just dont see any advantage properly tuning your server and process environment couldnt achieve with commodity unspecialized cheap easy to replace parts and a few brains.

  13. HardCard! by jeffy210 · · Score: 2, Interesting

    It's the return of the HardCard!!!! I remember having one of these with my old PC/XT. It was a 20 MB HardCard that fit into an ISA slot. The first ever hard drive i had running Windows 3.0 with DOS 3.3 on it.

    --
    ------
    "And may your days be long upon the earth."
  14. Talking to the company at demo by shdowhawk · · Score: 5, Informative
    Having talked to people at demo, what it pretty much came down to is this... is this a product we should be excited about? Definitely ... is it something that will do well right away? Not at all. The price has to drop before this becomes a really valid and useful tool for the GENERAL PUBLIC / Company. But there are a lot of companies out there willing to pay too much money to get these. Hopefully these big companies buy these up and fund this project as QUICKLY as possible. 7 of these side by side at 320/640 gigs a piece is a SCARY/powerful server.

    That being said, a few of the guys there said that they pretty much expect these (at the beginning) to do the best sales for companies that are looking to get really really fast database servers going. NOT for scsi san replacements (it's silly to spend $100,000 for something you could get for 10,000 hard drive space wise). Eventually as the price drops... i know of a handful of people who would EASILY pay 1000$ to get one of these on a gaming rig even if it was only 100 gigs. But that right there is already 1/3rd of the price of what it currently is. (assuming it's around 30$ a gig).

    Another thing to keep in mind that came up in the conversations... since these are tiny, think about the cost per server rack... and think of the cost per electricity to run. If you take those into consideration, these are actually less expensive that most people would think! A massive rack of hard drives could cost a lot of money in a co-location ... and a lot of electricity to run it all... But then again, we're talking about savings on servers, not general in home use.

    When this gets to about 1/3rd of it's current price, that's when you will see these things become TRUELY mainstream both to the average company and home users (be it rich ones who need the latest and greatest).

    Fusioni-io -- Link to their site.

  15. Re:write limit? by Lord+Ender · · Score: 3, Insightful

    That's a standard feature these days. I want to know how many times I can overwrite the entire disk with /dev/random. If I did a "dd if=/dev/random of=/ioMemory size=[size of ioDrive]" how many times could I execute that command before the thing goes tits up?

    --
    A slashdotter who didn't build his own computer is like a Jedi who didn't build his own lightsaber.
  16. Did anyone notice by colourmyeyes · · Score: 2, Interesting
    That these are initially Linux-only?

    From TFA:

    Linux drivers will be included and Flynn said Windows Server, XP and Vista drivers will be available three months after that.
    --
    My grandmother used anecdotal evidence all the time, and she lived to be 120 years old.
  17. Yes but.... by Kildjean · · Score: 2, Insightful

    They are going to sell it at $30/per GB. Now lets do the math: 649GB * $30 = $19470....

    With that amount of money I buy a Mac Pro with 8 Cores and a 1Tb Raid or a 1TB San.... I think Solid State has to grow cheaper before we consumers can jump the gun at it... but, like hard drives back when they made the jump to GB, it will be awesome to see SolidState HD in systems, better then the clunky magnetic disks we currently use...

    Even for a corporation telling them a 649Gb Solution is going to cost them $20K they will flip you for it.

    --
    Nom de dieu de putain de bordel de merde de saloperie de connard d encule de ta mere.
  18. Re:write limit? by TinyManCan · · Score: 4, Informative
    Somewhere between 100k and 1 million times.

    Cosidering that this drive is 640GB, that means you would need to write somehwere in the region of 61 PETABYTES of information.

    You'd have to write to the drive at a perfect 800 MB/s for 941 days to hit that mark.

    It could last as long as 30 years, at full write speed of 800 MB/s if it can handle 1M writes per cell.

    At the end of the day, semiconductors this large and high quality are certainly better than tiny bits of rust on rapidly spinning platters.

  19. Why the product rocks.... by tempest69 · · Score: 2, Informative
    This device is all about the IO/second.. a 12 TB SAN cant come close..

    If your looking to run a blast/darwin query on 50k files to find the closest match to an unknown dna sequence Either you need to recode a bunch of software to use sql, or you snag a piece of hardware that gives database level performance. 80gigs at $2400.00 is a bloody bargain.

    The device is also 10x faster in bandwidth than a normal drive which is comparable to a san, but not such a power hog.

    So really the tradeoff rocks for small files. It doesn't have a controller interface latency so its really quick. It should mask a good chunk of hard drive based lag.

    Storm

  20. FAQ by ZapmanFBM · · Score: 2, Informative

    For a lot of these questions, you can look at the FAQ here

  21. Re:write limit? by TinyManCan · · Score: 2, Informative

    Certainly they are using wear leveling. They are probably (information is thin) also over allocating storage, such that 10-15% can fail before impacting the advertised free space on the device. In reality you will see your full 640GB of storage, which you could write 61PB of data to the very first sector on the disk over and over and never experience any issues. Before the last 'extra' block is used up, you'll get an alert and replace the device.

  22. 800MB/s and 600MB/s NOT b/s by master811 · · Score: 2, Insightful

    I hate to nitpick but the TFR uses MB/s not Mb/s, remember there is 8x DIFFERENCE between the two, otherwise 800Mb/s sounds a lot less impressive (its pretty much the same as current raid-0 (ish)