Slashdot Mirror


A Semi-Radical Approach To Avoiding fsck

Dru writes: "This is an article about a hardware technology that is largely unknown in the new Unix community. In theory, with this inexpensive hardware, your BSD or Linux box could start doing guranteed reboots in under 2 minutes (no fsck required) and super fast database writes. It could leapfrog all of the journaling filesystem projects as well. Yes, I wrote the article. The article is long, detailed, and mentions FreeBSD often. However, I do believe it is relevant to any other PC Unix. If enough people learn about it, maybe they will start demanding it from their favorite hardware vendor." With RAM and hard drive space both continuing to decline, I wonder how the speed / use curve for individual PCs' storage (from L1 cache to backups) will evolve. With a similar bent, Arek urges you to "take a look at our company's Solid State Disk Drives." How'dja like 8 or so gigs of DRAM next time you edit a video or burn a CD?

116 comments

  1. Re:Just mod down please. by PovRayMan · · Score: 1

    Thanks. I was about to change the url to betanews.com to read that when I saw an article with no comments. It was posted at 3:21 and I went, hey.. I might have a chance to become like all those other /. lamers and go for a first post!

    Thanks for not trolling on me for beating you to the first post, Tronster.


    ----------

  2. Hard drive space declining!? by jonnythan · · Score: 2

    With RAM and hard drive space both continuing to decline
    Did he mean "RAM and hard drive price both continuing to decline"?
    1. Re:Hard drive space declining!? by swb · · Score: 1

      Hey, around here free hard disk space is continuing to decline all the time. Waiting for one of those after Christmas deals on a new disk.

  3. Interesting idea but.. by sith · · Score: 1

    I guess its a neat idea, but it seems like a hack to get around using a JFS. Maybe a Journalling FS takes some time to implement, but i run reiserfs on my laptop just fine with no problems (cept that time I forgot to include it in a kernel compile and rebooted... oops...). This is a lot of work to go through just to get around writing some software.

    Also, isn't this going to be a pretty significant load on the PCI Bus? Whats the latency on a pci transaction? It seems like you could run into all sorts of troubles there and perhaps end up slowing down your system with all the traffic..

    Or perhaps the pci bus is loads faster than I imagine?

    1. Re:Interesting idea but.. by jmp100 · · Score: 4
      You'd have to implement a completely separate bus or you'd risk getting severely bogged down. You'd have to make a dedicated bus that went from the CPU to a dedicated slot and then to the hard-drive controller. Doing this on a PCI bus like that which exists today would not be particularly efficient. Certain IDE and SCSI drives talk at 100MHz and up; having disk I/O passing over the PCI bus TWICE (CPU -> TRAM -> HD) isn't the way to go, since your Ethernet, video, and other PCI devices are also competing for time on the bus.

      Of course, implementing a separate bus will take millions in research (after all, it has to be done right), but once everything is decided on, it's probably only $20 or so in extra hardware. In theory, all you'd need is another PCI bridge chip or similar. Ever seen the inside of a NetApp? The motherboard has a CPU, space for RAM, a PCI bridge, and some slots. Nothing else. Extremely simple.

    2. Re:Interesting idea but.. by |_uke · · Score: 1

      actually.. if you slap in a highpoint controller, you could have the ide controller/tram all on one card. Data would only need to travel once over the bus.

      --
      Luke
  4. Interesting Idea, but... by jester_uk · · Score: 1

    If it's mission critical it should be on a UPS with controlled shutdown. No fsck on reboot.

    If you're talking about the actual in-case PSU going pop, then reboot time is the least of your worries.

  5. Re:I'm a geek by jonnythan · · Score: 1

    I must question the sanity of someone who mentions "Fight Club" in a list of otherwise brilliant, distinguished works of art ;)

    And remember, he was only god as long as someone believed.

  6. Website unclairity by el_munkie · · Score: 1

    Can someone find a price on the 2 Gig HD from Platypus? The website provided is nonnavigable, and I cannot find a reseller that displays a price. It seems like a cool idea to have a big-ass RAM HDD, but I know it will be expensive.

    1. Re:Website unclairity by da5id · · Score: 2

      The website provided is nonnavigable

      Yes, it is a crapy website, after about 15 min I dicovered you click on a image to get to the 'qukdrive' info page here: http://www.platypus.net/pages/products_qikdriv2.ht ml.

      I havent dled the pdf, but looks like no price info. Does offer drivers tho, so probably not vaporware.


      echo $email | sed s/[A-Z]//g | rot13

    2. Re:Website unclairity by da5id · · Score: 1

      P.S. Red River Computer Company sells them for $3,874.00 for the 1GB model and $63,275.00 for the 8GB model.


      echo $email | sed s/[A-Z]//g | rot13

    3. Re:Website unclairity by da5id · · Score: 1

      P.P.S CDW sells the 2GB model for $5,768.55 (Special Order)


      echo $email | sed s/[A-Z]//g | rot13

    4. Re:Website unclairity by da5id · · Score: 1

      Gad, if I was just patient I would collate all this info into one post, but no.

      For a list of all platypus products, and their prices at CDW check out this search: http://www.cdw.com/shop/search/results.asp?key=pla typus


      echo $email | sed s/[A-Z]//g | rot13

    5. Re:Website unclairity by Tet · · Score: 2
      Can someone find a price on the 2 Gig HD from Platypus?

      No, but I'd bet on it being lots of money. We looked at solid state drives at a previous company. Since it was for mission critical stuff, we were looking at a RAID5 array of them, and it was priced at over £200,000 for a modest sized array -- something that would cost probably a tenth of that with conventional drives.

      --
      "The invisible and the non-existent look very much alike." -- Delos B. McKown
    6. Re:Website unclairity by John+Paul+Jones · · Score: 1

      Yeah, contact sales@ininet.com for pricing information.

      As an aside to the main topic, I've been the primary US beta-tester for these QikDrives under NT/2K/Solaris/Tru64/Linux/FreeBSD. I consult to Platypus, and assist their engineering team. This is good stuff, if your application can utilize it effectively.

      I have an Ultra10 with the 2GB half-card in it now.

      -JPJ

      --
      Feh.
  7. RAM is VERY cheap now, too. by Wakko+Warner · · Score: 2
    With the current RAM prices, a product like this could become very feasible. Right now, you can get 1GB of PC133 SDRAM for under $350 (based on Pricewatch's best prices.) A single $70 256MB DIMM would be plenty for a device like this, and adding a few more DIMM slots for do-it-yourself upgrades would certainly strengthen its market appeal.

    Just something to think about for those still skeptical...

    - A.P.

    --
    * CmdrTaco is an idiot.

    --
    "Remember when the U.S. had a drug problem, and then we declared a War On Drugs, and now you can't buy drugs anymore?"
    1. Re:RAM is VERY cheap now, too. by da5id · · Score: 1

      RAM is VERY cheap now, too.

      But of course they want you to buy their ram, at $7,779.60 for 1GB!

      I would realy like to know if you can just slap anyold SDRAM in them or what. And also if there are any alternitives to the actual board on the market, as $1,969.40 for the board and 512MB of ram is a bit steep even to get started (assuming you could add SDRAM at market price)

      BTW I am getting these prices from http://www.cdw.com, I don't suppose it gets much better, but any one have other info?


      echo $email | sed s/[A-Z]//g | rot13

    2. Re:RAM is VERY cheap now, too. by norrisd · · Score: 1

      That's not too bad. I've just sorta finished a bunch of e-mails to my computer addict uncle about the possabilities something like a 15 second boot on pc's if allthe important stuff could be stored electronicly.
      it seems like there have been a few articles in the past about instant boot stuff from flash rom drives. It's totally possable with what we've got going witht he technology we've got going right now with the falling prices and all.
      something like the instant on pds's/ce devices would be nice (without the ce or course.)

    3. Re:RAM is VERY cheap now, too. by Howie · · Score: 1

      What I'd like is an option in (insert OS here) to allow me to say "Now, I'm not going to add any more new hardware, or move anything around, so stop trying to detect EVERYTHING every time I start up, and write some sort of static image, and boot in 3 seconds next time".

      What is Windows (in particular) *doing* during that time? I have a processor that can do hundreds of millions of instructions per second, it can't really be actually processing during that time...

      --
      "don't fall into the fallacy of believing that Perl can solve social problems. Maybe Perl 6 can, but that's a ways off"
    4. Re:RAM is VERY cheap now, too. by jrcamp · · Score: 1

      I think part of it is processing the registry. It has to also load all of those .VxDs that control your hardware. Then it has to initiliaze all of it.

    5. Re:RAM is VERY cheap now, too. by Tenareth · · Score: 1

      In RedHat, disable kudzu. Reenable when you add hardware. Simple.

      -- Keith Moore

      --
      This sig is the express property of someone.
    6. Re:RAM is VERY cheap now, too. by RelliK · · Score: 1
      But of course they want you to buy their ram, at $7,779.60 for 1GB!

      huh? 256MB RAM costs $180, so 1GB is $720. If you are talking about a single 1GB module, that's not $7k either. Crucial sells it for $2429.
      ___

      --
      ___
      If you think big enough, you'll never have to do it.
    7. Re:RAM is VERY cheap now, too. by da5id · · Score: 1

      If you are talking about a single 1GB module, that's not $7k either.

      No, I am talking about the Platypus ram they want you to use.


      echo $email | sed s/[A-Z]//g | rot13

  8. don't bet on it. by Wakko+Warner · · Score: 3
    UPS batteries only last a few years. What happens when yours fails a few months or (if you get a defective one) a few years before its expected lifetime is up? Never, ever count on any of your computer equipment, and always have as much protection as you can afford. This is one means toward that end.

    - A.P.

    --
    * CmdrTaco is an idiot.

    --
    "Remember when the U.S. had a drug problem, and then we declared a War On Drugs, and now you can't buy drugs anymore?"
    1. Re:don't bet on it. by Alakaboo · · Score: 2

      That's why you hook the UPS up to the serial port on the machine, so when it starts to die a violent death it'll signal the system to do a clean shutdown. Of course, this is assuming you bought the nice, well-engineered UPS as opposed to the dirt cheap ones you get at Best Buy for $40!

      The nice UPS units do a few things to insure this doesn't happen:

      1. volt-o-meter on the circuit voltage
        Doesn't just wait for the source power to go out, checks the internal power for sudden drops.
      2. more than one battery
        If one dies, the next one struggles forth long enough to send the shutdown signal.
      3. use batteries that don't die
        IIRC, the rechargeables they use in those bad boys don't just die when they have a problem, they slowly fade out. Plenty of time.

      This doesn't take into account the messy situation that occurs when someone yanks the plug. But if this ever happens, it's time to get a new network engineer/sysadmin/intern.

      One last idea: plug the UPS into a UPS into the wall. Mua!

      Alakaboo

    2. Re:don't bet on it. by fatphil · · Score: 1

      "
      One last idea: plug the UPS into a UPS into the wall. Mua!
      "

      No! That makes the UPS nearer the computer a critical failure point.
      _Parallelise_ the UPSs, rather than serialising them.

      FatPhil

      -- Real Men Don't Use Porn. -- Morality In Media Billboards

      --
      Also FatPhil on SoylentNews, id 863
    3. Re:don't bet on it. by Stormgren · · Score: 1

      Well, for the truly "Mission Critical" you get a UPS that's directly wired into the computer room wall outlets, with hot-swap components and dual parallel inverters.

      They make those, you know.

      "All those tubes and wires and careful notes!"

      --

      "All those tubes and wires and careful notes!"

    4. Re:don't bet on it. by gimpboy · · Score: 1

      One last idea: plug the UPS into a UPS into the wall. Mua

      that may have been what the manual from the cheap ups you got a best by for $40 have said. if you ever buy a nice ups, it should clearly state in the manual _NOT_ to do this-it does more harm than good. kinda like wearing 2 condoms at the same time.

      use LaTeX? want an online reference manager that

      --
      -- john
    5. Re:don't bet on it. by Tenareth · · Score: 1

      Simple, All modern servers have at least 2 power supplies, plug each redundant (usually hot-pluggable) power supply into a different UPS.

      This is how we handle all of our critical servers.

      Now you are protected against Power Failure, UPS Failure, and Power supply failure.

      -- Keith Moore

      --
      This sig is the express property of someone.
    6. Re:don't bet on it. by fatphil · · Score: 1

      Very roughly.

      In series:
      Source failure: protection = 2
      UPSfar failure: protection = 1
      UPSnear failure: protection = 0
      near/far measured from view of the server.

      In parallel:
      Source failure: protection = 2
      UPS failure: protection = 1

      See, there are no protection=0 scenarios except double UPS failure. (and serial is equally bad for that).

      Use parallel UPSs for _hot swappable_ UPS setups!
      It's the only truly redundant way for 24/7 systems.

      FP.


      -- Real Men Don't Use Porn. -- Morality In Media Billboards

      --
      Also FatPhil on SoylentNews, id 863
  9. Too much work by ericfitz · · Score: 4

    A write-caching disk controller combined with a journaling file system would give you the same benefit. You're just reinventing the wheel..

    The only really new thing here seems to be the fact that the "TRAM" is file-system aware, which is just another way of saying that you are investing in hardware which will just tie you to tired old EFS.

    Windows NT has had a journaled file system forever, and the journaling doesn't cause the major performance impact that everyone seems to think it does. Maybe someday Linus will get in the mood and allow a journaling FS into Linux.

    On a side note, what does the OS do in case of some sort of TRAM failure?

    1. Re:Too much work by LarsG · · Score: 2

      Windows NT has had a journaled file system forever

      Unless they did som heavy changes in NTFS5, it is still log-based. Think of it as a circular buffer, usually 2-4MB size, where file system transactions are logged.

      It usually works just as well as a full journalled FS - 2 second "fsck" and a consistent fs. Under heavy disk activity you do however run the risk of exhausting the size of the log, and end up with an inconsistent file system if you crash.

      Other features of NTFS are cool, though. The fs attributes are a superset of posix and vms', so it can emulate both. It also has several metadata files, which provided 3rd parties the hooks needed to add quotas and other features to NT4.

      --
      If J.K.R wrote Windows: Puteulanus fenestra mortalis!
    2. Re:Too much work by fatphil · · Score: 1

      Isn't NTFS essentially HPFS?
      If I remember Microsoft OS/2 correctly, that is.
      I can't be sure there was theft/borrowing/evolution or whatever, but I do remember seeing a feature comparison of a whole bunch of FSs, and HPFS and NTFS had suspiciously similar features.

      FatPhil


      -- Real Men Don't Use Porn. -- Morality In Media Billboards

      --
      Also FatPhil on SoylentNews, id 863
    3. Re:Too much work by Tenareth · · Score: 1

      One minor difference, HPFS was/is fast. NTFS runs slower than FAT32.


      -- Keith Moore

      --
      This sig is the express property of someone.
  10. more to the problem.... by ndfa · · Score: 2

    Am i mistaken or did this article feel just to warm and fuzzy. I know there is a lot of good technical info in there but its all wrapped up in a very strange manner. I dont think you are solving the whole problem, by overlooking and waving your hands over the rest. I mean so you put a few sticks of memory and power on a PCI card! You do this cause a UPS can die, well i got news for you, the PCI Card could die too! AND if you are trying to make reboots faster, dont bother, if you are serious you would have a backup system and the same should be true for a web server dying. The only time i want fast reboots is when a good game of UT croacks on me and i want to get back to fraggin..... this is not technology i would use for mission critical apps!

    Hmmmmm... the more I think of it, the more it feels like a marketing team thought that a semi tech article on Slashdot would be just the ticket for killer web site traffic! Maybe its the lack of caffinne on my part...What do other ppl think ?

    --
    Non-Deterministic Finite Automata
    1. Re:more to the problem.... by gunner800 · · Score: 1
      It may not be the greatest thing in the history of the universe, but it has its place.

      Yes, serious users should have backup systems and should not be dependent on a single piece of hardware. But not all organizations or individuals can afford the backups. And it's always nice to have one more safety net.

      Just think of it as a (theoretically) cheap way to make data integrity somewhat safer, without sacrificing useful uptime. Could be useful?


      My mom is not a Karma whore!

    2. Re:more to the problem.... by fatphil · · Score: 1

      I agree.
      I could summarise the article in...
      (Use a nonvolatile disk cache)
      ... 5 words.

      OK, it's not quite what he's saying, but for the compactness I think it's pretty close.

      FatPhil
      -- Real Men Don't Use Porn. -- Morality In Media Billboards

      --
      Also FatPhil on SoylentNews, id 863
  11. RAM == volatile by pjrc · · Score: 5
    RAM is volatile.

    Sure, a battery backup sounds like it solves this, but consider that DRAM stores its charge on tiny capacitors, and requires a controller to be performing "refresh" access cycles regularily (usually every 15.26 s). This means that not only must the battery be good, but the controller accessing the DRAM must continue providing the refresh cycles without interruption. That may sound simple, but not all DRAMs are created alike.... SDRAM DIMMs have a feature called Serial Presence Detect (SPD) that is a small non-volatile 2-wire serial EEPROM memory that hold identifying data about the size and timing parameters for the memory. A typical DRAM controller would be initialized at boot time... a card like this would require a special DRAM controller that only initialized its timings when the DRAM/battery is first installed. Perhaps the controller would be designed to use relatively slow and conservative timings, always, so it'd never be able to reinitialize to other settings (that could be wrong) and/or stop providing the critical refresh at any point.

    The point is that to retain memory, DRAM requires not only power but a properly operating controller to supply the refresh cycles. Magnetic media maintains its memory without either of these conditions. Compared to magnetic media, DRAM is very volatile. "Mission Critical" data, whatever that may be, would be existing at tiny charges on the very tiny capacitors, which could dissipate in only about 4-8 ms, if the DRAM controller doesn't perform perfectly.... inside a computer (designed as a reliable server) which has just crashed for some unknown reason!

    1. Re:RAM == volatile by spectecjr · · Score: 2

      The point is that to retain memory, DRAM requires not only power but a properly operating controller to supply the refresh cycles

      Then use Static RAM with 5ns (or lower) cycle times instead. The idea has lots of problems, but the type of memory required (which isn't specified) isn't one of them.

      Simon

      --
      Coming soon - pyrogyra
    2. Re:RAM == volatile by dbarclay10 · · Score: 5

      You raise an implementation issue.

      The point is that to retain memory, DRAM requires not only power but a properly operating controller to supply the refresh cycles.

      Laptops run off batteries, and their memory seems okay.

      My Pilot runs okay off AAA-type batteries. Memory has been running quite well, thank you :)

      This fellow wasn't talking about slapping some RAM sticks to a breadboard and running current through the wires. Of course you would need a memory controller. Duh. :) The problems you raised were solved many years ago. If they hadn't been, nobody would be using volatile memory(like SDRAM) at all - it'd be too unreliable.

      I almost think you're just looking to spread some FUD.

      "Mission Critical" data, whatever that may be, would be existing at tiny charges on the very tiny capacitors,

      You say this like it's a bad thing! It's relied on every day. Hell, mission-critical data on its way to be written to disk is nothing but A CLUMP ELECTRONS MOVING ALONG A WIRE, in a lot of cases, those wires many times smaller than a human hair.

      Don't make a mountain out of a molehill. This isn't a bad idea, and just because they'll have to put a DRAM controller on the card doesn't make it one.

      Dave

      Barclay family motto:
      Aut agere aut mori.
      (Either action or death.)

      --

      Barclay family motto:
      Aut agere aut mori.
      (Either action or death.)
    3. Re:RAM == volatile by tzanger · · Score: 1

      Puh-lease

      I can't remember the last time I've had DRAM-related hardware failure. This is across approximately 150 computers, from 80386s to the latest and greatest around. Hell I don't think I've even had trouble on the old XTs or even Commodore 64 unless I was specifically screwing with the controller.

      The controller would initialize to the settings in the DRAM's SPD and stay there. What is so difficult about that? In the case of multiple DIMMs, initialize to the slowest/most conservative of the bunch. There's no need to synchronize to the host system if you use some sort of buffering or even extend the PCI access time until the end of the refresh.

      Yeah you could go SRAM but that's VERY expensive. How about PSRAM which is used in the Palm series of handhelds? They're DRAM with SRAM timings and built-in refresh circuitry. Much cheaper but consume more power than SRAM.

      As far as crashing for no known reason, why would you have the OS RELY on the controller? Do a mem check at start and perhaps during idle times/backup times. If the memory goes for a shit, you stop using it. Code your VFS around it being there but with sane timeouts so if it dies you can recover. This kind of design technique has been around for decades.

      Christ, you make it sound like magic how this stuff works. The dynamic memory technology around has been proven over the years. I'd be more inclined to think that the battery or associated switchover circuitry would give you trouble before the DRAM ever would.

    4. Re:RAM == volatile by tzanger · · Score: 1

      Then use Static RAM with 5ns (or lower) cycle times instead.

      Yeah, and quintuple your memory costs. Try PSRAM (Pseudo-SRAM) which is really DRAM in SRAM's clothing (pinouts and timings). They contain a DRAM bank and the refresh circuitry needed to keep the DRAM alive but without bothering with an external controller.

      The cost is more expensive than regular DRAM but WAAAAY cheaper than SRAM.

    5. Re:RAM == volatile by fatphil · · Score: 1

      This (being effectively part of the disk subsystem) is a comparatively low bandwidth device, there's no need for 5ns SRAM. Drop the cost by going for slower SRAMs.

      FP.
      -- Real Men Don't Use Porn. -- Morality In Media Billboards

      --
      Also FatPhil on SoylentNews, id 863
    6. Re:RAM == volatile by selectspec · · Score: 2

      Absolutely correct. This idea is ludicrous. You need hard storage back-up. The parity failure rate of DRAM vs. a hard disk should be enough to mandate this.

      --

      Someone you trust is one of us.

    7. Re:RAM == volatile by maraist · · Score: 2

      ECC.. Or better yet, for expensive boards, use a form of mirrored RAID (in addition to ECC). We're using incredibly inexpensive memory to work with roughly 32Meg of data (anything more is probably asking for trouble.. we're not building a caching system as was pointed out. In fact the system should stall if the buffer starts filling up). If video cards can handle twice 32Meg pipes, a "TRAM" controller should be able to as well.

      -Michael

      --
      -Michael
    8. Re:RAM == volatile by Doctor+Memory · · Score: 1

      (giggle) Jeezuz, do they still make this stuff? Last time I heard about this, I was speccing 2114s (read: back in the early 80s) and I ran across this "QSTAT" (quasi-static) stuff (not for 2114s, maybe 6116s?). Exactly the same thing, DRAM with a built-in controller on the chip. Le plus ca change...

      --
      Just junk food for thought...
    9. Re:RAM == volatile by pjrc · · Score: 3
      You say this like it's a bad thing! It's relied on every day.

      DRAM (tiny charge requiring sustained refresh operations) is relied upon during normal operation of the computer. The proposal here is to also rely upon DRAM during and after the events that lead to a crash.

      To respond specifically to your examples above, your battery supported laptop and palm pilot memory is reliable, but what happens if they crash? Is your laptop memory intact after something goes wrong? The microcontroller in a palm has no MMU, so if something does wrong, it'll be able to easily trash the memory.

      Regarding data being sent as "a clump of electrons moving along a wire" (propagation of a change in voltage potential would be more accurate)... that just simply isn't the way it's done. Communication takes place using protocols which verify that the data has been properly received. Newer ATA transfer modes use a CRC, and even with the older modes, status bits are provided to verify that the data was properly written. It would be horribly unreliable to send a "clump of electrons" and hope that the data is received and stored properly.

      Now, regarding the comment:

      I almost think you're just looking to spread some FUD.

      FUD, Fear, Uncertainty, and Doubt is a marketing tactic, generally used by an established vendor when their well-known product is inferior and more expensive, and the best way to convince a customer to buy the established product is to scare the customer away from the competitor.

      Why would I do that? I don't have any vested interest in the current practices. I'm not participating in the development of any journaled filesystems. I do a bit of freelance hardware design and small quantity sales, so if I thought this was a really good idea, I might go after making such a card and kernel patch.

      But I believe the idea is fundamentally flawed.

      During the unpredictable events that will lead to a crash, and the unpredictable behavior immediately following a crash, DRAM is going to be a much less reliable way to be holding data. It doesn't matter how well DRAM works during normal operation. DRAM has proven quite reliable, as long as the computer and memory controller operate properly.

      Even with a specially designed memory controller (as a standard one won't do), it is quite risky to rely upon DRAM during a crash. Call it FUD if you like, but DRAM just isn't a reliable place to have data when a machine crashes. You say FUD now, but if anyone were to actually make such a card, the term I'd use would be Snake Oil.

  12. Simply a cache for a journaling filesystem by elprez · · Score: 3

    First, it is absolutely critical that the OS creates some log or structure of operations on the TRAM for filesystem operations. Basically, if the OS can mark the beginning and end of an operation and place it in this memory, you can now get a journaled meta data filesystem without a complete re-architecture of a filesystem.

    Basically, if the OS can determine the beg/end of an operation (transaction) and it logs this information, then we have a journaling file system. Any persistent storage will suffice for the journal - 'TRAM' or hard disk or clay bricks. The only difference is the access time.

    In general there is no magical way for the OS to know what data is the beg/end of a transaction. The OS could try to handle meta-data in this fashion. It can log the meta-data changes it would make in atomic transactions and replay un-commited transactions on a reboot. However, the file system still needs to be aware of this journaling.

    Consider a power failure during a commit to the file system. The file system is in a partially modified state and the transaction has not been retired from the TRAM journal (since it did not complete). When the system boots again, the TRAM journal is replayed and the same operation begins again, except this time on an inconsistent file system. The file system needs to recognize that a partially commited transaction needs to be rolled back.

    The above is based on my (very incomplete) understanding of journaling file systems. However, a TRAM card amounts to a cache for a file system journal, so in no sense is it going to replace or leap-frog journaling file systems.

  13. Platypus Prices from CDW by l1nuxn3rd · · Score: 4

    Here is a link to the Solid State Hard Drive Pricing Page from CDW.
    http://www.cdw.com/shop/search/results.asp?grp=HSO
    Platypus products are listed as well as some from Quantum and Sandisk.
    You are talking $1,969.40 US currency for the Platypus QikDRIVE8 512MB, the smallest model i saw.
    CDW is the Authorized reseller I found for the US.

    1. Re:Platypus Prices from CDW by John+Paul+Jones · · Score: 1

      CDW has old/high prices... I don't think they've sold a single one. Contact sales@ininet.com for accurate info.

      (sales hat off now)
      -JPJ

      --
      Feh.
  14. opensource drivers for the Platypus QikDrive? by flaggzz · · Score: 1

    couldn't find any...

    and binary ones only for 2.2.17

    this is the problem with closed-source drivers for an fast-changing os like linux...

    yuck

    --
    Ring brother, ring for me | Ring the bells of hope and faith
    Ring for my damnation | I am at the gallows end
  15. as a replacement for hard disks by AussiePenguin · · Score: 1
    I don't see the point of using this as a backup as others have pointed out, the battery is likely to die like that of a UPS. So it's only job would be to backup a UPS which shouldn't die anyway.

    What seems more likely is to use it for replacements of hard disks. And this made me remember that it already is used like that. For example Electronic Organisers. But you'd want to have good battery testing and hot swapple battery so you can replace the battery while the machine is up. I wouldn't want to rely on a battery to protect my data as they aren't very failsafe.

    AussiePenguin
    Melbourne, Australia
    ICQ 19255837

    --

    Jeremy
    Melbourne, Australia
    Jabber Australia

  16. Too much trouble, too little benefit by Anonymous Coward · · Score: 1

    Running ReiserFS on Linux. Boots fast without a problem. Anyway, how often do you reboot anyway? One power failure in the last 6 months, and rebooted once to upgrade the kernel. ReiserFS is so rock solid and so fast that I haven't bothered replacing the bad battery in my old UPS. Don't need it. Try ReiserFS. You'll be impressed.

  17. Already available for most RAID controllers.... by X · · Score: 4

    Most RAID controllers will give you a battery backed-up write-back RAM cache. Depending on how you configure it, it will say that a write is committed as soon as it's in RAM. This accomplishes the same net effect without requiring all this modification of the OS.

    Of course, lots of people don't like to configure their RAID controllers this way, because there is no redundancy for data in RAM, not to mention that the risk of failure is still higher than with a hard disk.

    I hate to say it, but that article seems like it was written by someone who has not been out in the real world.

    --
    sigs are a waste of space
    1. Re:Already available for most RAID controllers.... by maraist · · Score: 2

      At face value I think you missed much of the point.. The OS can't know the intent of the "application". Just because I've performed a write, doesn't mean that only saving that much will have any meaning after a crash. I don't really understand how the author intends to deal with the "begin" and "end" transactions unless they're providing a journalling service (which I'm not familiar enough with). At the very minimum, however, a transaction would be the modification of an inode/directory element and then at least the initially provided data. The fsck is reduced because you'll never have a disk with only part of this info. Beyond that an explicit fsync might provide enough info to the OS to say that everything buffered up till the fsync should be part of that transaction.. I it "possible" to design in such a way that DB writes can write out everything as one piece, call fsync, and then have the OS garuntee an all or nothing write.

      If you mean to say that the write-back cache can be used for application meta-data (namely the transaction support), then that's an interesting (albeit proprietary) avenue to explore.. But I can imagine that regular flat SCSI and IDE based systems could benifit as well (since there are definately web servers / DBs out there without RAID). Why not push for a product that works with all drives and fights to become ubiquitous.

      -Michael

      --
      -Michael
  18. Come to the point! by zmooc · · Score: 4
    To the author:

    Why on earth do you want to tell us things like this Unix was designed to be simple. This means, if they found that they could do certain things as libraries in user space, then it didn't belong in the kernel.? It has absolutely nothing to do with TRAM. Actually that's true for nearly everything you say in your article; you use a lot of irrelevant examples and try to mention everything you seem to know about Unix and then explain the solution in 2 lines?! Why don't you mention the real interesting things like that such cards most probably fail just as often as UPS'es, why this should be on a PCI-card and not on the disk (ok that's because you want to access the memory directly, but please explain this...) or what the consequences are concerning access-time?

    Although the idea is good, I think you could have done a much better article; come to the point!

    --
    0x or or snor perron?!
  19. sheesh. by godmode · · Score: 1

    wow, if only the 8GB ones didn't cost $26,000.

  20. Sounds Like a Network Appliance by halbritt · · Score: 1

    Network Appliance has been doing this on their filers for many years now and it works very well. Although I would question using DRAM for the purpose. How would one know when their battery has failed? NetApp uses 32MB of NVRAM and a lot of other fairly commonplace technology in an interesting fashion that results in a very very fault-tolerant piece of hardware. I have a 600GB Filer (limited to 200GB volumes for back expediency) that I can pull the plug on at any time without damaging the filesystem. As a matter of fact, I have done this on occasion. Boots take a few minutes regardless of how the filer was downed.

  21. Wrong angle. by dybdahl · · Score: 1

    The whole industry works towards less hardware to do the same job all the time. Solving a software problem by adding hardware is not the way to go, especially not when ReiserFS etc. are already present.

    1. Re:Wrong angle. by fatphil · · Score: 1

      Fewer subsystems isn't always the best way to go.
      I _don't_ want onboard video and sound hardware on my motherboard. It's a server in a wardrobe. No need for Matrox this, gfx that, live! the other, 16 bit, 32Meg, MPEG nonsense. Similarly I don't want AGP, PCI _and_ ISA on my Mobo.
      So personally, I wish the whole industry _wasn't_ working towards that goal.

      I agree with your points about doing the software bit right before throwing hardware at a problem, though.

      FatPhil

      -- Real Men Don't Use Porn. -- Morality In Media Billboards

      --
      Also FatPhil on SoylentNews, id 863
  22. Why don't you read... by keepper · · Score: 1

    He gives the explanation on why they shouldn't be on disks....

    ARRGGHH

    1. Re:Why don't you read... by zmooc · · Score: 2
      I meant: why use a PCI-interface instead of implementing this as a sort of cache on the disk by adding begin/end-tags to the protocol the disk uses (IDE/SCSI)? Is that explained in the article? If so; please show me where. Thank you.

      What I was trying to say is that hardly anything was said about TRAM compared to the extensive description of how it currenty is done; all sorts of current solutions are covered, but when it comes to TRAM he just says `do it so and so' without mentioning WHY to do it that way or what the alternatives are. This was just an example of that, but I could have chosen a better one...

      --
      0x or or snor perron?!
    2. Re:Why don't you read... by lscoughlin · · Score: 1

      Yes, as a matter of fact, it is.

      First, it would be difficult to make those cache's (which in most cases are already there) persistent, which defeats the whole point of using it for fault tolerance.

      This is why you place it in an arbitrary open persistent memory cache.

      --
      Old truckers never die, they just get a new peterbilt
  23. Whats so radical... by A+Masquerade · · Score: 5

    This has been talked of for quite a time, and is hardly radical. Whats more it is not an alternative to journal based filesystems, but logically its an adjunct to them.

    First you have your filesystem that buffers transactions in a journal that is streamed to disk. Then, for performance, by avoiding all those extra seeks, you put the filesystem journal on another device - say a small fast dedicated disk. Then you make that device a NVRAM device rather than something based on spinning rust.

    Whats more, if you are interested in something like mail systems, where you get a lot of transactions that *must* committed to stable storage (although a lot of MTAs don't do that in spite of the wording of RFC821), and you use a fileystem like ext3 with a data journalling mode, then putting the journal onto NVRAM makes a huge difference - by the time it comes to the point where data would be committed to the disk from the journal, most of the data (ie e-mail messages) is now unwanted (since the messages have been delivered to final local destination or for onward transmission) and so you don't even need to do the disk ops...

    All of this is pretty much available now in ext3 other than the tools to get the journal onto a NVRAM disk - and thats just detail.

    So, nice idea, needs more flesh, a little more infrastructure needed round it.

    [Those who came to the London UKUUG Linux Conference might well have heard these discussions before going on in various corridors :-) ]

  24. Yet another strong arguement for... by iomud · · Score: 1

    Fail-over clustering. Being redundant is good, good, good. If we think a few sticks of ram are going to solve inhierent file system problems we either dont understand the problem or we dont understand the technology. I see it's benifits but for some reason it feels like it should be part of the hardware architecture rather than a simple pci card with yet another buggy driver supporting it and making it all work.

  25. Wasn't there something said about don't be a dick? by Ashleigh · · Score: 1

    wow, you've taken what was otherwise an interesting article, and stolen its virtue, it's purity,its virgin soil, with a stupid first comment that only inspires other idiots to reply to it.

    And don't burn karma, there is plenty of coal to go around for all, and it lights much better.

    (Forgive my sentiment, for it's late, I have had a long day, and you're an asshole)

    --
    Why yes, all my base are belong to you.
    How did you guess?
  26. This is WAY too much work... by jcr · · Score: 1

    ... to solve a problem that can be trivially addressed by just buying a UPS!

    All it takes, is a UPS that has a serial port to tell the machine that power was lost, and the host can just sync and halt before the power goes out.

    If what you want is a genuinely reliable system, then start working on EROS (www.eros-os.org)

    -jcr

    --
    The only title of honor that a tyrant can grant is "Enemy of the State."
  27. Not in EROS, it's not. by jcr · · Score: 1

    EROS implements global, orthogonal persistance by checkpointing the machine to disk every five minutes or so. If you lose power, the machine restarts with all of its state intact as of the last checkpoint. No fsck, no muss, no fuss, and provable security to boot!

    Check it out at eros-os.org.

    -jcr

    --
    The only title of honor that a tyrant can grant is "Enemy of the State."
    1. Re:Not in EROS, it's not. by jcr · · Score: 1

      In EROS, if you're doing something that requires transaction surety, you can explicitly append memory pages to the most recent checkpoint.

      -jcr

      --
      The only title of honor that a tyrant can grant is "Enemy of the State."
  28. I'm not sure where to start explaining why this by __aakpxi9117 · · Score: 1

    is such an idiotic idea. What he wants is a battery backed hard drive buffer. Does that take 20 pages to explain? I have the same thing right now, I call it an UPS. It does the same thing without the slowdown of a PCI slot throughput and it doesn't cost a lot. So if you're reading this, I ask the question... Why battery back a buffer module and not the whole system? And this is only for production system... Home users would still need a JFS because they sure as hell don't want to pay for this shit! How many home users backup their systems period?

    To all JFS developers, 'Nothin to see here, move along' with development.

  29. To avoid fsck and to boot in "only" 2 minutes? by haggar · · Score: 2

    Aside from some reliability issues of this technology, I wonder if it's worth the trouble.

    Besides, BeOS boots in about 15 seconds into GUI, even if you previously turned off the PC without shutting down. So, journaled filesystems DO have advantages. Linux may never achieve such high speeds in booting up, but still, I predict that a good JFS will benefit it.

    --
    Sigged!
    1. Re:To avoid fsck and to boot in "only" 2 minutes? by Drone-X · · Score: 2

      Besides, BeOS boots in about 15 seconds into GUI, even if you previously turned off the PC without shutting down. So, journaled filesystems DO have advantages. Linux may never achieve such high speeds in booting up, but still, I predict that a good JFS will benefit it.

      Actually, it does. If you just disable services like Apache, proftpd, etc. I counted this starting at kernel boot till GDM was up and running on an Athlon 700. My P166 could do the same, but without GUI (it doesn't have X installed so I couln't test).

  30. no journaled filesystem? what about reiserfs? by skotty · · Score: 1

    The text says that there's no production-ready journaled filesystem yet for a free unix. I completely disagree with that. We use redhat linux on reiserfs on raid5 for some of our servers and it's MUCH better then normal ext2: no corrupt filesystems anymore, no 'enter root password for maintenance' when there's a fs problem anymore and it's much faster. Imho is reiserfs much more production-ready then ext2 or ufs.

  31. Not radical but simply evolution... by pvcf · · Score: 1

    It seems to me, that this idea like many things is simply an evolutionary idea as opposed to revolutionary.

    I believe that the original author's article is fundamentally correct but there is a bit too much arm-waving and it blurs the description somewhat.

    If one were to actually perform more detailed analysis on the proposal, you would probably end up having to produce a journaled or transaction logged system with all their related overhead. The difference being that the journal or xactlog buffer is a battery backup XRAM device instead of the hard-drive itself.

    This could improve performance of this part of the system significantly.

    After all, I think this is just perhaps another (better?) way of doing the same thing...

    ....Paul

    --
    F U NE X N M? Son: "Dad... How do you spell 'hourly'?" Dad: "0 * * * *"
  32. Why is FS-aware TRAM a Good Thing? by The+Monster · · Score: 1
    The only really new thing here seems to be the fact that the "TRAM" is file-system aware,
    I'm still trying to figure out why this is a Good Thing. What is wrong with an entirely OS-independent disk controller card with battery backed write cache.

    Whenever your computer told it to write to the disk, the first thing it'd do is write each sector to the cache, and write the drive and LBA sector number to a separate section of memory. The controller could then index this structure by LBA to implement "elevator" writes, vastly improving performance with little risk of data loss.

    With enough memory, it could also be configured to permanently cache the MBR, boot loader, kernel, init scripts, daemons... Make it big enough and the whole swap partition is in there, too. Think how fast something like this could boot.

    --

    [100% ISO 646 Compliant]
    SVM, ERGO MONSTRO.

    1. Re:Why is FS-aware TRAM a Good Thing? by Stormgren · · Score: 1

      Many RAID cards have onboard RAM. I have an AMI 1500 controller sitting next to me that takes a standard PC SDRAM DIMM as a cache chip. It has a rechargeable Li-Ion battery to keep the info on the chip persistent. It does read-ahead as well as write-back caching.

      As far as I can tell it does do what it is supposed to. In the event of a power failure, it contains the last bits of info that were sent by the OS in RAM, and will reconcile what actually had been written to disk before the machine went down.

      The problem with this, of course, is that the OS (in this case SCO openserver, not my choice, need to support legacy app) has this nasty habit, like all Unices, of caching data in system RAM by default. I've turned down or off as many cache settings as I can in the SCO kernel parameters, but there's still a little bit still going on.

      Performance is a little poorer than "normal" but the crash-reboot-fsck time is better than it was, and I've lost virtually no data with the RAID configuration, which was my goal.
      However:

      "With enough memory, it could also be configured to permanently cache the MBR, boot loader, kernel, init scripts, daemons... Make it big enough and the whole swap partition is in there, too. Think how fast something like this could boot."

      Yes, but doesn't that contradict what you said about a "entirely OS-independent disk controller card"? Seriously. Wouldn't the card have to understand whatever filesystem you were dealing with, and hence the operating system (somewhat)? The only way I can see around it is having the card remember what sectors it commonly read during the first X seconds after the first disk read. I'm not saying it's a bad idea, as I like the thought of having 20 second boots, but how would you do something like that and stay "OS independent"?

      "All those tubes and wires and careful notes!"

      --

      "All those tubes and wires and careful notes!"

    2. Re:Why is FS-aware TRAM a Good Thing? by boomi · · Score: 1

      >With enough memory, it could also be configured to permanently cache the MBR, boot loader, kernel, init scripts, daemons... Make it big enough and the whole swap partition is in there, too.

      uuuuh, I'm gonna upgrade the controller with 256Megs of ram to hold my Pagefile. Yeah, shure!
      And ooooh, yeah, the pagefile will be stored, even if the system fails! COOL!!!

      Man, everyone here is bothering about boot time.
      I got tears in my eyes when I discovered the "autotune" option in LILO, speeding up the boot by 6 lousy seconds. Later I decided that the boot time is rather irrelevant cause I reboot about once a week.
      Boot time is relevant to MS! Yesterday my Win95 box ...*uuuh, headache*
      I agree that controller-cache would be a good idea for Windoze boxes.

      Once an OS is up and running, it will cache everything, no need for crappy controller-cache.
      Anyway, we were talking about write-cache here, this makes good sense to speed things up!

    3. Re:Why is FS-aware TRAM a Good Thing? by The+Monster · · Score: 2
      Yes, but doesn't that contradict what you said about a "entirely OS-independent disk controller card"?
      No. All the card has to know is that these are the first x sectors that the computer asks for when it boots. It need not (and, I submit, should not) have any knowledge whatsoever of the structure of the filesystem(s) they comprise. All you need is a way to tell the card what you wanted done: Some command that could be sent over the existing bus, but intercepted by the card, to allocate the first N sectors of boot tracks, and hard-map M cylinders for use by a particular partition...

      Both boot and swap caching would be most helpful on a [lap|palm]top machine, where epic uptimes are irrelevant. Normally, low-power RAM that is most efficient for battery-backed use is slower than the RAM we're accustomed to using. But it's still orders of magnitude faster than disk access.

      --

      [100% ISO 646 Compliant]
      SVM, ERGO MONSTRO.

    4. Re:Why is FS-aware TRAM a Good Thing? by The+Monster · · Score: 1
      The point of the exercise is to make the system more reliable.
      Exactly how does this require that the TRAM have any knowledge of the OS(es) using it?

      I am quite well aware of the issues involved in journalling transactions to disk. But once the caching controller has staged the impending writes in its battery-backed RAM,

      all the data gets backed up on a persistent medium
      just like everyone wants, because the RAM itself is (relatively) persistent. The only downside at all is failure of the battery or the RAM; if you're doing something that critical, it ought to be replicated to multiple servers anyway.
      --

      [100% ISO 646 Compliant]
      SVM, ERGO MONSTRO.

    5. Re:Why is FS-aware TRAM a Good Thing? by Stormgren · · Score: 2

      I think that's what I was getting at in my prior post. If the controller can cache my commonly accessed sectors at boot-time, that's fine. But as for "M cylinders for a partition", for traditional partitions, that's fine. For things like UnixWare, Openserver and *BSD, it'd have to know about the sub-filesystem partitions they use, would they not?

      "All those tubes and wires and careful notes!"

      --

      "All those tubes and wires and careful notes!"

    6. Re:Why is FS-aware TRAM a Good Thing? by The+Monster · · Score: 2
      For things like UnixWare, Openserver and *BSD, it'd have to know about the sub-filesystem partitions they use, would they not?
      Not that I can see. Ultimately, all of the levels of abstraction map to commands to the HD to read/write one or more LBA sectors. If I want to reserve a specific range of sectors in the cache, all I have to do is use the OS tools that determine what those sector numbers are, and communicate that to the controller. If this means that a device driver would likely be written for a TRAM-enabled controller, sure.

      But it's important to keep these things straight: Drivers written for an OS (hopefully source code that can compile under any *nix) may well benefit from knowing about hardware. The controller never needs to know anything about the OS. (And let me reiterate, shouldn't know. Can you say "Win[Modem|Printer]?" Yuck.)

      --

      [100% ISO 646 Compliant]
      SVM, ERGO MONSTRO.

    7. Re:Why is FS-aware TRAM a Good Thing? by Stormgren · · Score: 1

      Well, if you're going to throw a driver into the mix, I see your point very nicely. I was purely coming from a hardware standpoint. The driver telling the controller what to cache on a regular basis would be nice, especially if it's somewhat admin controllable.

      Wow, first intelligent conversation I've participated in on slashdot in months. Amazing.

      "All those tubes and wires and careful notes!"

      --

      "All those tubes and wires and careful notes!"

    8. Re:Why is FS-aware TRAM a Good Thing? by The+Monster · · Score: 1

      Sure. The idea is to keep the hardware interface simple, with default behavior that makes sense under any OS/app load. That way, it's an immediate win to implement the hardware, and it doesn't become dated when we shift to the 3.x kernel in a few years (decades, at this rate). The driver is entirely optional, being "aware" of details of your specific setup. I can see the applications for portable computers including a "dump all memory to the cache, set wakeup timer for next cron job, then shutdown" that just wouldn't apply to a server, for instance.

      --

      [100% ISO 646 Compliant]
      SVM, ERGO MONSTRO.

  33. Historical Reference by JonesBoy · · Score: 1

    I remember back in '86, a manufacturer had an 8 meg RAM drive with a big gel cell backup battery. Essentially, a DRAM hard drive like you are describing for a cache. It worked on apple IIgs series computers, which usually had less than a meg of main memory, but could be expanded to 4mB. I am sure a RAM drive could be built today. I am not talking about the solid state hard drives some posters have commented on. They are using static RAM and communicating through a PCMCIA port. They have maximum throughputs of about 1MB/s sustained, which is not very useful if you are working with databases. If anything, having your OS on one of these would give amazing startup times. The apple IIgs usually booted from an 800kb floppy, and took a few minutes. Ads for the RAM drive said seconds! With a 1.2MHZ processor! Ahhh. the good ol days.

    But seriously, why the hell would you use this for a cache instead of the main drive? Why go from battery RAM -> Hard Drive -> Tape and not just batt. RAM -> tape? Espically if it is battery backed?

    --
    Speeding never killed anyone. Stopping did.
    1. Re:Historical Reference by TBC · · Score: 1

      I was going to bring this up. The device you are thinking of was manufactured by Applied Engineering, and actually let you put 2 ram cards into the IIgs. I was selling "Octo-Ram" ram cards at the time, capable of taking 8 Megs of memory. I populated 2 cards, created an 8MB ram disk (the IIgs could only address 8MB, even though the processor could handle 16MB) and copied 1/4 of my 20MB hard drive into it. Pretty wild what you could do with limited memory and drive space....

  34. Re:I'm a geek by fatphil · · Score: 1

    Cos it don't work. I said that 3 weeks ago, and they're still pestering me.
    -- Real Men Don't Use Porn. -- Morality In Media Billboards

    --
    Also FatPhil on SoylentNews, id 863
  35. RIO - RAM I/O by dugsong · · Score: 2

    see the Rio / Vista work by Pete Chen, Dave Lowell, et al. which won best paper at SOSP several years ago...

  36. another way to skip fsck at boot by thvv · · Score: 1
    TRAM is an interesting idea. Without taking anything away from it, let me mention some work done in the 70s.

    Multics had the same problem after crashes: it took a long time to bring the system back up because the "salvager" had to check every "VTOC entry." (fsck/inode in Unix terms)

    We re-wrote the file system to do the necessary checking on each use of the directory and VTOC data, and eliminated the salvage during boot. Everything still got checked, but only as needed. Boot times went from hours to minutes, and the system was much more solid and reliable.

    See http://www.multicians.org/thvv/marking.html

  37. Ext3 is working on this by sjmurdoch · · Score: 1

    I was at a talk by Stephen Tweedie (one of the developers on Ext3). He was saying that one of the recent things he was working on was storing the journalling data on a separate device from the hard disk where the data is to be stored.
    Initially he has tried storing the jornalling data in RAM to test performance but the plan is to store the journalling data on a NVRAM card that he was waiting to be delivered. This will increase speed of synchronous writes, like with databases and sendmail and give all the benefits of journalling.

    --
    Steven Murdoch.

    --
    Steven Murdoch.
    web: http://www.cl.cam.ac.uk/users/sjm217/
  38. About your punctuation: (and a comment) by Tom7 · · Score: 2

    You should read this Bob The Angry Flower comic:

    http://www.angryflower.com/bobsqu.gif

    How about this TRAM stored on the disk drive, and have the OS simply tell it the dependency DAG? It can perform its own write reordering (probably more efficiently since it knows where the disk head really is and all the specifics about its geometry) and then finish off the queue when first getting power after a power loss.

  39. I find it odd... by rabtech · · Score: 1

    I find it odd that in today's world, I still can't get a default distro of Linux or any free *nix with a journaling filesystem preinstalled as the default, and with a fs driver considered to be STABLE or RELEASE. Yet, Windows NT has had a full journaling filesystem since NT 3.x....

    Since NTFS is journaling, supports reparse points, extended meta data, and more, I look forward to the day when the NTFS fs driver for Linux is stable enough to boot from, then I can have one *stable* filesystem across all my disks.

    I might add that a boot-time chkdsk on a rather large partition (chkdsk on NT == fsck on Linux) takes less than 30 seconds, many times even less. Contrast that to your average ext2 or FAT32 system, which can take many minutes to check.

    -- russ

    -
    The IHA Forums

    --
    Natural != (nontoxic || beneficial)
  40. This is already been done by Erisian · · Score: 1
    Take a look at the Network Appliances devices. They write first to NVRAM and then say that the write was committed. Then they can write the data to actual disk at leisure. The NetFilers don't require any changes to the kernel or special drivers, so I don't know why the author thinks his idea would. Just implement the NVRAM onto the controller.

    Chris

    --
    What's the difference between an orange?
  41. Largely unknown? by bugg · · Score: 2
    Dru writes: "This is an article about a hardware technology that is largely unknown in the new Unix community.

    Perhaps in the UNREAD (which I guess is fairly large, hurmf) portion of the community. Chapter 8, section 2 of The Design and Implementation of the 4.4BSD Operating System talks about this idea on page 284. It referrs to research done by Moran et al, 1990. The references at the back of the chapter refer to "Breaking Through the NFS Performance Barrier," Proveedings of the Spring 1990 European UNIX Users Group Conference, pages 199-206.

    So there you go, there's TWO ways that we could have heard about this. I doubt anyone here got that first hand, but the 4.4BSD book is a fairly common book to have for those who are interested in the innards of an OS.

    --
    -bugg
  42. depends on the audience maybe? by Servo · · Score: 1

    I don't know, but I think the author may have been writing to a different audience. It just didn't "sound" right when I read it.... maybe posting it to Slashdot was a bad idea for the poster. The idea of TRAM sounds rather ridiculous honestly, except in something like a mainframe or midrange system where the OS and the hardware are designed by the same people. But even so, it still wouldn't be useful for what the author intends.

    Like many other posters have noted, it seems like the author really doesn't have any real world experience with this environment, but just though "Hey, this would be useful if it worked..."

    --
    A slip of the foot you may soon recover, but a slip of the tongue you may never get over. -Benjamin Franklin
    1. Re:depends on the audience maybe? by Servo · · Score: 1

      I never claimed that I was an expert. But, I DO know more about these systems than the average Anonymous Coward.

      Yes, I just described you as well.

      --
      A slip of the foot you may soon recover, but a slip of the tongue you may never get over. -Benjamin Franklin
  43. Why are solid state drives so expensive? by Raleel · · Score: 2

    Seriously...do they use some special proprietary ram? 8G of cdram in 512 meg chunks is only a couple of K. Hardly justifies another $24k tacked onto it. Is this another example of "charge what the market will bear"? I understand there are development costs and the like, but _geez_ $26k is _a lot_ of money. Don't give me an answer like "they are not intended for home use, so they charge more", because that's a bullshit reason (even though it's done all the damn time).

    --
    -- Who is the bigger fool? The fool or the fool who follows him? --
    1. Re:Why are solid state drives so expensive? by verch · · Score: 1

      This is an excellent question.. I was recently researching SS disks at work and found them to be much too expensive to be practical. The units from the big manufacturers (SolidData, Imperial Tech, Bitmicro) all come in at $15k-$20k for a 1G drive. Absurd.. It seems to me that building one of these is fairly simple.. Maybe a junior year EE project. In fact, the mechanics of it should be much simpler than a conventional disk drive unless I am missing something...

  44. gimme one for my linux box! by xshader · · Score: 1

    i dont giva damn about DBs. gimme one of those cards for my linux box and ill just use it for everything needed to boot and the rest as a swap partition. this would make my linux box boot in like 5 seconds... awesome

  45. Re:Thank you BSD. by britt · · Score: 1

    Not True.

    Data ONTAP is written in house. The only parts from BSD is some of the network stack, and that has been changed heavily as well.

    --
    --Britt
  46. Exactly what NetApp does by ansible · · Score: 3

    This is what the Network Appliance boxen do to speed NFS writes.

    All NFS write transactions are commited to NVRAM first, so that they can be acknowledged. Then the writes to disk are sorted and blasted out. Very efficient, very fast.

    It is this NVRAM (as well as using a modified RAID-4 on top of the WAFL filesystem) that makes a NetApp much faster (yet still safer) than most other NFS servers. I've often thought about creating just such an NVRAM board for a PC, so that I could do the same thing with my Linux fileservers.

    Note that the NetApp implementation caches NFS requests, not filesystem-level data. Say I'm changing 1 byte in a block. If I buffer filesystem data, I have to cache the whole block. If I'm buffering the NFS request, it'll be much smaller.

    Buffering (in NVRAM) the log data might work well for something like ext3.

    1. Re:Exactly what NetApp does by Dahan · · Score: 1
      This is what the Network Appliance boxen do to speed NFS writes.

      Yeah, the article said that :)

      Today, Network Appliance, Sun, and a few others hide TRAM technology in their core storage products. Network appliance uses TRAM in a different way, though. They hold the NFS rpc requests in memory, not the disk blocks. Their WAFL filesystem is journaled and can come up quick on its own, so it doesn't require the TRAM. They use it to respond to requests quickly, yet still give the client the semantic that the operation is 'safe' or committed to disk. To deal with the journaled filesystem, they use RAID striping.
  47. Re:Thank you BSD. by keepper · · Score: 1

    do you happen to know to what extend BSD code is used... and what flavor ( F/O/N BSD or BSDi ) ?

  48. (OT) Re:About your punctuation: (and a comment) by CoolVibe · · Score: 1
    yeah, and this:

    Bob the angry flower: plural's

    (yeah I know plural's is wrong, but that's the name of the strip, okay?)
    --
    Slashdot didn't accept your submission? hackerheaven.org will!

  49. It's just a Band-Aid by ecloud · · Score: 2

    There are better ways to fix this problem, such as put a battery backup on system RAM, so that the OS won't need to be reloaded at all; it can pick up where it left off when power to the CPU comes back on.

    And if computer power supplies were also designed for battery-backup (dual-voltage, can run on either 120VAC or say 24VDC) then the complexity of the UPS (converting DC to AC just so the computer can convert it back to DC again) would be eliminated, and the result would be more reliable. There should be a standard connector so an external lead-acid battery can be plugged into the back of every computer, and the power supply would be responsible for keeping it charged, and switching to using it when the power fails. (But there should be a switch to turn off the charging feature in case you have several computers hooked to one large battery with a separate external charger.) Then maybe when the power fails the OS would be notified, and it could finish doing any uncommitted writes before powering off the disks and CPU; the battery would continue to backup the memory for a very long time since it would be such a small load compared to the whole system. That way the two battery-backup systems could be combined. (Or not... maybe a separate memory backup would be more reliable.)

  50. This struck me as interesting.... by iCEBaLM · · Score: 2

    Note, no free unix today has, at least to the point of people trusting their main database on it, a production-ready journaled filesystem.

    Linux+ReiserFS.

    I would trust ReiserFS to keep my main DB safe, I've been using ReiserFS with Linux for some time now with no data loss. (and many power failures and some crashes due to a partitcular closed source XF86 video card driver)

    -- iCEBaLM

    1. Re:This struck me as interesting.... by iCEBaLM · · Score: 1

      Substantiate your claim or shut the fuck up.

      -- iCEBaLM

  51. Re:Semi Radical by Anonymous Coward · · Score: 1
    Linux Zealot Charged in Murder of Seven Co Workers
    "Linus made me do it," says Mcdermott
    MSNBC, Wakefield, MA - Linux user MICHAEL McDERMOTT, 42, was held without bail after being charged with seven counts of murder in the shootings Tuesday morning at Edgewater Technology. He entered innocent pleas on all seven counts. McDermott stood silently during the brief court appearance. He displayed no emotion as the prosecutor outlined graphic details of the rampage at Edgewater Technology.
    The innocent Microsoft users were earlier identified as: Jennifer Bragg-Capobianco; Janice Hagerty; Louis Javelle; Rose Manfredy; Paul Marceau; Cheryl Troy; and Craig Wood. All worked on the first floor of the company's offices, located in a converted factory building, where the shootings took place. Two were believed to be receptionists and the other five worked in the company's accounting department, authorities said.

    POSSIBLE MOTIVE
    Co-workers said McDermott was a user of Linux and other Open Source Software, about 6-foot-2 with a long, flowing beard. He was friendly but "a little frightening," said Mike Stanley, a project leader.
    McDermott recently had been coming in late and his performance wasn't as good as it could have been, Stanley said.
    Prosecutors were investigating whether McDermott was upset about an Internal Revenue Service request to withhold some of his wages to cover software licensing fees, Middlesex District Attorney Martha Coakley said.
    She said Edgewater had agreed not to begin taking out money from McDermott's paycheck to pay for the software until after the holidays. However, McDermott had an angry outburst in the company's accounting department last week over the prospect of losing some of his wages, according to an employee who spoke only on condition of anonymity.
    "He was like a wild animal, screaming something about a cathedral and a bazaar," said the anonymous employee. "We told him that it was illegal to not pay for software, but he would have none of it."
    Coakley said the shootings were apparently not random, since the suspect bypassed several people during the rampage. None of the victims was among McDermott's supervisors. They were all, however, innocent Microsoft users.

    SUSPECT HEAVILY ARMED
    McDermott had an AK-47 rifle, a shotgun, a semi-automatic handgun, and a stuffed penguin when police burst into the building and found him sitting silently in the reception area with his weapons. He made no attempt to shoot police and was described by authorities as "unresponsive."
    Police found McDermott sitting silently in the reception area, a body nearby, his weapons within reach. He was arrested without gunfire.
    "They made a split-second decision to hold their fire to try to effect an arrest," said Stephen Doherty, the police chief in this city 10 miles north of Boston.
    Authorities said McDermott, an employee with Edgewater Technology since March, came to work as usual Tuesday morning. Around 11 a.m., however, he walked into the building's reception area, shouted "I love you Stallman!" and opened fire on two co-workers. He then proceeded to another wing of the building and shot five more employees at their work stations, police said. Shell casings and bullets were found all over the office.
    ."There was an enormous amount of firepower, considering the guy was a dork," said Coakley. The entire shooting rampage lasted from 5-10 minutes, she said.
    She said McDermott did not have a permit for any of the weapons he was carrying, but had no prior criminal record, other than being a Linux user.
    "I thought I was going to die," said a 29-year-old innocent Microsoft user, who declined to give his name. He was one of up to 70 people who were at work in the building at the time, officials said.
    Frank Harrington, a contract consultant for the company, said he was in the building at the time of the shooting and heard eight to 10 shots.
    Mike Brownson, who works across the street from the office complex, watched as police arrived and surrounded the building.
    "When police are ducking behind their own vehicles, it's not something you see every day," said Brownson.
    .Darren Emery, the manager of a store across the street, told MSNBC Cable that 14 innocent Microsoft users took cover in his store.
    One witness said McDermott had "gone Columbine."
    McDermott lived alone in Haverill, Mass., also a suburb of Boston, said police.
    Kevin Forzese, who lived upstairs from McDermott in Haverhill, said the suspect had never mentioned money problems. He also said McDermott had mentioned that he collected japanese hentai, Furry fetish artwork and costumes, erotic anime Lego sculptures, and Star Trek memorabilia, but he had never seen any weapons in McDermott's apartment.
    "He never talked about the company," Forzese said. "I talked to him about money and he said he was doing really well, but he refused to pay for his software like an honest consumer, which sorta bothered me."
    Jonathan Oldham, a 35-year-old carpenter, said McDermott moved out of a six-unit apartment complex in Weymouth, south of Boston, at the end of October. Oldham was surprised to learn the man he said was a dork was accused of the rampage.
    "I freaked," he said. "You never know if someone has problems with their life, especially if they are a criminal or a Linux user. It could have happened here."

    EMPLOYEES 'SHOCKED AND DEVASTATED'
    Bill Gates, Microsoft's chief executive, released a statement of sympathy. "Everyone at Microsoft is shocked and devastated by the loss of our valued users," the statement said in part and added, "We extend our deepest sympathies to the victims' families at this tragic time, and a coupon for a $49 upgrade to Windows ME."
    ."The company was scheduled to be closed through the end of the week.
    Survivors of the attack were in shock, said the Rev. Tom Powers, who helped with grief counseling at St. Joseph's Church, where about 100 Microsoft users gathered after the shooting. They left sporadically, their faces stained by tears and holding each other for support.
    "There's nothing you can do to take the grief away, except lock these Linux users up before they hurt more people!" Powers said.
    Nancy Pecjo, a software developer with the company who is on maternity leave, was not at work at the time of the shootings but went to the building after hearing the news.
    She said 30 to 40 employees worked at the Wakefield office. She did not know who had been shot.
    "It's a great company, a wonderful company," she said, adding that she didn't know of anyone who'd been fired recently or was disgruntled.
    "It's a small company, you get to know everybody there," she said. "When something like this happens it's very distressing."

    The shooting was the latest in a string of criminal acts perpetrated in recent years in U.S. workplaces by Linux users.

  52. Why not battery back the System? by Anonymous Coward · · Score: 1

    The system RAM could be kept around just as easy on a battery. Mark the transaction section of RAM and on boot you could read it back. No more PCI card and MUCH faster access.

  53. What about MRAM? by ocie · · Score: 1

    IBM is supposedly releasing its magnetic ram technology in a few years. This is like core memory (needs no refresh, non-volitile). If the OS knew that the memory was non-volatile, it could do everything needed to be used as TRAM.

    --
    JET Program: see Japan, meet intere
  54. Apple ][gs RamFAST card by Anonymous Coward · · Score: 1

    Great idea - I've always wondered by we are no longer using these. My old Apple IIGS had a battery backed up Memory card which I always kept a copy of the OS resident in. The system was very fast to boot. Memory's cheap yet these devices have all but disappeared. You know your're getting older when old ideas become new again...

    1. Re:Apple ][gs RamFAST card by Dahan · · Score: 1

      It wasn't a RamFAST though... that's a SCSI card. You're probably thinking of the RamKeeper (IIgs-only, plugged into the memory expansion slot, and your memory card piggybacked onto it), or the RamFactor with RamCharger (worked on the II+ and up, battery was a big external thing).

  55. Solid state drives by Anonymous Coward · · Score: 1
    They're only useful for large enterprise-scale applications and embedded these days. That's a bit silly. One could easily make a ~128-512mb SSD in the consumer price range that runs off of IDE. I'd buy one, if it was under $500. It would make for a:
    • Stick small install on it, and you get a nice, reliable, silent thin client (no noise)
    • Stick it in a normal machine, and you have a much more reliable place to stick data.
    • In a normal machine, stick /usr and /etc on it (in Linux) or /windows (in windows) on it, and you get almost instant boot.
    I've been hunting for one of these for ages, as have several other people. Right now, high-end ones cost shitloads of money, are huge (gigs), use fans and tremendous power (usually). Embedded ones require special motherboards and special (usually binary-only) drivers. (Note: I mostly looked at Quantum's models, Disk-on-Chip, and several similar companies)
  56. I allreay have that by magister · · Score: 1

    To my understanding thats what my RAID controler does. Unfortanly fsck still wants to run on my RAID after a power cycle, but I just need to install a jurneling filesystem on it to acomplish the speed of the powerup.

    Even with some of the problems I have run into with power around here befor I got my UPS I never lost a single bit. The Mylex 1164 RAID controler comes with a minimum 32Megs of read/write cache, that has a battery backup.

    Now granted, it wasnt cheep, but with my small income I still managed to afford it. Also the Linux drivers worked perfectly. With RiserFS (or the likes) this would be exactly the solution he is talking about.

    If your a geek like me with a big collection of mp3's and mpg's you can not lose, and have many users, its the perfect solution. It was for me :)

    --
    -magister-
  57. nothing new here by soldack · · Score: 2

    This sounds like a lot of standard techniques already in use in storage today. Many storage controlers (FibreChannel/SCSI, RAID/nonRAID) support battery backup power that will let them finish writes. Most use internal write and read cache and include lots of memory. Transaction support already exists in some ways. If I send a command to write 1024 bytes to a disk that does 512 byte writes, a good controller will attempt to make the 1024 byte operation atomic even though it is internally broken up into multiple 512 byte writes. File systems fragment the pieces of a file and thus have to issue multiple non-sequential commands for a given operation. This is where the problem errupts. Some controllers support combining multiple operations into one call but this is usually done without FS knowledge; it just fills a buffer of ops and then dumps it. RAID already handles the issue of non-sequential operations by hiding them. RAID may present a 1 terrabyte drive as sequential data when it is really stripped across various areas of various drives. When RAID is told to write 1 sequential GB out it is done as one operation, even though it involves many non-sequential writes to multiple locations on multiple disks. The trick is to put file system support in the storage controllers or RAID systems. With file system support, it will attempt to make sure that physically non-sequential writes that are sequential at the file level are completely written out. This doesn't happen much because for various reasons but does exist. For example, some RAID systems support running the file system on their system directly.

    What does this do that is so different? At $300, it may be less expensive than similar solutions but I do not see "how it differs from everything out there".

    --
    -- soldack
  58. Re:Sounds Like a Symmetrix by halbritt · · Score: 1

    The Network Appliance also has a large DRAM Cache from 512MB 1 or 2GB on their latest released products.

    The benefit of having an NVRAM write cache is that there is no requirement for a backup power source in the event of a loss of power. One can also cluster NetApp filers as well as have synced read-only volumes on other filers.

    I wouldn't like to get into a heavy debate of EMC vs. Network Appliance other than to say that one can run a database application doing I/O over a network. One has to have a really good network though. Your Symmetrix is going to be FC connected to your hosts. In my application the Network Appliances are connected to the hosts via Gigabit Ethernet. The max speed of both media is 1Gbps. Granted there's going to be protocol overhead with the NetApp that would cause one to get less performance all other things being equal, except maybe cost. The EMC is going to cost 3x more.

  59. fsck ... Not a Problem Here by tmjva · · Score: 1

    I sometimes snicker at the odious convolutions
    Unix administrators have to go through to
    keep their systems up and running. I'm in
    an HPe3000 shop and its automatically
    self-journaling file system has never caused
    boot-up problems on me in 17 years.

    --
    Tracy Johnson
    Old fashioned text games hosted below:
    http://empire.openmpe.com/
    BT