One Developer's Experience With Real Life Bitrot Under HFS+

I've also had this happen with HFS+ by carlhaagen · 2014-06-14 01:30 · Score: 4, Informative

An old partition of some 20000 files, most of them 10 years or older, in where I found 7 or 8 files - coincidentally jpg images as well - that were corrupted. It struck me as nothing other than filesystem corruption as the drive was and still is working just fine.

Re:I've also had this happen with HFS+ by istartedi · 2014-06-14 01:57 · Score: 4, Insightful

coincidentally jpg images as well
Well, JPGs are usually lossy and thus compressed. Flipping one bit in a compressed image file is likely to have severe consequences. OTOH, you could coXrupt a fewYentire byteZ in an uncompressed text file and it would still be readable. I suspect your drives also had a few "typos" that you didn't notice because of that.

--
For all intensive purposes, "whom" is no longer a word. That begs the question, "who cares"?
Re:I've also had this happen with HFS+ by Jane+Q.+Public · 2014-06-14 11:14 · Score: 3, Insightful

I agree with istaredi.

Ultimately, it isn't a "failure" of HFS+ when your files get corrupted. It was (definitely) a hardware failure. It's just that HFS+ didn't catch the error when it happened.

Granted, HFS+ is due for an update. That's something I've said myself many times. But blaming it when something goes wrong is like blaming your Honda Civic for smashing your head in when you roll it. It wasn't designed with a roll cage. You knew that but you bought it anyway, and decided to hotdog.

Checksums also have performance and storage costs. So there are several different ways to look at it. One thing I strongly suggest is keeping records of your drive's S.M.A.R.T. status, and comparing them from time to time. And encourage Apple to update their FS, rather than blaming it for something it didn't cause, or for not doing something it wasn't designed to do.

Backup? by graphius · 2014-06-14 01:36 · Score: 3, Insightful

shouldn't you have backups?

Re:Backup? by kthreadd · 2014-06-14 01:42 · Score: 4, Insightful

The problem with bit rot is that backups doesn't help. The corrupted file go into the backup and eventually replace the good copy depending on retention policy. You need a file system which uses checksums on all data block so that it can detect a corrupted block after reading it, flag the file as corrupted so that you can restore it from a good backup.
Re:Backup? by ZosX · 2014-06-14 01:49 · Score: 2

This is a good idea, but not a solution. Often you have no idea that the file is bad until after the fact, in this case years later. I've had mp3 collections get glitches here and there after a few copies from various drives. If you have no idea the data is bad in the first place, your backup of the data isn't going to be any better. I would say that all of my photography I've collected over the years has stayed readable somehow. I do check in lightroom every once in a while, but I wouldn't be shocked to find a random unreadable file. Not good really, but there's probably not much I can do other than make sure that my files are verifiable.

--
zosxavius photography
Re:Backup? by dgatwood · 2014-06-14 02:09 · Score: 5, Insightful

Depends on the backup methodology. If your backup works the way Apple's backups do, e.g. only modified files get pushed into a giant tree of hard links, then there's a good chance the corrupted data won't ever make it into a backup, because the modification wasn't explicit. Of course, the downside is that if the file never gets modified, you only have one copy of it, so if the backup gets corrupted, you have no backup.
So yes, in an ideal world, the right answer is proper block checksumming. It's a shame that neither of the two main consumer operating systems currently supports automatic checksumming in the default filesystem.

--
Check out my sci-fi/humor trilogy at PatriotsBooks.
Re:Backup? by rnturn · 2014-06-14 02:27 · Score: 2

Even if you did have backups how could you even begin to know which saveset to restore from? You could have been backing up a corrupted file for a lo-o-ong time.
Friends wonder why I still purchase physical books and CDs. This is why. I'll have to come up with a simple 2-3 sentence explanation of the problem the OP was describing for when they ask next time. I've had MP3 files made from my CD collection mysteriously become corrupted over time. No problem, I can just re-rip/convert/etc. but losing the original digital version of your newborn would be heartbreaking. Make several copies to reduce the odds of losing it. Make a good print using archival paper and inks and keep in away from light in a safe deposit box so it could be rescanned should the digital file become corrupted. Of course, one can go overboard as not every photo is worth that kind of effort but it appears we might be starting to see, first-hand, the problems described in Bergeron's "Dark Ages II". Even worse what if this were to happen? (So don't even bring up the "cloud", OK?)

--
CUR ALLOC 20195.....5804M
Re:Backup? by nine-times · 2014-06-14 03:03 · Score: 2

If I remember correctly, that's not how Apple's current backup system works. Every time a file gets written to, there's a log someplace that records that the file was modified. Next time Time Machine runs, it backs up the files in that log. If the OS didn't actually modify the file, it won't get backed up.
I may be wrong, but that's how I understood it.
Re:Backup? by Anonymous Coward · 2014-06-14 04:21 · Score: 2, Informative

Macosx Time Machine works by listening to filesystem events except for the first backup where everything is copied over as is. Bit rot doesn't get transferred until you overwrite the file, time by which it should have been obvious something was fishy or the bitrot was negligible and you didn't notice yourself. There are also situations where Time Machine itself says "this backup is fishy, regenerate from scratch?". Happened last week, but only after a failed drive had to be replaced which caused a 150GB backup.
Re:Backup? by gweihir · 2014-06-14 04:42 · Score: 2

You should have:
1. backups
2. redundancy
3. regular integrity checks of your data
Or alternatively, you should have been using an archival grade medium, like archival tape or (historically now unfortunately) MOD.
What the OP did is just plain incompetent and stupid and if he had spent 15 minutes to find out how to properly archive data, he would now not be in this fix. Instead he made assumptions without understanding or verification against the real world now blames others for his failure. Pathetic. Dunning-Kruger effect at work.

--
Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.

Re:Legacy file systems should be illegal by Anonymous Coward · 2014-06-14 01:39 · Score: 2, Informative

The problem is, neither ZFS or Btrfs would have stopped an arbitrary bit inside an arbitrary file from becoming corrupt if the disk failed to write it or read it correctly. Only multiple disks and redundancy would have solved that.

Bitrot not the fault of filesystem by Gaygirlie · 2014-06-14 01:40 · Score: 5, Insightful

Bitrot isn't the fault of the filesystem unless something is badly buggy. It's the fault of the underlying storage-device itself. Attacking HFS+ for something like that is just silly. Now, with that said there are filesystems out there that can guard against bitrot, most notably Btrfs and ZFS. Both Btrfs and ZFS can be used just like a regular filesystem where no parity-information or duplicate copies are saved and in such a case there is no safety against bitrot, but once you enable parity they can silently heal any affected files without issues. The downside? Saving parity consumes a lot more HDD-space, and that's why it's not done by default by most filesystems.

Re: Bitrot not the fault of filesystem by jmitchel!jmitchel.co · 2014-06-14 01:52 · Score: 4, Insightful

Even with just checksums, knowing that there is corruption means knowing to restore from backups. And in the consumer space most people have plenty of space to keep parity if it comes to that.

Re:Legacy file systems should be illegal by kthreadd · 2014-06-14 01:43 · Score: 5, Insightful

At least you would know that the file was corrupted, so that you could restore it from a good backup.

Re:Legacy file systems should be illegal by Chandon+Seldon · 2014-06-14 01:46 · Score: 3, Interesting

Btrfs (at least) can store multiple copies on one disk and use a checksum to identify the good copy to read. Obviously more disks is better, but...

--
-- The act of censorship is always worse than whatever is being censored. Always.

Re:Legacy file systems should be illegal by jbolden · 2014-06-14 01:47 · Score: 5, Insightful

Yes absolutely great idea! Rather than having technical decisions being made at tech conferences and among developers, system administrators and analysts we should move that authority over to legislature. Because we all know we are going to see a far better weighing of the costs and benefits of various technology choices by the legislature than by technology marketplace.

Apple used HFS+ because it worked to successfully migrate people from Mac OS9, it supported a unix / MacOS hybrid. They continue to use it because it has been good enough and many of the more robust filesystems were pretty heavyweight. I'd like something like BTFS too. But I don't think the people who disagree with me should be jailed.

ZFS, Apple! by grub · 2014-06-14 01:48 · Score: 2

This is why Apple should resurrect its ZFS project. Overnight they would be the largest ZFS vendor to match with being the largest UNIX vendor.

--
Trolling is a art,

Re:ZFS, Apple! by grahamsaa · 2014-06-14 02:12 · Score: 2

I'm not sure this is true. Other vendors like iXsystems already sell products that ship with ZFS. As I understand it, ZFS is BSD licensed. While Oracle distributes its own version of ZFS that may (or may not) include proprietary features, the open sourced version is freely distributable. The only reason it's packaged as a userland utility for Linux is that the BSD license isn't compatible with the kernel's GPL license. Apple's kernel is definitely not GPL, so this isn't a problem for them.

One problem might be that using ZFS without ECC memory can result in data loss, and ECC memory is more expensive (and not compatible with most consumer oriented processors that Intel makes). This would increase the cost of Apple hardware and could (possibly) be a hurdle, as Intel doesn't want to support ECC memory on their consumer oriented processors (as this could hurt sales of more expensive server-oriented processors. But Apple is a large enough vendor that they could probably negotiate something with Intel that could be workable.

That said, I don't know many Apple users that know what ZFS is, and it doesn't seem like there are many people clamoring for it. It would be a great addition to OSX though.

--
Facts have a liberal bias.
Re:ZFS, Apple! by Calibax · 2014-06-14 02:23 · Score: 2

No they would not be sued by anyone.
Sun open sourced ZFS under a permissive license. Oracle close sourced it again. However, a number of companies are supporting derivatives of the open source version.
ZFS is available for a number of operating systems today. A non-inclusive list:
FreeBSD from iXsystems
Linux from Lawrence Livermore National Laboratory and also Pogo Linux
SmartOS from Joyent
OmniOS from Omniti
Osv from CloudOS
In addition a number of companies are using ZFS in their products:
CloudScaling
DDRdrive
datto
Delphix
GE Healthcare
Great Lakes SAN
Losytec
High-Availability
HybridCluster
Nexenta Systems
OSNEXUS
RackTop
Spectra Logic
Storiant
Syneto
WHEEL Systems
Zetavault
ZFS can detect and correct silent corruption when configured to do so. I have a NAS that has 24 TB of raw storage, 16 TB of useable storage, running under OmniOS. I have well over 10 million files on the NAS (it is used as a backup for 8 systems) - I haven't lost a file in 4 years and I don't expect to lose any.
Re:ZFS, Apple! by Kaenneth · 2014-06-14 07:49 · Score: 2

Back when I did tech support for a lightweight Mac database product, they didn't use Parity (much less ECC) RAM.
I had a customer call in because students were continually getting corrupted databases on their assignments.
over the course of several phone calls, we narrowed it down to only happening in 1 of 3 labs.
After excluding anything high-energy (like a physics lab) in the building, I got the customer to reveal that they were constructing a new building next door, and the construction power tools were running off the same circuits as the computer lab...
They got the construction workers to use a different source of power, and the corruption problems disappeared.
Re:ZFS, Apple! by jbolden · 2014-06-14 08:11 · Score: 2

Apple did announce why the project failed. ZFS on consumer grade hardware with consumer interactions was too dangerous. Things like pulling an external drive out during mid write could corrupt an entire ZFS volume. Apple simply couldn't get ZFS to work under the conditions their systems need it to. They had to backout completely and come up with a plan-B. The developer who worked on this left Apple and now produces a better ZFS for OSX. That company got bought by Oracle so Oracle owns it now.

Re:Legacy file systems should be illegal by mgmartin · 2014-06-14 01:54 · Score: 4, Informative

As does zfs: man zfs
copies=1 | 2 | 3 Controls the number of copies of data stored for this dataset. These copies are in addition to any redundancy provided by the pool, for example, mirroring or RAID-Z. The copies are stored on different disks, if possible. The space used by multiple copies is charged to the associated file and dataset, changing the used property and counting against quotas and reservations. Changing this property only affects newly-written data. Therefore, set this property at file system creation time by using the -o copies=N option.

Re:Legacy file systems should be illegal by peragrin · 2014-06-14 02:04 · Score: 4, Interesting

This is something everyone forgets.
It takes decades to build long term reliable file systems.

ZFS, BTFS, are less than a decade old.
Windows runs on NTFS Version something. NTFS was started in what year?
HFS, and then HFS + was built in what year?
How long has Microsoft been promising WinFS?

File systems change but only slowly. This is good. you need a good long track record to convince people they won't lose files every ten years due to random malfunctions.

--
i thought once I was found, but it was only a dream.

So far what I lost... by cpct0 · 2014-06-14 02:08 · Score: 4, Interesting

Bitrot is not usually the issue for most files. Sometimes, but it's rare. What I lost is a mayhem repository of hardware and software and human failure. Thanks for backup, life :)

On Bitrot:

- MP3s and M4As I had that suddenly started to stutter and jump around. You play the music and it starts to skip. Luckily I have backups (read on for why I have multiple backups of everything :) ) so when I find them, I just revert to the backup.
- Images having bad sectors like everyone else. Once or twice here or there.

- A few CDs due to CD degradation. That includes one that I really wish I'd still have, as it was a backup of something I lost. However, the CD takes hours to read, and then eventually either balks up or not for the directory. I won't tell you about actually trying to copy the files, especially with normal timeouts in modern OSes or the hardware pieces or whatnot.

Not Bitrot:

- Two RAID Mirror hard drives, as they were both the same company, and purchased at the same time (same batch), in the same condition, they both balked at approximately the same time, not leaving me time to transfer data back.

- An internal hard drive, as I was making backups to CDs (at that time). For some kind of reason I still cannot explain, the software thought my hard drive was both the source and the destination !!!! Computer froze completely after a minute or two, then I tried rebooting to no avail, and my partition block was now containing a 700mb CD image, quarter full with my stuff. I still don't know how that's possible, but hey, it did. Since I was actualy making my first CD at the time and it was my first backup in a year, I lost countless good files, many I gave up upon (especially my 90's favorite music video sources ripped from the original betacam tapes in 4:2:2 by myself).

- A full bulk of HDs on Mac when I tried putting the journal to another internal SSD drive. I have dozens of HDDs, and I thought it'd go faster to use that nifty "journal on another drive" option. It did work well, although it was hell to initialize, as I had to create a partition for each HDD, then convert them to journaled partitions. Worked awesomely, very quick, very efficient. One day after weeks of usage, I had to hard close the computer and its HDD. When they remounted, they all remounted in the wrong order, somehow using the bad partition order. So imagine you have perfectly healthy HDDs but thinking they have to use another HDDs journal. Mayhem! Most drives thought they were other ones, so my music HDD became my photos HDD RAID, my system HDD thought it was the backup HDD, but just what was in the journal. It took me weeks sporting DiskWarrrior and Data Rescue in order to get 99% of my files back (I'm looking at you, DiskWarrior as a 32 bit app not supporting my 9TB photo drive) with a combinaison of the original drive files and the backup drive files. Took months to rebuild the Aperture database from that.

- All my pictures from when I met my wife to our first travels. I had them in a computer, I made a copy for sure. But I cannot find any of that anywhere. Nowhere to be found, no matter where I look. Since that time, many computers happened, so I don't know where it could've been sent. But I'm really sad to have lost these

- Did a paid photoshoot for an unique event. Took 4 32GB cards worth of priceless pictures. Once done with a card, I was sifting through the pictures with my camera and noticed it had issues reading the card. I removed it immediately. When at home, I put the card in my computer, it had all the troubles in the world reading it (but was able to do so), I was (barely) able to import its contents to Aperture (4-5 pictures didn't make the cut, a few dozens had glitches). It would then (dramatically, as it somehow have its last breath after relinquishing its precious data) not read or mount anywhere, not even being recognized as a card by the readers. Childs, use new cards regularly for your gigs :)

- A RAID array b

And the story is? by Immerman · 2014-06-14 02:08 · Score: 3, Insightful

Bitrot. It's a thing. It's been a thing since at least the very first tape drive - hell it was a thing with punch cards (when it might well have involved actual rot). While the mechanism changes, every single consumer-level data-storage system in the history of computing has suffered from it. It's a physical phenomena independent from file system, and impossible to defend against in software unless it transparently invokes the one and only defense: redundant data storage. Preferably in the form of multiple redundant backups.

So what is the point of this article?

--
--- Most topics have many sides worth arguing, allow me to take one opposite you.

Re:Detachment by NormalVisual · 2014-06-14 02:11 · Score: 2

Yeah, tell that to the IRS when you go to pull your records during an audit... ;-)

--
Please stand clear of the doors, por favor mantenganse alejado de las puertas

article is suspect, summary is worse by sribe · 2014-06-14 02:15 · Score: 4, Informative

In a footnote he admits that the corruption was caused by hardware issues, not HFS+ bugs, and of course the summary ignores that completely.

So, for that, let me counter his anecdote with my own anecdote: I have an HFS+ volume with a collection of over 3,000,000 files on it. This collection started in 2004, approximately 50 people access thousands of files on it per day, and occasionally after upgrades or problems it gets a full byte-to-byte comparison to one of three warm standbys. No corruption found, ever.

Clueless article by alexhs · 2014-06-14 02:27 · Score: 4, Informative

People talking about "bit rot" usually have no clue, and this guy is no exception.

It's extremely unlikely that a file would become silently corrupted on disk. Block devices include per-block checksums, and you either have a read error (maybe he has) or the data read is the same as the data previously written. As far as I know, ZFS doesn't help to recover data from read errors. You would need RAID and / or backups.

Main memory is the weakest link. That's why my next computer will have ECC memory. So, when you copy the file (or otherwise defragment or modify the file, etc), you read a good copy, some bit flips in RAM, and you write back corrupted data. Your disk receives the corrupted data, happily computes a checksum, therefore ensuring you can read back your corrupted data faithfully. That's where ZFS helps. Using checksumming scripts is a good idea, and I do it myself. But I don't have auto-defrag on Linux, so I'm safer : when I detect a corrupted copy, I still have the original.

ext2 was introduced in 1993, and so was NTFS. ext4 is just ext2 updated (ext was a different beast). If anything, HFS+ is more modern, not that it makes a difference. All of them are updated. By the way, I noticed recently that Mac OS X resource forks sometimes contain a CRC32. I noticed it in a file coming from Mavericks.

--
I have discovered a truly marvelous proof of killer sig, which this margin is too narrow to contain.

Re:Clueless article by rabtech · 2014-06-14 05:56 · Score: 2

People talking about "bit rot" usually have no clue, and this guy is no exception.
It's extremely unlikely that a file would become silently corrupted on disk. Block devices include per-block checksums, and you either have a read error (maybe he has) or the data read is the same as the data previously written. As far as I know, ZFS doesn't help to recover data from read errors. You would need RAID and / or backups.
I'm afraid it is you who is clueless. Up until ZFS started gaining traction, we all had the luxury of assuming the storage chain was reliable (RAM, SATA controller, cables, drive firmware, read/write heads, oxide layers, etc). Or at least we would know something went wrong.
But it was found that in the actual real world, these systems all silently corrupt data from time to time. The problem is much worse as the volume of data grows because the error rates are basically unchanged, meaning what was once expected to be a random bit flip that would strike one user out of a million once per year is now something that strikes every single user multiple times per year.
I'm not talking theory or what *should* happen. I'm talking about actual real world experience with check summing filesystems that demonstrate, beyond any doubt, that bit rot happens and happens far more frequently than most people believe. Actual experience with ZFS proves that disks can and **will** read back out different bits than what was written silently with no block read errors.
Further, you're increadibly ignorant of now ZFS or BTRFS deal with redundancy. You can setup to mirror blocks, in some cases on a per-file or directory basis, providing protection against corrupting. A background scrubber scans the disk when idle cycles are available and detects and repair corrupting from the available good blocks, or log an error if there are no good mirrors or parity blocks available.
With our new knowledge and experience it is no longer sufficient to cross our fingers and hope for the best. We cannot trust filesystems or the underlying hardware, we must verify.

--
Natural != (nontoxic || beneficial)

Re:reading checksum + n blocks is SLOW by jbolden · 2014-06-14 02:57 · Score: 2

You don't have checksum blocks in the space efficient method. Rather in the computational way I'm talking about it is a transformation. You might have something like every 6354 bits becomes 6311 bits after the complex transformation. It doesn't slow down the read but you have to do math.

Re: Legacy file systems should be illegal by the_B0fh · 2014-06-14 02:58 · Score: 3, Informative

Not if your OS is tied intimately to your filesystem. Linux might not, because a large number of things are abstracted out, but FreeBSD depends on its file system, Solaris took a very long time/effort before it could boot off ZFS. Forget about moving Windows off NTFS. Apple actually did some work on putting it onto ZFS, maybe they will continue.

Re: Legacy file systems should be illegal by aix+tom · 2014-06-14 03:01 · Score: 5, Interesting

A database is something special

I basically make a "full backup" of my Oracle DBs once a week, and a "incremental backup" in the form of DB change logs every five minutes. (that is, the change logs are pushed "off site" every five minutes, of course they are being written locally continuously with every change.

The thing with backups, though, is not only to make them often but to also *check* them often. With my DBs there is a handy tool where I can check the backup files for "flipped bits" because there are also checksums in the DB files.

For my "private backups to DVD/BR" I only fill them up to ~70%, and fill the rest of the disk with checksum data with dvdisaster., for other "online backups" I create PAR2 files that I also store. With those parity files I can check "are all bits still OK?" now and then, and repair the damage when/if bits start to rot in the backup. In the 10 years I do this, with ~150 DVDs and ~20BRs so far I had 2 DVDs that became "glitchy", but because of the checksum data I was able to repair the ISO and re-burn them.

Basically, IF you go through the trouble of setting up an automated backup system either with software or with your own scripts, It doesn't add much work to also add verification/checksum data to the backup. And that goes a long way into preventing data loss due to bit rot.

Re:HFS reliability by sribe · 2014-06-14 03:05 · Score: 3, Insightful

Anyone who owned a Mac since the 80s remembers having to use Norton Disk Doctor and later DiskWarrior at least once per month to repair the filesystem. Entire folders could go randomly missing each time you booted up your Mac, and if you accidentally lost power to your hard drive, the use of one of those was mandatory.

Oh yes. I remember those days well. Journaled HFS+ fixed that, and for about the last decade the only times I have encountered a corrupted file system on a Mac, that discovery was followed shortly by total failure of the hard disk.

So, what was your fucking point?

It's all about ERROR rates by bussdriver · 2014-06-14 03:53 · Score: 2

RAM may have a low error rate much better than HDDs or SDs. That does not mean that you won't have errors even if you have a good brand and treat it well. Bit-level errors can and do happen all the time without us knowing; other times it happens in the wrong place and we notice (but think it is something else) it isn't until it gets really bad that we notice.

Example, say your RAM has a 1% bit loss rate (ignore that is insanely high) well if 90% of your data is not touchy code but data, the odds are that you may not notice 1 bit getting flipped that often. Then you have the fact that RAM could maintain that error rate over decades of smaller faster RAM but now you are storing MORE data and cycling it MORE than was possible on the older computers. So, if you had 1 bit error every gigabyte of throughput on a slow 1Mhz computer with 1MB of RAM it would take a long time for that 1% bit flip to happen (and if you noticed you'd still not likely blame the RAM) -- but today pumping though in seconds what that old machine would take a year; the error would occur quite often. SAME problem with storage but with an additional problem in that they still have the same lifespan requirements - RAM can be refreshed can checked.

Something else to be considered, the error correction schemes being used today are being pushed by the demand for higher density storage. Your HD isn't doing huffman or any of those old simple bit recovery schemes they've moved beyond that long ago to the next gen stuff from what your 56k modem was doing to fight phone line noise. They could make it better... but you would be giving up significant storage space. Perhaps somebody with a good marketing scheme and enough upset consumers could get you to pay MORE for less storage space... I know I would buy into it.

Essentially, we are at a point where HDDs expect you to scrub them for errors every year to avoid the bit rot... which is what I now do... haven't detected an error in years... however, the block level checksums the HDD uses has false positive error rate (just like CRC16 does) and the odds of a false positive may be poor--- again, we are working in the trillions now-- up near it's limitations (I'm assuming whatever they use now scaled... but it may not have which is why more people are talking about these issues. We know it's unlikely industry has adapted to the trends evenly over the decades... it's likely become a minior problem before they are forced to change devices to a newer proprietary checksum and error correction scheme. )

Do serious work? use ECC RAM. I'm still waiting for some low power AM1 motherboard that supports ECC so I can build a ZFS server... the AM1 chip supports ECC but no motherboards do.

--
Democracy Now! - uncensored, anti-establishment news

Re:Legacy file systems should be illegal by gweihir · 2014-06-14 04:20 · Score: 2

Bullshit. Anybody doing competent archiving will either use professional archive-grade tape or spinning disks in RAID that gets checked frequently and with a second copy on a geographically independent location. I do that and my loss statistics for the last 14 years is exactly zero. I do have to replace a disk about once every 2 years in the 3-way RAID1 I use as primary archive though. This RAID runs with full SMART self-test every 7 days and RAID consistency check (full data compare) every 14 days. Expecting your data to not rot if you do not maintain is is just plain incompetent.

There used to be one consumer-affordable medium that was archival-grade as well: MOD. I used them for about 10 years and never lost a single bit. Then it became to hard to get a replacement drive, and I moved to the spinning disk solution with additional off-site copies. It seems the consumer does not actually care about long-term archival or at the very least is far to stingy to pay anything for it. Otherwise MOD would have lived on. And do not even get me started about trash like "archival grade" writable DVD/CD. They are not.

--
Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.

Re:Legacy file systems should be illegal by gweihir · 2014-06-14 04:22 · Score: 2

Indeed. And only regular checking can detect it (ling SMART self-test and RAID consistency check every 7-14 days). The OP simply is naive and did not bother to find out how to properly archive data.

--
Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.

Re:how is this a file system problem? by kthreadd · 2014-06-14 04:47 · Score: 2

The file system can do quite a bit if it actually does consistency checks on the data when reading it. ZFS does this and will alert you if the contents of a file has changed after it was last written, allowing you to restore a good copy from backup and verify that it is still valid.

Re:Legacy file systems should be illegal by NJRoadfan · 2014-06-14 04:57 · Score: 2

Think of HFS+ as the equivalent of FAT32 for Macs. Its basically the old file system with support for larger drives and files. Apple latter tacked on journaling in OS X 10.3. I'm surprised Apple didn't push for a replacement file system after the switchover to Intel CPUs.

Re: Legacy file systems should be illegal by O('_')O_Bush · 2014-06-14 05:30 · Score: 3, Informative

Vs the chances you back up already corrupted files and don't notice until you've aged off the good versions.

--
while(1) attack(People.Sandy);

Re: Incompetent -- Learning Archival Strategies by FaxeTheCat · 2014-06-14 07:37 · Score: 2

Losing a day's or even an hour's data entry is not an option.

If you have that kind of requirements (less than an hour lost data), then you are not looking for just backup/archive. You are looking for a fully redundant storage system.
In addition to the backup system, of course.

For reading, check up on backupentral.com, Symantec.com (Backup Exec/Netbackup) emc.com (Avamar, networker).
I once managed a Filemaker database server (v5), and it has a built in featuer to copy the database files for backup. Real simple. Cannot remember if the database had to be taken offline, as we had users only during normal working hours, but these days that should NOT be a requirement.

Re: Legacy file systems should be illegal by jbolden · 2014-06-14 07:39 · Score: 2

The developer involved left Apple, went off to found his own company. Completed the work and then got acquired by Oracle. http://getgreenbytes.com/solut...
So it is in some sense done. The question is whether Apple is going to buy an Oracle product or Oracle will sell or ...

Re: Legacy file systems should be illegal by maccodemonkey · 2014-06-14 07:57 · Score: 3, Informative

They did try and replace the file system around the time of the Intel switch. Got killed by licensing problems.
http://appleinsider.com/articl...

Re:Legacy file systems should be illegal by Guy+Harris · 2014-06-14 08:37 · Score: 2

10.5 added hardlinking.

Are you certain? The ln command, when run without -s, would return an error if you used it under Tiger or earlier?

Or are you referring to hardlinking to directories, which was something UNIX traditionally supported, but which required root permissions (as it was used by the mkdir command to create the . and .. directories), and which was removed at one point (4.2BSD, as that added the mkdir() system call, making the ability of link() to link to a directory unnecessary?), and added back in 10.5 with the introduction of Time Machine, so that it could be used in backup trees as a very hacky form of de-duplication (each backup tree is a complete copy of the file system being backed up, but if there's an older copy of an unchanged file or a directory everything under which is unchanged, the "copy" is done by making a hard link rather than by copying the file to the backup disk).

Instead of tackling "bitrot" head-on, Apple seems to have taken the "make backups easy" approach. This works to some degree, but since the backups use hardlinking, you really only have two copies of the data -- the one on your main drive, and the one on your backup drive. This makes cycling your backup drives even more important than it already was.

That's what happens with any backup scheme that does incremental backups - if a file hasn't changed, a copy isn't made.

Re: Incompetent -- Learning Archival Strategies by WheezyJoe · 2014-06-14 08:39 · Score: 2

Simple: It is a "Datasheet" covering an "archival grade medium". If you do not know that, you have absolutely no business working on any kind of "mission critical" storage, as you are simply incompetent with regard to that subject.

Easy, there, big fella. Posting a link to a datasheet would have sufficed. Ain't right to call a man incompetent for asking a question. Truly, an incompetent is one who don't never ask the question assuming he already knows. Credit is due for seeking to learn something.

--
Take it easy, Charlie, I've got an Angle...

Re:Legacy file systems should be illegal by CauseBy · 2014-06-16 02:59 · Score: 2

I don't know if any popular filesystems do so, but there are ways to write data to disk such that flipped bits can be detected and corrected. I remember studying this in comps sci class way back in the Clinton/Bush transition era. So multiple disks and redundancy is one good solution but another is a filesystem that can recover lost bits.

If you need to store 64 bits of data you can imagine laying them out in an 8x8 square. Now, widen the square to 9x9 and write a checksum/evenness bit in the extra column to the right, plus the extra row on the bottom. So, if 8 bits sum to checksum 5, then your evenness bit is a 1 making 5+1 an even number. If the checksum is 4, your evenness bit is a 0.

Now, if one of the bits in the 8x8 square is flipped accidentally, then you can detect it and fix it by looking at your extra row-and-column of evenness data. If one bit in the 8x8 grid gets flipped, then the evenness bits in one row and one column will be wrong, reliably indicating the bad bit.

As a bonus, you have one extra bit in the bottom right corner which can act as a final checksum on the evenness bits. That allows you to detect if your checksums are valid. If only one bit in the whole 9x9 grid is flipped, you should be able to detect it and correct it no matter where it is.

I don't know much about real-world filesystems so I don't know if that is a common procedure or not.

Slashdot Mirror

One Developer's Experience With Real Life Bitrot Under HFS+

46 of 396 comments (clear)