A Good Filesystem for Storing Large Binaries?

jfs ? by jisom · 2006-02-11 16:48 · Score: 1, Informative

jfs is about the only one not mentioned that in linux.

The answer is... by Anonymous Coward · 2006-02-11 16:49 · Score: 0

JFS.

Next question.

JFS by member57 · 2006-02-11 16:51 · Score: 4, Informative

I use JFS on RAID 5, no errors, uptime of 200+ days currently. Handling large files 200-300MB each all day long. Excellent performance.

--
If Kerry was the answer, it must have been a stupid question.
The UN - The largest "political" cause of death.

Re:JFS by Anonymous Coward · 2006-02-11 17:12 · Score: 0

Indeed, it seems well rounded, fast and journaled, yet keeps itself out of the processor.

I've never lost data on a JFS partition, and I get angry at my machines a lot and just yank power cords.
Re:JFS by Anonymous Coward · 2006-02-11 18:32 · Score: 0

I've lost just about every single ext2 or 3 partition I've ever made, not just files, the whole partition, to simple power loss during idle times.

Jfs hasn't died on me yet through similar circumstances.
Re:JFS by ePhil_One · 2006-02-11 19:14 · Score: 2, Interesting

The only problem I've ever run into on JFS was trying to use it with NFS (specifically recommended against at the time). The JFS team was very responsive to my problems and very shortly had all the issues worked out. And during all the problems, the FS never lost a file that had been successfully written, even though the kernel locked up in interesting ways. Note that Linux JFS maps to AIX's JFS2 and is extent based, and very different animal than the original JFS. I'm not sure if this has been implemented on the mainframe as well, AIX is NOT the native OS of the IBM mainframe, FWIW, but given the amount of error checking that goes into their mainframe products I find it doubtful that files disappeared for any reason other than user-error.

--
You are in a maze of twisted little posts, all alike.
Re:JFS by dtfinch · 2006-02-11 20:57 · Score: 1

No errors in 200 days is normal with most filesystems I think. All our servers and my home desktop use ext3, which I've had no problems with. I've used JFS, but only recently. Short of other posts in this thread, I've heard only good things about JFS though.

I did see a Red Hat bug report a while back about very large file write performance issues on ext3: https://bugzilla.redhat.com/bugzilla/show_bug.cgi? id=156437. I don't know if the fix is in the official kernel yet, or if it was just a RHEL specific bug. The bug report's status is still "ON_QA". A successful workaround for now was to mount the partition with the noreservation option.
Re:JFS by IdleTime · 2006-02-12 02:45 · Score: 1

200-300MB are small files.

How big are the files? Hundred of gigabytes? terrabytes? Most non-commercial filesystems are very inefficient when it comes to handling files in the hundreds of gigabyte size.

Efficient handling of such large files requires that you also have disk hardware and I/O subsystems to handle it. If you have a large number of small files, the I have no idea, I only deal with VLF (Very Large Files, starting at 100GB+)

--
If you mod me down, I *will* introduce you to my sister!
Re:JFS by Covener · 2006-02-12 04:36 · Score: 1

I've used JFS on an IBM mainframe and on my Linux workstation

AFAIK, JFS doesn't run on IBM mainframe operating systems. The "unix like" hierarchal filesystems available on the mainframe are HFS and zFS.

Perhaps you used JFS on a big AIX cluster?
Re:JFS by electrofreak · 2006-02-12 05:59 · Score: 0

I've lost just about every single ext2 or 3 partition I've ever made, not just files, the whole partition, to simple power loss during idle times.
As have I. I've never actually used JFS, but I hear good things about it. I use reiserfs (v3.6, of course) everywhere (my servers and desktops) and haven't had any major problems with it yet. Only an accidental 'rm -rf *' at the root DIR, but that's certainly not the FS's fault.

--
I need a sig.
Re:JFS by Steinfiend · 2006-02-12 07:17 · Score: 1

This question isn't meant in a negative way at all, so please don't take it that way...

What type of data requires a file 100Gb in size?
Re:JFS by IdleTime · 2006-02-12 07:21 · Score: 2, Informative

many different things...

Drilling data and real-time sensor data from oil-wells in the North-Sea is one example. Observational data from particle accelerators, weather data.... many areas have HUGE data storage requirements.

--
If you mod me down, I *will* introduce you to my sister!
Re:JFS by Nutria · 2006-02-12 07:28 · Score: 2, Informative

What type of data requires a file 100Gb in size?

Thermouclear explosion simulations, for one.

On a more prosaic level, we've got databases that have hundreds of millions of rows, regularly growing into the billion+ size. Yes, it's partitioned, but the partitions are still huge.

--
"I don't know, therefore Aliens" Wafflebox1
Re:JFS by chris234 · 2006-02-12 07:31 · Score: 2, Informative

While not 100Gig, HDTV recordings can easily run 20-30Gig for long shows (I think my recording of the Olympic opening ceremonies was 21Gig for 3.5 hours).
Re:JFS by fordboy0 · 2006-02-12 08:14 · Score: 1

I've used JFS on an IBM mainframe and on my Linux workstation.

Last time I checked, the pSeries and the iSeries are not considered "mainframes", and I KNOW the zSeries doesn't use JFS...
So, on the pSeries (IBM's AIX box), you have the option under V5R4 (The current version) to run JFS or JFS2. JFS2 has some performance improvements (They never advertise a degrade, do they?) as well as being able to separate the journal from the data (inline vs. not)
Keep in mind that a *flakey* system can cause you to lose data on any filesystem. IMHO, EXT3 is the most fragile of the Linux Journalling FS's. Reiser3 a good, fast desktop FS (I ususally use Reiser3 for everything but /boot (EXT2/3) and /home (JFS)), but if you want server-class reliability and performance, it's JFS followed closely by XFS (JFS being the speedier of the two, in _most_ of the serving cases I've experienced)
As a side note, you always have to keep XFS as an option, since it allows for synchronized point-in-time snapshots (using LVM and xfs_freeze), something none of the other FS's offer on Linux. On a busy DB or mail server this is a major bonus if data integrity is of the utmost. The only thing better is apllication quiescing, but again, on a busy server this may be less than optimal. Anyone who has had to replay journals to bring data back in sych know to what I'm referring.
-FB0

--
Ligaguinggligagiggagoogoogwillgo
Re:JFS by andreyw · 2006-02-12 12:29 · Score: 1

Nice troll Rockway. Too bad it's complete bullshit.
Re:JFS by jrockway · 2006-02-12 12:39 · Score: 1

Remember when tigger was down for a week? That was because of JFS corruption issues. And a good portion of my MP3 collection was wiped out when I stored them on a JFS partition.

Obviously this is anecdotal evidence, but I'm not making anything up.

--
My other car is first.
Re:JFS by Nutria · 2006-02-12 12:49 · Score: 1

On a busy DB or mail server this is a major bonus if data integrity is of the utmost. The only thing better is apllication quiescing, but again, on a busy server this may be less than optimal.

Taking FS snapshots of an open, active database is a recipe for disaster, IMNSHO.

There are uncommitted transactions, buffered data, etc, which make the on-disk "stuff" transactionally inconsistent. DBMS vendors write backup utilities for a reason.

As a DBA of very large systems (TB-sized, with billion-row tables), I would never do FS snapshots, and would fight it like a wildcat if ordered to do so.

--
"I don't know, therefore Aliens" Wafflebox1
Re:JFS by dtfinch · 2006-02-12 13:06 · Score: 1

Hard disk images I suppose.
Re:JFS by hopews · 2006-02-12 16:35 · Score: 1

What database software do you use for those databases? Anything open source?
Re:JFS by LO0G · 2006-02-12 17:11 · Score: 1

Email. 10,000 users at 500,000,000 bytes of email storage each means 5TB of data to hold the email.

Since most email messages are in the 1-5K range, storing that much data in separate files is a disaster.

It's better to split the database up into a couple of hundred hundred G or so databases.

Also satellite imagery take up a TON of disk space.
Re:JFS by elint · 2006-02-12 20:03 · Score: 1

virtual machines (VMWare), where a virtual machine's 100GB virtual HDD is stored as a single 100GB vmdk file. Though this is a specific case, and VMWare has their own custome vmfs filesystem to support these files.

--elint
Re:JFS by member57 · 2006-02-13 07:54 · Score: 1

I didn't say they 200-300MB was huge, that is what sized most of my files are, separate databases, etc... JFS should handle VERY large files without problem.

--
If Kerry was the answer, it must have been a stupid question.
The UN - The largest "political" cause of death.
Re:JFS by member57 · 2006-02-13 07:57 · Score: 1

Oops, I guess I did sat that 200-300MB was large, err, well I didn't mean it, and I didn't inhale....

--
If Kerry was the answer, it must have been a stupid question.
The UN - The largest "political" cause of death.
Re:JFS by Nutria · 2006-02-13 08:32 · Score: 1

What database software do you use for those databases? Anything open source?

Oracle Rdb.

Runs on VMS, DEC created it in the early 80s and sold it to Big O in 1994. Former DEC engineers are still plugging away at it up in Nashua, just across a garden path from the HP OpenVMS engineering building.

In fact, they just ported it to OpenVMS Itanium.

--
"I don't know, therefore Aliens" Wafflebox1
Re:JFS by eno2001 · 2006-02-13 10:14 · Score: 1

Well... my home brew PVR's pause file can grwo pretty large prety fast. If I pause TV for a week, I could easily have a 100+ gig file. It's not unheard of.

--
-"...bad old ideas look confusingly fresh when they are packaged as technology" - Jaron Lanier (Digital Maoism on Edge.o
Re:JFS by CMiYC · 2006-02-13 14:01 · Score: 1

Data Acqusition.

High-Performance Logic Analyzers and Oscilloscopes can generate 100Gig data files with no problem. Nearly every electrical engineer uses them (however they rarely collect that much data.)

The Google Filesystem by benploni · 2006-02-11 16:54 · Score: 5, Funny

Google made a filesystem for exactly that purpose: storing HUGE files highly reliably. OK, so it's not publically available, but it's still perfect for you (other than that).

Re:The Google Filesystem by MillionthMonkey · 2006-02-11 18:40 · Score: 1

I was going to say the same thing. I remembered reading Google's description of their file system as soon as I read the guy's question. It would be perfect for him.
Re:The Google Filesystem by hanwen · 2006-02-12 00:55 · Score: 4, Insightful

Google made a filesystem for exactly that purpose: storing HUGE files highly reliably. OK, so it's not publically available, but it's still perfect for you
I doubt that. To run GFS (assuming you have the code), you need to have a big honking cluster, to replicate data across machines. Also, it assumes a different file semantics, so you need to hand-code your apps to use the different reading and writing semantics. It only works well for appending writes and streaming reads. Furthermore, GFS does not have file-locking, and concurrent writes will leave your files in an undefined states.

--
Han-Wen Nienhuys -- LilyPond
Re:The Google Filesystem by cloudmaster · 2006-02-12 04:02 · Score: 1

Don't type GFS when you mean Google File System. GFS is the Global File System.
Re:The Google Filesystem by jZnat · 2006-02-12 04:50 · Score: 1

With Reiser4, I probably could write a plug-in to emulate this type of behaviour, but that's not a very good immediate solution, is it? :-/

--
'Yes, firefox is indeed greater than women. Can women block pops up for you? No. Can Firefox show you naked women? Yes.'
Re:The Google Filesystem by geniusj · 2006-02-12 04:58 · Score: 1

It was perfectly clear what he meant in the context he was using it in. Just take it easy.
Re:The Google Filesystem by Anonymous Coward · 2006-02-12 06:57 · Score: 0

Make me.
Re:The Google Filesystem by KarmaMB84 · 2006-02-12 10:38 · Score: 1

The Google File System paper refers to the FS as GFS so shush.
Re:The Google Filesystem by Anonymous Coward · 2006-02-12 11:05 · Score: 0

That's some silly FUD.

The Lucene/Nutch project has a F/OSS port, so it's not that hard to get the code.

You don't need a big cluster. A small (3 machine) cluster is fine.

Yeah, you need different semantics. The user _wants_ different semantics because these differences in the semantics are well suited to managing large files. If he wanted normal-file-tuned filesystems, he'd be happy with one.
Re:The Google Filesystem by PornMaster · 2006-02-13 03:42 · Score: 1

Well, he did ask for something for WORM-like access, so I don't think write concurrency would be an issue. Not to say that GoogleFS is appropriate, but multiple writers to the same file shouldn't be an issue, or could be worked around for the first write with a "staging" filesystem.

--
500GB of disk, 5TB of transfer, $5.95/mo
Re:The Google Filesystem by cloudmaster · 2006-02-14 05:08 · Score: 1

So, when the person who asked the question looks at the list of packages his distro ships and sees GFS, what's he gonna think? That there's a package for Google File System available? Given that he obviously doesn't know much about the different filesystems available, I felt it relevant to point out that the commonly available GFS is something else precisely *because* it was clear what GFS referred to in context. :)

Never mind that the Global File System would actually be a decent option for this use...

FAT16 by Anonymous Coward · 2006-02-11 16:55 · Score: 0, Funny

FAT16 is pretty damn good for DVD backups.

Re:FAT16 by Anonymous Coward · 2006-02-11 16:58 · Score: 0

FAT16? What a waste!! Surely you mean FAT12!
Re:FAT16 by MarkRose · 2006-02-11 17:05 · Score: 0, Troll

Don't be retarded: FAT16 can't store more than 4 GB of data, which most DVD's exceed.

--
Be relentless!
Re:FAT16 by John+Nowak · 2006-02-11 21:00 · Score: 3, Funny

You must be really fun at parties.
Re:FAT16 by Steinfiend · 2006-02-12 07:19 · Score: 1

I went with him to a party once, and to be fair he was quite fun. That thing he did with the chicken and the toaster was a little near the knuckle for me though...

Try JFS? by hunterkll · 2006-02-11 16:57 · Score: 2, Informative

I've been using JFS for about two months now, and it's been quite a plesant experiance with my anime storage. I run it on a 1.6TB array, four 400GB harddrives. It's preformance is damn fast from what I've observed copying too/from a JFS firewire drive. I trust it enough to keep data that I can't back up on it until I can get another identicle array to mirror with - only drive failure will seem to kill this FS, and these drives are about ~3 months old, so failure isn't that much of a concern anymore. I'd reccomend it for storage, havn't tried it as a system FS yet.

Re:Try JFS? by Anonymous Coward · 2006-02-12 01:51 · Score: 0

these drives are about ~3 months old, so failure isn't that much of a concern anymore

That may be the weirdest thing I hear all day today.
Re:Try JFS? by swilver · 2006-02-12 04:03 · Score: 1

You run a 4 drive array, without redundancy? That takes guts I suppose...
Re:Try JFS? by networkBoy · 2006-02-12 05:13 · Score: 1

balls of solid brass i tell ya, balls of solid brass.

Funny, b/c I too have a 4 drive array . . . 4 to make 1, the true paranoid array.
-nB

--
whois gawk date unzip strip find touch finger mount join nice man top fsck grep eject more yes exit umount sleep dump
Re:Try JFS? by hunterkll · 2006-02-12 06:22 · Score: 1

I accumulate on average 1-2GB of data a day (Thank you coffee shop w/free wireless!:D pounces my cellular conn any day!) It pinched my wallet to get enough to -store- it I'll probably set up a 4 + 4 = 4 array somehow, I'll either need a super raid card (anyone know of one?) or a second card and software mirroring ... i'd like a super card so I can rebuild using the card itself in case of a drive failure, but I suppose a simple dd would work to restore (or whatever linux offers for this, havn't researched it)
Re:Try JFS? by da5idnetlimit.com · 2006-02-12 12:07 · Score: 1

Not a "Super Card", but 3ware is the brand you want to see : http://www.3ware.com/products/index.asp

I think they have a model up to 16 drives, but you should look at the 9550SX-8LP, 8 drive ATA controller.

For SATA : model 8506-8
Up to 8 Serial ATA
0,1,10,5, JBOD

Departmental Servers, Security and Surveillance, Disk-to-Disk Backup...Raid 10 on two arrays looks good to me....

For max size, I think would opt for a RAID5 on 7 disks and keep the 8th drive as a "hot-remplacement" for the soon to be faulty drive...

7*500Gig/RAID5 should give you 3 Tera with (some) data protection and an hot-plug replacement...

Anyone cares to counsel ?

--
It takes 40+ muscles to frown, but only four to extend your arm and bitchslap the motherfucker
Re:Try JFS? by dwater · 2006-02-12 12:42 · Score: 1

> For max size, I think would opt for a RAID5 on 7 disks and keep the 8th drive as a "hot-remplacement" for the soon to be faulty drive...

I would consider RAID6 instead since it uses the 8th drive for more parity storage to give extra reliability.

--
Max.
Re:Try JFS? by da5idnetlimit.com · 2006-02-12 12:49 · Score: 1

I thought of RAID6, but the card doesn't seem to support it natively....

Damn, that guy just got me thinking at my next fileserver 8)

--
It takes 40+ muscles to frown, but only four to extend your arm and bitchslap the motherfucker
Re:Try JFS? by Nutria · 2006-02-12 12:54 · Score: 1

balls of solid brass i tell ya, balls of solid brass.

Brains, not balls, of brass.

--
"I don't know, therefore Aliens" Wafflebox1
Re:Try JFS? by dwater · 2006-02-12 13:16 · Score: 1

Ah, right. I am always thinking of s/w RAID these days. I don't much care for hardware solutions (too inflexible).

--
Max.
Re:Try JFS? by Anonymous Coward · 2006-02-12 16:54 · Score: 0

Take your cpu overhead and go away. Have you even bothered to look at processor usage from it?!?!

I'll take my battery backed hardware raid. Why waste cpu cycles when I can just add a dedicated CPU for cheap.
Re:Try JFS? by da5idnetlimit.com · 2006-02-13 01:07 · Score: 1

Using a "modern" cpu makes software overhead not that much of a problem, in my opinion - except when rebuilding the parity on the array after a fault.

When that happen, I hope my 2Ghz+ cpu will be more efficient than the dedicated cpu.

OTOH I intend to have the fileserver do more than just serving files, so a dedicated hardware for managing the drives can be a good idea when compiling/encoding/capturing/whatever.

Right now I'm using 3 hdds with no parity, no raid, no jbod, no security, but the server is also the Torrent downloader, web machine, ...(Ubuntu, Duron 1.3 ghz, 512 Mo RAM and a Nvidia TNT2 Ultra for that extra fps on Quake linux 8p, cannot do much less in hardware power)

Next machine will be a real fileserver, possibly based on the Dual PIII 1Ghz I have sleeping all the time... I just need an additional 1.5 gig ram (120eu), 8*500gig hdds (2500eu?), a 3ware card (600eu) and a new 600W psu (100eu)...Gosh, I gonna be broke 8)

--
It takes 40+ muscles to frown, but only four to extend your arm and bitchslap the motherfucker
Re:Try JFS? by slaker · 2006-02-14 03:54 · Score: 1

I'm hovering at around 12TB for the computers in my home. All commodity hardware.

1. IIRC 3ware cards are limited to 2TB per array in 32-bit OSes. Not a HUGE deal, but something to keep in mind if you're running Windows or haven't gotten around to installing 64-bit Linux on your Athlon64.

2. I like having dense arrays, but I also like mirrored data. Each of my fileservers therefore has 2 arrays - a RAID5 on a 4-port 3Ware card and a software RAID0 (It's that or JBOD). I replicate the changes on each of the RAID5s on the RAID0 of a different machine (in a different room) twice a day.
RAID0 is generally a bad idea for storing, oh, pretty much anything. I'm not a big fan of it. On the other hand, I'm storing redundant data to begin with.

--
-- I wanna decide who lives and who dies - Crow T. Robot, MST3K

Filesystem choice... by strredwolf · 2006-02-11 16:57 · Score: 3, Insightful

So lets get this straight:

You need a filesystem that can be "burned" to a medium, yet have error correction capability.

Journaling doesn't do this. Journaling is for when you get a power surge in the middle of a write, you can get some of the data back. Currently no regular FS can do that.

--

--
# Canmephians for a better Linux Kernel
$Stalag99{"URL"}="http://stalag99.net";

Re:Filesystem choice... by RedLaggedTeut · 2006-02-11 21:30 · Score: 0

Thanks for summing up the question, because I didn't RTFA.

I would suggest creating sort of an RAID with DVDs, i.e. a redundant array of indepedent removable media, an "RAIRM". Remember, you heard it first on slashdot ;-)

--
I'm still trying to figure out what people mean by 'social skills' here.
Re:Filesystem choice... by Anonymous Coward · 2006-02-12 03:46 · Score: 0

Journaling is for when you get a power surge in the middle of a write, you can get some of the data back.

Urm no, Journaling is to ensure that no matter what happens, you end up with a consistent file system. It has nothing to do with data preservation. You will lose data with a journaling FS, unfinished writes (whether because of power surges or kernel lock-ups or whatever), will be rolled back: data loss, but a consistent filesystem.
Re:Filesystem choice... by Anonymous Coward · 2006-02-12 07:06 · Score: 0

You need a filesystem that can be "burned" to a medium, yet have error correction capability.
No he wants a disk filesystem to back up rips of his CDs and DVDs (large files).
Re:Filesystem choice... by Anonymous Coward · 2006-02-13 23:21 · Score: 0

zfs can.
(but since sun is evil here I will remain AC)

GFS by Anonymous Coward · 2006-02-11 17:00 · Score: 0

Chop the files up. Study Google File System. They store things as 'shards' which are little pieces that don't care where they live. The redundancy gives them speed and robustness. There is a white paper on teh Interweb about this.

Database companies have similar problems by aralin · 2006-02-11 17:00 · Score: 2, Interesting

I know this is not exactly, what you are looking for, but database companies have very similar problem on their hands and since filesystems usually are not quite good for this type of work, they usually come up with their own systems for handling raw disks. For example Oracle has its ASM (Automated Storage Management). You might want to look into these if they are not customizable for your problem or contact the relevant companies for specifics.

--
If programs would be read like poetry, most programmers would be Vogons.

Re:Database companies have similar problems by jZnat · 2006-02-12 04:54 · Score: 1

I've dealt with said database problems albeit in a probably smaller manner. The database in question was a MyISAM (MySQL) format reaching a size of at least 2 GB and increasing at probably 50 MB/day or so. Our only solution we could work with at the time was splitting the database up into smaller files and even across multiple harddrives to decrease access time (some funky hacks that could've been better if we were using a better cluster, I'm sure).

Then again, IANADBA, so I don't know the extent of how large a single database file can get.

--
'Yes, firefox is indeed greater than women. Can women block pops up for you? No. Can Firefox show you naked women? Yes.'
Re:Database companies have similar problems by StillDocked · 2006-02-12 05:50 · Score: 2, Interesting

In that vein, OCFS2 should do exactly what you would like from a FS. It is easier to set up than RAW/ASM, and if configured properly, it can be rock solid. There are good resources available for installing and configuring.
Re:Database companies have similar problems by PinkPanther · 2006-02-12 11:36 · Score: 1

Depends on the technologies, of course. Some code bases have hard-written limits due to internal data structures, whereas others will be limited by the filesystem itself.
Here's an example of one well designed, well written database's limits.

--
It's a simple matter of complex programming.

Why is this modded troll | Re:Filesystem choice... by hunterkll · 2006-02-11 17:02 · Score: 2, Insightful

Okay, why is this modded troll again? He misunderstood the direction of the backups, he wasn't trolling! If anything, it's -1 WRONG, not troll! Anyway - he backs up data FROM DVDs and CDs, not TO

reiser or jfs by william_w_bush · 2006-02-11 17:09 · Score: 2, Informative

reiser or jfs are both solid for this kind of work, with large file and volume support. personally i swear by reiser for my 2tb volume, and have had no problems so far, although there is a minor speed penalty when working with several multi-gigabyte files at once, something to do with shared fs locks/mutexes i'd imagine.

OTOH JFS is quite stable, and though it has less of the elegance in feature set I find in reiser, tends to make up for it with enhanced ruggedness and its handling of large volume/files.

Really can't recommend anything else, as you say, reiser4 is still untested for reliability imho, xfs has issues that vary from kernel to kernel, and ext3 appears quite primitive in comparision, although its journaling seems comparable to the other choices.

JFS if you need the speed, its dead fast in large scales, slower with small files, otherwise Reiser3 is an excellent all-round performer.

--
The first rule of USENET is you do not talk about USENET.

Re:reiser or jfs by Anonymous Coward · 2006-02-11 17:37 · Score: 0

Thank you for not being a Reiser zealot. And thank you for even mentioning JFS(the one nobody looks at) :)

Possibly... by dcapel · 2006-02-11 17:10 · Score: 4, Funny

p2pfs?

Just upload to bittorrent, ftp, or some other p2p system, and redownload it if you need it again!

Some small security issues may apply though...

--
DYWYPI?

Re:Possibly... by Anonymous Coward · 2006-02-11 17:28 · Score: 0

I can see it now...

"48 days at 98.6%. Someone please reseed, pleeeeese!!"
Re:Possibly... by OneDeeTenTee · 2006-02-12 00:13 · Score: 1

The security issues are easy to address.

1. Encrypt the files and then tack on a header that makes them look like a DRM protected video.

2. Name them something like XXXgirlzongirlz.wmv and then drop them into your share directory.

3. ???

4. Profit!

--
Stop the world; I need to get off.
Re:Possibly... by stanmann · 2006-02-13 07:12 · Score: 1

For Bonus stability, actually use a stego program to embed the encrypted volume in a wmv or avi.

--
Food not Bombs is a nice platitude but it breaks down when you notice that the Bombees are usually well fed

Not Linux, but try ZFS by duffbeer703 · 2006-02-11 17:10 · Score: 4, Interesting

ZFS has some built-in volume management & data integrity functions that would probably work for you. I don't believe that it is available for Linux, but is freely available via Solaris & OpenSolaris

http://www.sun.com/software/solaris/zfs.jsp

--
Conformity is the jailer of freedom and enemy of growth. -JFK

Re:Not Linux, but try ZFS by Anonymous Coward · 2006-02-11 18:02 · Score: 1, Informative

The data integrity of ZFS would be especially good for large binaries. ZFS stores checksums in a way (IIRC) that files can be error detected and repaired automatically across a RAID. I think this is part of Sun's "self healing" marketing push. It would also give a heads-up on a failing hard drive in advance to the actual failure.
Re:Not Linux, but try ZFS by larien · 2006-02-11 22:31 · Score: 1

Unfortunately, ZFS is still in a beta stage; I wouldn't want to trust anything critical to it just yet. Production release as part of Solaris is expected by the middle of this year, I think.

Ext3 or XFS. by MarkRose · 2006-02-11 17:10 · Score: 1

If you're not writing to the files extensively, ext3 is perfectly fine. If you don't need the data journalling, read `man mount` regarding ext3 mount options.

Personally, I'm using XFS for the same task. Why? Because it segments the filesystem, allowing segment locking instead of filesystem locking, which is nice if you're writing multiple big files at once. I've never had a problem with it.

If you are getting countless IO errors, have you done a `badblocks` on the disk? Have you tried a different IO card or disk?

--
Be relentless!

Re:Ext3 or XFS. by micheas · 2006-02-11 19:01 · Score: 1

XFS and JFS should both work fine. I also would suspect hardware problems as a possible issue.

Stacking drives and not adequately cooling them is another possible cause of your IO errors with XFS.

Another possibility is checking your hard disks with the manufacturers drive test utility.

--
Work bio at MMWD
Re:Ext3 or XFS. by baptiste · 2006-02-12 00:06 · Score: 5, Informative

Check out the latest. What? 2003? Haven't there been any bug fixes since then?
While it sucks you've lost data because of XFS, mant people use it heavily every day without issue (I'm one of them) I've deployed XFS across mail, database, and web servers without issue. Your statements about are total FUD. The reason the last 'release' was in 2003 is not long after that, XFS was accepted into the kernel itself. Thus there we no longer a need to 'release' XFS patches for the kernel. If you look at the command packages, you'll see them being updated on a regular basis.
As for bugs, I think your statement of bugs not being fixed is incorrect as well. Check the closed bug list. You'll see many that are being closed. Also, in your open bug list above, it does appear rather long. But MANY of those bugs are from users who opened a bug saying 'XFS Crashed On Me' and then never followed up with more info. The XFS developers haven't cleaned many of those out it seems. Bugs in the 200s date from 2003, bugs from the 300's from 2004. Late 300's and 400's from 2005.
So I hate you've had data loss - I wouldn't wish that on anybody (having experienced a RAID5 triple disk failure combined with backup tape failure. Thank goodness for OnTrack!) But don't post FUD about a filesystem that has performed very well for a lot of people and continues to be improved and innovative.

--
Top Most Bizarre/Disturbing Error Messages
Re:Ext3 or XFS. by Cthefuture · 2006-02-12 03:05 · Score: 1

Listen, just about everyone has lost data with every filesystem. I have personally lost ext3 partitions that needed to be fschk'd and which after running for days it was never able to recover from and it corrupted the whole filesystem. I have had similar experiences with ReiserFS. Also, at least in my experience ReisferFS (3) tends to badly corrupt files when shut down improperly (UPS failure, etc.). I have tried JFS many times in the past and it never seemed stable, of course it has been a couple years but I gave up on it because it was too flakey.

I also tried XFS many times and in the early days it was a CPU hog and unstable. However, that has changed in the last couple years and I have been happily using XFS for almost everything. I currently have a little over 1TB running on XFS with no problems at all and this stuff gets hammered with multiple VMware machines and stuff transferring huge amounts of data to and from the video machine. They have survived hard power-offs and all sorts of things without a single lost file. The performance is good also. Mounting a huge Reiser disk takes forever, XFS is instant. mkfs.ext3 would fail when I tried to create a 600GB drive. I'm not saying XFS is perfect but I haven't had any problems with it.

--
The ratio of people to cake is too big

XFS? by Shawn+is+an+Asshole · 2006-02-11 17:12 · Score: 1

I know you listed that as not working right, but I've had very good experiences with it. I do a lot of audio editing (~5G/song) and also run multiple VMWare images (all 2G+). No problems at all.

Are you sure your HD is good? What distro are you using?

--
"It ain't a war against drugs.it's a war against personal freedom" --Bill Hicks

Hand-tuned ext2/ext3? by Vo0k · 2006-02-11 17:12 · Score: 4, Interesting

Most fancy filesystems like ReiserFS are optimized for performance with lots and lots of tiny files where the disk reads little at a time, seeking, sorting, assembling, slicing etc take most of the time. Here you have few big files, so performance is your least worry - the harddrive read/write speed will be the bottleneck, and all the seeks, directory reads etc will be scarce and fast. Therefore the filesystem won't change much in the means of speed. (it MAY break a lot in the department, like, say compressed filesystems, but won't speed it up above what the harddisk does, and most of filesystems will perform just the same in the means of speed.) What you can do is to optimize the filesystem for capacity, reducing its overhead and allowing to get closer to "advertised disk capacity".

Just use tune[23]fs to reduce number of inodes significantly on the ext3fs. Or look for -simple- filesystems that don't do tricks in optimization of speed (because these usually waste diskspace), just store your files in a straightforward manner.

--
Anagram("United States of America") == "Dine out, taste a Mac, fries"

So just poo poo all the options then ask for one by thegrassyknowl · 2006-02-11 17:12 · Score: 5, Insightful

journaling methods to prevent data loss due to some natural disaster while data is being shifted around

Journalling doesn't do this!. Journalling helps reduce file system corruption in the event of a catastrophic failure while modifying the file system - ie, it's possible to bring it back to the last clean state before it crashed - journalling does not prevent data loss. You might say "well filesystem corruption and data loss are the same", but they are not. If the filesystem is corrupted, the data is not lost. It just becomes not easily retreivable. If the data is lost then it becomes entirely irretreivable.

I tried using SGI's XFS due to its promising details, but I was met with countless IO errors

Have you considered your hardware is shit? I use XFS on terabytes of raided disks and have been for more years than I remember... 5 or so? I don't see any I/O errors. XFS is very reliable and I trust it with my data.

I feel that Ext3 is not optimal for this

Well not all of your post was dumb!

ReiserFS is too slow when it comes to reading large data files

How is it slow? It takes a few microseconds longer to access the first data sector because it does some extra processing first? Give me a break. Filesystem performance for journalled filesystems is mostly bound by writing speed, and this is a function of how the journal is updated. I doubt you would notice the difference in read speed unless you ran a million tests over a million different files, took some sort of average for the filesystems and quibbled over a few milliseconds.

Reiser4 isn't mature enough to entrust my digital assets to

You entrust your assets digitally? Shit, why do you trust any filesystem? They are all buggy. Give me a break.

If you don't like it, keep backups on other media; buy a tape drive and a robot and get in bed with a good archiving company to securely store the backups. Don't come one here and poo poo all of the file systems known to man then tell me "is there anything better"? About the only 4 in common use you left out were JFS (good for large databases but not much use if you have a lot of small files), FAT[12/16/32] (not much good for anything really), NTFS (see FAT, but more complex) and ISO9660. I'll concede there are others, but if you want something that's in common use so you can actually retreive your data when the world turns to shit...

Anywho!

--
I drink to make other people interesting!

ZFS by ojek · 2006-02-11 17:14 · Score: 1

Have you looked at ZFS from Sun?

I believe at this moment it's only available on Solaris, but it's open source.

Personally I haven't had the chance to try it myself, but from what I read it seems really impressive, at least on paper. Might want to check it out.

Re:ZFS by mknewman · 2006-02-11 17:45 · Score: 1

I was going to suggest ZFS also, but I think it's still in pretty much a beta state, only available through OpenSolaris. It's largely oriented for data reliability and performance and should be available on Linux soon, as it's open source.

IMPORTANT - addendum by william_w_bush · 2006-02-11 17:15 · Score: 1

if solaris is an option go zfs. hands down best fs if you can run it, incredible flexibility and scalability.

also, you mentioned something about burning to dvd, filesystems won't really help with that, i'd look more into taring your filesystem/ fs segments into disc sized segments, then making extra par2 files for error-resilience.

really, backing up 2tb of live data is a f*ing nightmare however you look at it, usually when i reach that hurdle i just build another machine, copy over active data, and put the old box in the closet, a sort of living backup if you will, if i ever need to go back to it. at the cost of hd's now, especially with removable sata chassis's it's the only way to handle the sheer size of data you'll be dealing with.

--
The first rule of USENET is you do not talk about USENET.

You sure? by jd · 2006-02-11 17:16 · Score: 1

I know that's the case for metadata journalling, but if you use full data journalling, I would have thought you'd be fine.

Actually, I wouldn't use ANY filesystem for this sort of work. The files won't change in size and I doubt they'll be deleted. It would seem more sensible to battery-back the RAM on the computer and the hard drive, use a raw partition for the data and a "sequential index" database to figure out where the data starts and how long it is. Batteries guarantee that the state of the computer will never be lost (as the RAM is now non-volatile), so you won't need journalling, and if you can guarantee the data will never fragment, you don't need the overhead of a filesystem.

It would be better if you could find some way of using core memory, rather than magnetic disks or optical media. Magnetic disks have a lifespan of a decade or less, optical media won't even last that. Core memory, on the other hand, requires a refresh rate of about once a century and is unlikely to have significant errors within the remaining lifetime of western civilization. There are only two drawbacks. It's sloooow (anyone with a Commodore 64 disk drive? anyone care to imagine something that's about a hundred times slower?) and it's bulky (you could probably replace your CD collection - but you'd need to put the people on Rhode Island somewhere else first).

--
It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)

Re:So just poo poo all the options then ask for on by william_w_bush · 2006-02-11 17:21 · Score: 1

XFS has issues that go version by version. I've had volumes work perfectly for months suddenly start spitting out I/O spam after a simple kernel upgrade. My guess is poorly applied patches, as many xfs kernel patches are added downstream by a distro or kernel contributor (MM/gentoo). just my 2c, YMMV.

--
The first rule of USENET is you do not talk about USENET.

You have other issues. by Anonymous Coward · 2006-02-11 17:31 · Score: 2, Informative

I have a 3 TB XFS file system and a 10 TB XFS file system that are regularly accessed by multiple processes that read and write hundreds of gigabytes each without write errors and with excellent performance (several hundred MByte/s sustained). You may have other hardware or software issues if you're seeing errors with XFS. Try to figure out the root cause of your problem before you try another FS.

zfs is new by r00t · 2006-02-11 17:35 · Score: 2, Informative

As a general rule, the latest and greatest stuff will be full of bugs. Give zfs a few years before you trust it with anything critical.

Re:zfs is new by elmegil · 2006-02-11 18:04 · Score: 1

An official release before using it might be nice too.

--
7 November 2006: The day Americans realized corruption and incompetence weren't addressing 11 September 2001

Comparison of File Systems by NuclearDog · 2006-02-11 17:38 · Score: 5, Informative

Comparison of FileSystems (from Wikipedia)

Personally, I run two 300GB drives in RAID1 on UFS and am quite satisfied with it, but you seem to be incredibly, incredibly picky, so I'm sure you could find something wrong with it ;P

ND

--
This statement is forty-five characters long.

Re:Comparison of File Systems by Xenophon+Fenderson, · 2006-02-13 04:45 · Score: 1

I read that as "incredibly, incredibly pricky", but your working makes sense, too.

--
I'm proud of my Northern Tibetian Heritage

One more common filesystem: UDF by r00t · 2006-02-11 17:39 · Score: 1

UDF is commonly used on DVD media. I think it fits the criteria, being writable (unlike iso9660) but optimized for reading.

I/O Errors??? by Stephen+Samuel · 2006-02-11 17:53 · Score: 4, Informative

If you're getting lots of I/O errors with XFS, I'd be inclined to look at a hardware problem (unless the I/O errors consist of attempts to read past the end of the partition -- which could be caused by you manually specifying the partition size, rather than letting mkfs.xfs figure it out).

Like someone else said -- try using badblocks(8) -- or just use dd to make sure you can read the entire partition without errors.
Bad disks do happen -- even new ones. Production code in Linux is generally very stable, and (unlike with windows), you can usually start with the presumption that things like I/O errors are caused by real hardware problems of some sort (even if it's just bad/loose cables).

--
Free Software: Like love, it grows best when given away.

Re:I/O Errors??? by Anonymous Coward · 2006-02-12 02:41 · Score: 1, Informative

If a drive starts having visible bad sectors, it's time to dump it. Since about a decade, harddisks have spare sectors, which they use to remap bad sectors. If the drive encounters one bad sector, most of the time you can hear it seeking back and forth, trying to read it multiple times, and after it has given up, it'll return a read error and map in a spare sector in its place, which obviously will not have the right data, but instead will be filled with 0x00 or similar. To check if this is has happened, you can read the 'grown defects list'. In Solaris, 'format' can do this, in IRIX the showbb option of 'fx', and in DOS there were some tools available for free from Seagate for this. Somebody please tell me which tool Linux, xBSD or Windows have to do this. I don't have any experience with this new-fangled SMART, but it may be a good idea if you're already encountering errors.
Re:I/O Errors??? by cortana · 2006-02-12 02:58 · Score: 1

It sounds like the tools you list just read various SMART attributes. Smartmontools is easy to use:

$ smartctl -a /dev/hda

prints out everything you need to know about a disk. Smartmontools also comes with smartd, which sits in the background, monitors the disk's attributes and administers regular disk test. It will perform an action such as mailing you or running a script if anything happens that you need to know about.

The one thing it is missing is some kind of long term data storage facility, from which are derived predicted failure dates.

Keep it simple. ext2 or fat32. by Radak · 2006-02-11 18:09 · Score: 4, Interesting

If you're looking for a filesystem to archive things indefinitely, avoid exotic new kids on the block with limited OS support and even more limited toolkit support.

You want a filesystem you'll be able to read at any point in the future and, should the worst happen, one which you'll have a reasonable chance of being able to recover.

ext2 and fat32 tend to write files in nice large chunks and there are lots and lots of recovery tools for damaged filesystems. Journaled filesystems like to put little pieces all over the place, and recovery of a badly damaged filesystem is next to hopeless.

There is no call for a complex filesystem just because you want to store large files. ext2 (and to some extent fat32) will do just fine, and you'll be glad for them someday in the future when something breaks.

Re:Keep it simple. ext2 or fat32. by Stephen+Samuel · 2006-02-11 18:35 · Score: 1

I think that I could second this motion.
ext{2,3} can handle nice, big files, and just about any version of linux can read it. You can even get modules for the Microsoft world to read ext2 filesystems. If you're looking at a read-mostly filesystem, then journaling won't get you much (other than making for a fast FSCK if you lose power).
Just remember to specify '-T largefile' on the mke2fs command line to optimize for larger files.

If you haven't thought about it yet, I'd also suggest raid5 rather than raid0. It's worth the extra expense to be able to recover from a dying drive (which will happen, sooner or later).

--
Free Software: Like love, it grows best when given away.

a bunch of bullshit? by jaydubscott · 2006-02-11 18:50 · Score: 0, Offtopic

Look at this, "Six Sigma Security Inc." is an authorized sales agent for citywatcher.com... http://www.sixsigmasecurity.us /wst_page4.html Sounds like a bunch of hooey to me.

It worked for me by toadlife · 2006-02-11 18:53 · Score: 5, Interesting

Around 1997, I discovered the magic of mpeg-layer3. I hung out in #mpeg3 on effnet and was part of what was probably the first ever mp3 trading circle. An aquaintance of mine had a CD of the rare Nirvana/Jesus Lizard single, which had Nirvana's "Oh The Guilt" on it. I borrowed it from him and ripped it to wave and encoded it a 256KB mp3 and returned the CD. Over the next year or so, quite a few people nabbed the song from me during normal trading sessions in #mpeg3. Sometime later I made a boo-boo and lost a folder permanently, and one of the files in it was that song. I was bummed, as the person I borrowed the CD from was gone and the CD was long out of print and cost a lot of money if you happened to find a copy. I forgot about it.

Quite a few years later - I think ~2002, I was on some p2p app, typed in "Oh the Guilt" and got a hit. I downloaded it, and it was a 256KB mp3 of the song. The file modification date in 1997, and the tags were typed in exactly the I would have put them if I had encoded the song. I can't prove it, but I'm pretty sure I got my file back.

--
I don't always use unix-like operating systems; but when I do, I prefer FreeBSD.

Re:It worked for me by Vellmont · 2006-02-12 01:47 · Score: 1

I don't think there's much question it was the same file. No one but no one was doing 256 kilobit mp3 rips in 1997. If you wanted to be extra sure, I'd bet there's some kind of header information the mp3 encoding program leaves in the mp3 file.

--
AccountKiller
Re:It worked for me by Kuciwalker · 2006-02-12 14:01 · Score: 0

Obligatory bash.org quote:
<NES> lol
<NES> I download something from Napster
<NES> And the same guy I downloaded it from starts downloading it from me when I'm done
<NES> I message him and say "What are you doing? I just got that from you"
<NES> "getting my song back fucker"
Re:It worked for me by robpoe · 2006-02-13 10:38 · Score: 1

I've done the same. Except I use a specific comment tag when encoding MP3s .. I didn't encode 256k in 1997 .. i encoded in 192k...

--
= Grow a brain...

Re: ext3 works fine, did you try it? by Matt+Perry · 2006-02-11 18:54 · Score: 4, Informative

I feel that Ext3 is not optimal for this

Did you try it with ext3? I have 688G in a RAID5 array spread across four 250GB drives. I use ext3 and I store lots of large files (15GB free on the array right now). I have about 156GB of DVD images, mostly movies that I own and have ripped to watch using daemon tools on Windows. Some of them are rips of training video DVDs I bought for software that I use like Adobe Premiere and Audition. I frequently move large AVI files to and from the array for video projects that I'm working on. These files originate on my Windows box and can be as large as 13GB (for an hour of video footage). I've been using ext3 for years and it's never let me down or given me any problems.

--
Slashdot: Failed Car Analogies. Amateur Lawyering. Anecdote Battles.

I/O errors? You have bigger problems than filesyst by aussersterne · 2006-02-11 18:57 · Score: 1

Since nearly all modern drives remap bad sectors automagically unless they're in truly pitiful shape, I'd check my cabling, connectors, termination (depending on the storage hardware platform you're using). I/O errors are not the result of inappropriateness of a filesystem for a given task, they're the result of lost data long before your filesystem ever gets a chance to f*ck it up.

--
STOP . AMERICA . NOW

Re:Keep it simple. ext2 or fat32. by mnemonic_ · 2006-02-11 19:06 · Score: 1

You're saying that data recovery of journaling filesystems is worse than that of non-journaling ones? What is it that you know and that hundreds of ReiserFS, ext3, NTFS and XFS programmers don't?

Re:Keep it simple. ext2 or fat32. by dougmc · 2006-02-11 19:10 · Score: 3, Informative

There is no call for a complex filesystem just because you want to store large files. ext2 (and to some extent fat32) will do just fine

fat32 cannot handle files over 4 GB in size at all. That alone probably renders it totally unsuitable for this person's needs.

Beyond that, I'd say pretty much anything will work fine -- most of the optimizations found in filesystems are needed for lots of small files, not a few large files. For large files, the speeds they can be accessed by various filesystems are not likely to vary more than a few percent unless you let the files get fragmented (which probably isn't a big concern here.)

And you are right -- if something does go wrong, ext2 or ext3 will probably give you the most options for recovering it. NTFS probably has even more recovery options (and FAT even more, as mentioned), but I'm guessing the OS will be *nix. But really, if your goal is reliability, you don't want some esoteric filesystem that can recover from disk errors (because ultimately, none can, though I guess one could be designed to keep ECC codes on the same disk transparantly -- but I'm aware of no such filesystem existing) -- you want multiple copies of your data. Keeping 5-10% (or more) par2 files for your archive can help a lot in recovering it if your media goes partially bad, and having md5sums or CRC32s of all archived files can help determine if you did recover something accurately, but really there's little subsitute for multiple copies of important data in multiple geographical locations. (And no -- RAID is not a subsitute for backups, no matter how many mirrored drives you have. Not that I saw anybody suggest this yet, but it seems to always come up in response to questions like this, so consider this to be a premptive mention of that.)

My problem by Kaenneth · 2006-02-11 19:12 · Score: 1

I can't seem to make a growable RAID 5 configuration; at least not growable in a useful way.

My plan was to migrate the small striped set to a larger set that included the old drives by making a raid set of the new drives, copy the data, then add drives (I would call that 'Horizontal'); or move all the data to the tops of the drives, and make a raid set of the lower segments of all the drives, and expand the segments. ('Vertical')

I'm up to using EVMS; but no useful Expand options appear to be available for RADI4/5 MD region manager, all I can do is concatenate space to the array, not expand the array itself. Except I once got it to add and resync a 'drive' to the array, making the cpacity of the array larger... except that only happened when I tried making the raid area only the first 100 gigs of each drive, and grow the segment size uniformly... instead of growing the segments, it added a link region of the empty space on the drives as if it were a single seperate drive...

Re:My problem by Anonymous Coward · 2006-02-11 20:09 · Score: 0

.... You can't just copy the data over. The controller or software should have utilities to do it for you, there is no other way.

All of the disk sizes need to be the same size, or they default to the smallest of the drives. AKA, don't put a 9gb in there with a bunch of 73's, else the 73's count as 9's.
Re:My problem by Kaenneth · 2006-02-11 21:12 · Score: 1

The thing is, I got it to happen once, with the wrong configuration; so I know it's possible... I just can seem to do it when I want to.
Re:My problem by Andy+Dodd · 2006-02-12 04:30 · Score: 1

http://ask.slashdot.org/article.pl?sid=05/11/26/03 37226

Look for swillden's post a little bit less than halfway down. This is the approach i'm using now.

In short - LVM on top of multile "small" (50 GB per drive) RAID 5 partitions. LVM will let you automagically move all data off of a given "physical volume" if there is sufficient free space on a volume group. Note that in this case the "physical volume" is not actually physical, but is a RAID 5 mdX device instead. Once all data is moved off of one of the mdX volumes, you can nuke that array and rebuild it with an additional drive.

--
retrorocket.o not found, launch anyway?

Ext2 rw,sync by evilviper · 2006-02-11 19:14 · Score: 4, Interesting

What I'm looking for is a WORM-optimized FS that also has good journaling methods to prevent data loss due to some natural disaster while data is being shifted around.

Ext2fs mounted with the 'sync' option.

For large sequential writes, nothing could possibly be more reliable or any faster. Your hard drive's pure IO speed will be the bottleneck unless you are writing to multiple files simultaneously, in which case fancy filesystems come in handy.

If that doesn't suit your needs, you haven't described them well enough for anyone to understand.

I feel that Ext3 is not optimal for this;

I feel hungry.

--
Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant

Re:Ext2 rw,sync by yeremein · 2006-02-12 03:51 · Score: 1

Ext2fs mounted with the 'sync' option.

For large sequential writes, nothing could possibly be more reliable or any faster. Your hard drive's pure IO speed will be the bottleneck unless you are writing to multiple files simultaneously, in which case fancy filesystems come in handy.

But ext2 (and ext3) store a block list that grows in proportion to the size of the file itself. That's fine for small or highly fragmented files, but it's a waste of space and I/O bandwidth when you have big, unfragmented files.
Re:Ext2 rw,sync by sconeu · 2006-02-12 06:14 · Score: 1

I can attest to this. I was backing up an image of an 80G drive. It took forever... I suspect a tree-based file system would be much better.

--
General Relativity: Space-time tells matter where to go; Matter tells space-time what shape to be.
Re:Ext2 rw,sync by evilviper · 2006-02-12 12:15 · Score: 1

But ext2 (and ext3) store a block list that grows in proportion to the size of the file itself. That's fine for small or highly fragmented files, but it's a waste of space and I/O bandwidth when you have big, unfragmented files.

You can use a large block-size and greatly reduce the ammount of wasted space and overhead (which is rather small to begin with, actually). You've got to expect that kind of overhead from just about any filesystem, and something like journaling will only add more.

If you've got something else in mind, feel free to speak-up.

--
Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
Re:Ext2 rw,sync by yeremein · 2006-02-12 13:00 · Score: 1

You can use a large block-size and greatly reduce the ammount of wasted space and overhead (which is rather small to begin with, actually). You've got to expect that kind of overhead from just about any filesystem, and something like journaling will only add more.

If you've got something else in mind, feel free to speak-up.

Using large blocks reduces block list overhead at the cost of internal fragmentation. In other words, big blocks mean more wasted space in the last block of each file.

A better approach would be to patch ext3 to store a list of contiguous extents instead of a list of blocks for each file. With the current file system, the block list might look like this:

1, 2, 3, 4, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20

Whereas with an extent system (which both NTFS and HFS+ use, incidentally), all you have to store is this:

1-4, 10-20

I did some Googling and apparently some work has already been done to get extents into ext3, motivated by the desire to eliminate slow deletion (which I've noticed... deleting a 160GB hard disk image took minutes on ext3, but was instantaneous on NTFS).
Re:Ext2 rw,sync by evilviper · 2006-02-12 17:07 · Score: 1

In other words, big blocks mean more wasted space in the last block of each file.

Yes, I assume this would be a non-issue, given this situation.

I did some Googling and apparently some work has already been done to get extents into ext3

That's an very interesting paper. It does, however, seem to refute your claim that block allocation is "a waste of space and I/O bandwidth":
http://ext2.sourceforge.net/2005-ols/paper-html/no de25.html

It would seem that extents will only really increase performance when CPU time is rather limited.

deleting a 160GB hard disk image took minutes on ext3, but was instantaneous on NTFS).

I really wouldn't take that as evidence of much. NTFS could be benefiting from architectural differences, hiding the real time taken.

--
Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
Re:Ext2 rw,sync by Kadin2048 · 2006-02-13 07:54 · Score: 1

I did some Googling and apparently some work has already been done [sourceforge.net] to get extents into ext3, motivated by the desire to eliminate slow deletion (which I've noticed... deleting a 160GB hard disk image took minutes on ext3, but was instantaneous on NTFS).

Very interesting. I've noticed the exact same thing, although I was comparing an HFS+ Mac to an ext3 Linux box. They both had the same data on them (rsync'ed from the same server), consisting of several large files; deleting them on the Mac was basically instant, which is what I'm used to, while deleting them on the Linux machine caused a noticeable delay. The delay was not present when deleting large numbers of small files. I suspected something like this was the case, although for some reason I was laboring under the impression that ext3 was extents-based.

I noticed nobody had pointed out HFS+ until this. Although it's not widely used (except on Apple systems), it is available under the APSL (FSF approved, although GPL incompatible) and there is support for it on Linux and BSD, although I'm not sure if it's something I'd want to use on a production system yet; it exists now more as a way for people to access their iPods, I think. I've never seen any hard metrics on how it performs, but it does do journaling and it has a reasonably good reputation as a workstation FS since it was introduced.

--
"Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."
Re:Ext2 rw,sync by runderwo · 2006-02-13 11:55 · Score: 1

Even better, you can recover deleted files on ext2 with automated tools. This is impossible with ext3 and one must resort to disk editing, which is impractical for the type of data the AP is storing.

--
LRC, the best-read libertarian site on the web
Re:Ext2 rw,sync by evilviper · 2006-02-13 13:13 · Score: 1

Even better, you can recover deleted files on ext2 with automated tools. This is impossible with ext3

Just about anything that works on Ext2, should also work on Ext3. They are 100% compatible.

On reiser, you can also pretty easily recover deleted files with, eg., reiserfsck --rebuild-tree and the like. If you do it with your actual partition, you risk completely hosing it, and destroying other data on there. So you either need a second disk of the same size or larger, or perhaps you could use qemu's copy-on-write feature.

--
Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
Re:Ext2 rw,sync by runderwo · 2006-02-13 17:51 · Score: 1

Just about anything that works on Ext2, should also work on Ext3. They are 100% compatible.
Yup, just about anything. Except undeleting. The ext3 FS driver zeroes metadata when a file is deleted, making automatic recovery impossible; the ext2 driver does not do this.

--
LRC, the best-read libertarian site on the web
Re:Ext2 rw,sync by Anonymous Coward · 2006-02-14 05:05 · Score: 0

Here are some articles on the subject: idi

Re:Keep it simple. ext2 or fat32. by Radak · 2006-02-11 19:22 · Score: 1

You're saying that data recovery of journaling filesystems is worse than that of non-journaling ones?

I did say on a badly damaged filesystem. Simple filesystems tend to write files out in larger contiguous segments, which makes them worlds easier to recover when something utterly trashes your filesystem.

Yes, for typical day-to-day power loss type filesystem damage, journaling is great, but if I'm having to try to recover data from a filesystem that's lost 50% of its bits, I want it to be ext2 or fat.

Just out of curiousity by amliebsch · 2006-02-11 19:36 · Score: 1

What's your complaint with NTFS, other than that it is closed-source? Are there any reliable comparisons of NTFS with traditional open-source alternatives?

--
If you don't know where you are going, you will wind up somewhere else.

Re:Just out of curiousity by Anonymous Coward · 2006-02-11 19:59 · Score: 0

I lose NTFS just as much as my ext2/3 partitions?
Re:Just out of curiousity by thegrassyknowl · 2006-02-11 21:39 · Score: 1

No complaint apart from the fact it's from Microsoft and they have a history of being unreliable... and there is no way to write NTFS unless you're using Windows.

I haven't seen any unbiassed comparisons of NTFS against some of the other file systems, but if you want to limit your liability I'd go with something that's open. MS have a habbit of breaking backwards compatibility, and Vista seems to want to do a lot of that. Who says the version of Windows in the future will support today's version of NTFS?

At least with an open system you could actually re-engineer support into the OS of the future if you ever needed it and it wasn't there.

--
I drink to make other people interesting!
Re:Just out of curiousity by dtfinch · 2006-02-11 22:43 · Score: 1

Just personal experience here. I have a Windows 2003 server where the drive was something like 83% fragmented after only about 3 months of use. I ran defrag on friday and was surprised to see just a tiny blip of red, with the entire rest of the drive being unused. The drive was only about 10% full, but most of it was data files which grow constantly. Most large data files all had thousands of fragments each. Windows could not have done a worse job at avoiding fragmentation. It's not slow yet, as the database pretty much fits in ram, but what it's doing is generally very bad. This was exactly the problem with FAT32, and I've seen nothing to suggest that NTFS is any better in terms of avoiding fragmentation, despite Microsoft's promises. Luckily NTFS can be defragged without taking it offline, making it acceptable so long as you don't need good performance 24/7.
Re:Just out of curiousity by cyber-vandal · 2006-02-12 04:18 · Score: 1

Yes it can but if anything on the file system changes the defrag starts again and I don't know about you but it drives me nuts.
Re:Just out of curiousity by StillAnonymous · 2006-02-12 04:33 · Score: 1

If you use a commercial defragger, this isn't a problem. The one they provide with the OS is absolutely terrible.

Check out Raxco PerfectDisk, or O&O Defrag. They're both pretty good.
Re:Just out of curiousity by wanion · 2006-02-12 08:54 · Score: 1

I always liked this comment by the VirtualDub author (quoting only part):

It's fairly well known that the NT file system (NTFS) is very bad at avoiding fragmentation, partly due to its allocation strategy of intentionally placing tiny gaps between files -- which is good if those files expand, but bad if they don't. Today, I see this in a fragmentation analysis report of my hard drive:
Fragments 111
File Size 444KB
Name WINDOWS\$NtServicePackUninstall$
The cluster size of the hard drive partition is 4K. This means that NTFS has successfully managed to create a huge directory in which not a single pair of clusters are sequential. I used to think that the Amiga standard file system was bad, but this takes the cake.
Re:Just out of curiousity by dtfinch · 2006-02-12 10:29 · Score: 1

That was a terrible problem, but I haven't seen it outside of the Win9x line.
Re:Just out of curiousity by shibashaba · 2006-02-14 11:51 · Score: 1

There is a driver available for linux that'll let you write to NTFS. I can't remember who makes it but it's not free and different from the read only one usually included in your post. I do agree with everything else you said, NTFS is crap.

--
---------- Open Source is capitalism applied to IP.
Re:Just out of curiousity by thegrassyknowl · 2006-02-14 15:13 · Score: 1

The only writing driver for NTFS that is any good is the one Microsoft themselves wrote. All the others are well known for causing massive filesystem corruption. The Linux kernel one can't even create files - you can only modify existing files so long as they don't get larger.

Not so much of an issue if you're backing up using a Windows box and only need to read the data later. Big issue if you're using another OS to do all your backing up.

--
I drink to make other people interesting!
Re:Just out of curiousity by xpyr · 2006-02-16 11:47 · Score: 1

Maybe it has something to do with ntfs permissions and that you can't modify them easily from a linux environment. Plus the administrators uid in windows is always different with each installation compared to linux where root has a uid of 0 I believe. Feel free to correct me though.

--
My Gawd WTF...

HFS+, Journaled by amper · 2006-02-11 19:48 · Score: 1

At the risk of sounding silly, I will suggest HFS+, Journaled (but without ACL's). This, of course, requires that you use Mac OS X. However, some very good disk repair utilities exist for Mac OS X. As far as performance is concerned, it should be capable of easily handling uncompressed NTSC video without dropouts in a proper configuation, so I can't see that it would be a problem for storing DVD's.

Why bother? by Anonymous Coward · 2006-02-11 19:51 · Score: 0

I own hundreds of gigabytes of binary data... What I'm looking for is a WORM-optimized FS that also has good journaling methods to prevent data loss due to some natural disaster while data is being shifted around.

You don't need all that razzmatazz. If you lose your pr0n, just download it from bittorrent again.

Retry XFS by Anonymous Coward · 2006-02-11 20:18 · Score: 0

I would retry XFS, I've been using it for decades to serve video files, ISOs and other large files with no problems...if you end up using XFS, make sure you have a UPS - though you say that your application is write-once-read-many, so XFS caching writes isn't a problem - and XFS has aggressive caching that will increase performance and extend drive life by reducing load on the disk drives.

Consider making your files{,ystem} read-only, and only writing when you need to add something. That'll protect (somewhat) you against problems when you update your kernel (not that I've ever seen any of those). Don't feel obliged to constantly update your OS to the latest version...stay a few versions behind and try to find out if anyone else has any problems before you update.

Also, consider using RAID1 to backup your data - disks are cheap, RAID1 will automatically copy your filesystem to another device, and many controllers allow drives to be hot-unplugged these days so you can use the redundant devices as a backup. With s/w RAID1, you can even use a single cheap, big, slow drive to mirror another faster RAID array, then unplug it and store it off-site somewhere.

Yes, I'd choose to get XFS working. ...I'm no expert, but I feel compelled to give my opinion.

Re:Retry XFS by dtfinch · 2006-02-11 22:50 · Score: 1

I like filesystems that are generally ok to use without a UPS, but on top of that I use a UPS anyway, and disable write caching.
Re:Retry XFS by dwater · 2006-02-12 13:00 · Score: 3, Insightful

I never run a computer without a UPS. IMO, they should be built into the power supplies (I wonder if you can buy such things...). I guess I've had too many computers die on me due to power issues, but power can be very unreliable in my location. UPSes are so cheap it's not worth it to *not* use them.

--
Max.
Re:Retry XFS by dwater · 2006-02-12 13:14 · Score: 1

> I never run a computer without a UPS ... I've had too many computers die on me due to power issues

I suppose that could be considered a contradiction, however, there was a period of time over which I decided UPSes are essential.

I notice that some RAID cards even have battery packs on their RAM. I look at a UPS in the same light...

In any case, he's only doing one write (*WO*RM), so I think he'd do better making his filesystem read-only anyway, and only making it writable on the (rare) occasions he wants to write (mount -o remount,rw /video or something).

--
Max.

Re:I/O errors? You have bigger problems than files by cowbutt · 2006-02-11 20:21 · Score: 1

Since nearly all modern drives remap bad sectors automagically unless they're in truly pitiful shape

Not quite automagically - every time I've had a bad block develop, the drive has only remapped it on writing to it. Reading just returns various types of error (makes sense - if you're trying to recover the block in question, it might read successfully one time in a thousand, then you can write it back, and all is well again). I'm pretty sure that SCSI can return warnings that a block was hard to read, allowing the OS to rewrite the block. Evidence I've seen suggests *BSD makes use of this approach to try and pre-empt failed blocks.

Re:Keep it simple. ext2 or fat32. by perbu · 2006-02-11 20:32 · Score: 1

ext2/3 will use triple indirect adressing to reach blocks when the filesize reaches a few gigabytes. If performance is more important then stability I would go with a filesystem that supports extents - like reiserfs or jfs - both of which are resonably stable.

Re:Keep it simple. ext2 or fat32. by Theatetus · 2006-02-11 20:51 · Score: 2, Informative

You're saying that data recovery of journaling filesystems is worse than that of non-journaling ones? What is it that you know and that hundreds of ReiserFS, ext3, NTFS and XFS programmers don't?

Nothing, the programmers will tell you that themselves. Journalled filesystems aren't for protecting your files. They're for protecting your filesystems.

The point of a journal is that you can roll back to a consistent state of the filesystem easily in case of error -- not that you can roll back to a consistent state for a given file (or indeed any file on the filesystem). In point of fact, it's usually more difficult to recover actual data from a journalled filesystem than a traditional one, because the writing process is much more complex. What's more, if an automatic fsck is needed, it's actually a little more likely to lose some data on a journalled filesystem because a non-journal filesystem recovers based on the found files while a journalled one recovers based on a separately recorded journal (you do put your journals on a separate block device from the filesystem, right? Otherwise you're mostly wasting your effort.)

The main point of a journalled filesystem is that when both redundant circuits in your datacenter go at once (which happens) the boxes will be able to at least boot and get through the automatic fscks without a tech needing to drive out there and run the fscks himself.

--
All's true that is mistrusted

Re:HFS+, Journaled by John+Nowak · 2006-02-11 21:07 · Score: 1

Why does it require OS X? Linux deals well with HFS+ in my personal experience.

You're right about the tools though. DiskWarrior is *godlike*. I cannot believe the stuff it has pulled off... stuff that made fsck cry.

WORM by passthecrackpipe · 2006-02-11 21:34 · Score: 1

If you need WORM, it will usually be for compliance purposes. And if you need it for compliance purposes, the choice of filesystem simply isn't yours. Netapp, EMC, IBM and others all provide out-of-the-box WORM solutions for not to much money that are certified WORM devices - i.e. you can turn around to your local regulator with a piece of paper in your hand that proves your WORM is really WORM. From my experience with Netapp WORM devices, you are going to be stuck with XFS - incidentally, I spent a good portion of my time yesterday trying to recover from a broken XFS filesystem on a Netapp device, 4 day after our support contract ran out (isn't it always like that?) - trying to recover an XFS filesystem is like taking sandpaper to the platters......

--
People who think they know everything are a great annoyance to those of us who do.

ISO9660 by Handyman · 2006-02-11 22:41 · Score: 1

Well, I guess the best filesystem for this kind of stuff is ISO9660. Very optimized for WORM access, no file fragmentation, and anything new you write will not destroy any existing data. Guaranteed.

WORM optimized FS? by scotty1024 · 2006-02-11 22:52 · Score: 1

The only WORM optimized FS that I know of is UDF.

Of course you mention several FS' that have no support for WORM so I'm doubtful if you are serious about using WORM media?

UDF is an open international standard that supports writing data to ROM/WORM/WMRM and sequential access only media (tape). It completely optimized to support journaling/versioning through the properties of WORM.

DVD's use a ROM version of UDF.

UDF can also exploit the UDF on DVD's to create a hybrid FS of the DVD's on a UDF encoded WORM media that points to the various files on the UDF DVD disc filesystems and then allows versioning of those files on the WORM storage ie you can make updates to the ROM files via the WORM media.

If you were serious about using WORM you should check it out.

Re:WORM optimized FS? by jZnat · 2006-02-12 05:10 · Score: 1

Popular consensus here regarding the best WORM system to use is UDF/ISO9660, but then my question would be: how do you manage ISO-9660 data on a hard drive? I know there's mkisofs and whatnot, but any documentation on using ISO-9660 on something other than permanent storage mediums would be appreciated.

--
'Yes, firefox is indeed greater than women. Can women block pops up for you? No. Can Firefox show you naked women? Yes.'
Re:WORM optimized FS? by scotty1024 · 2006-02-12 10:04 · Score: 2, Interesting

Which is why I asked if the poster was serious about using WORM or not.

ISO-9660 is not the same as UDF. If you have UDF and ISO-9660 on the same volume it is because some one mastered a hybrid filesystem structure onto the disc. Which was the norm on first generation DVD's.

ISO-9660 contains no optimizations for being a WORM filesystem, there are no linking records in ISO-9660 to allow re-writing of data into new blank spots on the non-rewriteable storage media, UDF supports these linking blocks.

When I wrote a UDF filesystem for Linux I tested it by building the structures into blocks on hard disk storage devices. It can be done but UDF isn't designed as a high performance FS, but rather as a highly interchangeable FS.

For all my WORM disks... by wfberg · 2006-02-11 23:22 · Score: 2, Informative

For all my WORM disks, I use either ISO 9660 or ISO 9660/UDF bridge format.
Yeah, I simply burn CD-Rs or DVDs. DVDs have the nice property of being easily stored off-site. And files are in nice large contiguous block so even if the filesystem dies you can still recover a lot. Unlike XJFReiFS 2.3.1.5, DVDs will be readable in 50 years time.
And if you need to burn really large files, just use, well, zip. And perhaps some par2 files.

Though, seriously, they're coming up with a UDF variant for hard drives too.

--
SCO employee? Check out the bounty

Re:For all my WORM disks... by Slashcrap · 2006-02-12 21:20 · Score: 1

Unlike XJFReiFS 2.3.1.5, DVDs will be readable in 50 years time.

So, do you actually have any 50 year old DVDs that you can still read? Or are you just hilariously naive and optimistic?
Re:For all my WORM disks... by wfberg · 2006-02-13 10:16 · Score: 1

Unlike XJFReiFS 2.3.1.5, DVDs will be readable in 50 years time.

So, do you actually have any 50 year old DVDs that you can still read? Or are you just hilariously naive and optimistic?

So, what do you guess the likelihood is that a filesystem supported by
* millions of home-cinemas
* millions of PCs and macs, including windows, linux, mac-ox, unix, etc.
* millions of playstation-2's
that's being used by a whole lot of people to archive stuff, not even counting the millions of discs sold with content already on them..
will be readable in 50 years, as compared to some obscure filesystems that only work on some systems and that you usually have to compile from source code?

Hmmm...

I'll take my chances and go with UDF+ISO9660. I mean.. It's ever so slightly less hilariously naive and optimistic than the other options by a factor of, oh, a million or somthing.

--
SCO employee? Check out the bounty
Re:For all my WORM disks... by grumbel · 2006-02-14 09:27 · Score: 1

### I'll take my chances and go with UDF+ISO9660.

I think the point was more that the DVDs itself will be unreadable, so it won't help much when the filesystem on them might still be supported. Even DVD-RAMs have an expected life-time of 30 years only, some DVD-R are rumored to self destruct after just 2-5 years.

Re:So just poo poo all the options then ask for on by Anonymous Coward · 2006-02-11 23:23 · Score: 0

XFS is very reliable and I trust it with my data.

Man, this can't be over-emphasized. I switched a few dozen TB of data from Ext3 to XFS and my once-in-every-couple-of-months corruption problems just went away magically.

People love to do laundry lists with filesystem features, but "peace of mind" is the one item that XFS brings that the others don't.

Data journaling by Anonymous Coward · 2006-02-12 00:50 · Score: 0

Quoth the poster:

I know that's the case for metadata journalling, but if you use full data journalling, I would have thought you'd be fine.

Seeing as the POSIX API isn't transactional, data journalling doesn't really guarantee a whole lot about the integrity of your data. It only really guarantees that your files won't contain "random" data after a crash (e.g. data from other files). It certainly cannot guarantee that data will still be readable by your application in any meaningful way -- in many (most?) cases a partially updated file is as useless as a completely garbled one.

Re:Data journaling by jd · 2006-02-12 16:04 · Score: 1

Accepted, though POSIX is getting archaic these days. Non-volatile RAM would seem to address the issue of lost data, though. Well, thinking about it further, you'd also need battery backup on the CPU and a sleep mode for it, or you'd lose the state of the CPU registers and any data in cache that had not (yet) been copied into RAM.

--
It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)

sync mounted FS, no write cache by ByTor-2112 · 2006-02-12 01:19 · Score: 1

First of all, you shouldn't worry about losing data while "shifting it around". The source data shouldn't be unlinked until the operation has completed. A "sync" mounted filesystem on a drive without write caching should guarantee that the data has actually been written. Then the biggest threat to your data is going to be drive failure, which can be lessened by the use of RAID5.

As for which filesystem, I would humbly suggest the tried-and-true time-tested UFS. I don't think UDF is really what you want, because it sounds more like you are writing data and modifying it "infrequently" instead of just writing it once.

Suggestion: by warrax_666 · 2006-02-12 01:26 · Score: 1

(I'm assuming software RAID here, obviously.)

Use LVM on top of MD/RAID. When enough of your physical drives have the requisite extra free space you can just create a *new* MD volume from the free space and add that to your logical volume. Example: If you had a 100GB drive with 1 partition in the RAID and replace it with a 150GB drive, say, you'd just allocate 100GB to 1 partition and allocate 50GB to a second partition, readding the first partition into the old RAID volume and waiting for it to rebuild. Do this for all your new drives. When enough of your drives have unused 50GB partitions you just create a new MD/RAID volume out of those and add that MD volume to your LV(s). Simple, really.

--
HAND.

TAR files written to raw partitions by vrmlguy · 2006-02-12 01:26 · Score: 4, Funny

Don't laugh! Most (if not all) filesystems are optimized to handle the opposite of what you want. TAR files are designed for tape, so you won't be seeking all over the disk to get meta information, instead you'll get your data at the maximum speed supported by your hardware. TAR files are designed so that you can append files to them later, so you can use a *big* partition and just keep dumping stuff into it.

The only drawbacks are that you have to read the entire partitioin sequentially to find things, and you can't delete files. Both of these can be fixed with a bit of Perl. Write a program that maintains an index of offsets to the files, then you can use "dd" to skip to the correct offset and read from there. More dangerously, write a program that deletes files from the middle of an archive and shuffles everything backwards to fill in the gaps. You'll want to make sure that no one is trying to read the TAR partition while this is running.

--
Nothing for 6-digit uids?

Re:TAR files written to raw partitions by Anonymous Coward · 2006-02-12 03:29 · Score: 0

That is the craziest shit I've heard this week!

Lets take a nice, fast randomly accessible chunk of digital storage and treat it like a big spool of magnetic tape!

It sounds like you've got what they call "just enough knowledge to be dangerous".

Who cares? by countach · 2006-02-12 01:34 · Score: 1

I can't see why it would matter one whit what file system you use. The only problem you list is ReiserFS being too slow (which I doubt, but anyway...). Is speed the worry?? Well, just pick a bigger block size. Make a 64kb block size or something, and just go with whatever file system you would normally use.

Well by Anonymous Coward · 2006-02-12 02:06 · Score: 0

I thought this was going to be a decent question, then

I tried using SGI's XFS due to its promising details, but I was met with countless IO errors after trying to write large amounts of data to it.

Whoops, this is just a troll article. Sorry, nothing to see here.

Your usage scenarios makes no sense by heinousjay · 2006-02-12 02:08 · Score: 0, Flamebait

I'm willing to bet, based on the logical inconsistencies of your post, that you are infringing on copyright by downloading movies, music, and other entertainment from various internet sources.

I could be wrong, but I doubt it.

--
Slashdot - where whining about luck is the new way to make the world you want.

Re:Your usage scenarios makes no sense by Anonymous Coward · 2006-02-12 02:47 · Score: 0

No doubt. He probably wouldn't be too terribly concerned about losing the data if it was actually backups of CDs and DVDs he owned.
Re:Your usage scenarios makes no sense by swilver · 2006-02-12 04:46 · Score: 1

Of course not, watching movies doesn't take a lot of bandwidth, so why would he want to have such high speed?
Now if he asked for a solution that was safe, encrypted and fast enough for streaming media...

Re:So just poo poo all the options then ask for on by Anonymous Coward · 2006-02-12 03:54 · Score: 0

I know, I know, *BSD is dying and all, but it's still silly to leave out UFS :)

tune2fs won't work for that by Andy+Dodd · 2006-02-12 04:21 · Score: 2, Insightful

The number of inodes in a filesystem is an option that can only be specified upon filesystem creation. i.e. to mke2fs, not tune2fs.

--
retrorocket.o not found, launch anyway?

UDF is the correct answer by turgid · 2006-02-12 04:24 · Score: 2, Informative

What you're looking for is Universal Disk Format or UDF.

It is an open standard supported by all of the major OSes and manufacturers and is the filesystem of choise for Ultra Density Optical WORM and rewritable disks.

There a drivers for Linux, Windows and all of the major UNIXes. Here is the obligatory Wikipedia entry.

Hard disk filesystems like XFS, JFS, Reiser, ZFS etc. are all wonderful at what they do but they are unsuitable for WORM disks.

--
Stick Men

Re:HFS+, Journaled by MrHanky · 2006-02-12 04:33 · Score: 1

Eh, if you read the various forums for Mac these days, the fix for most problems is always suggested to be: fsck, then fix permissions. I've yet to see another OS that need a 'fix permissions' tool at all. Oh, and HFS+ has lost me important system files. This with the most recent version of Panther. It doesn't impress me at all.

Re:I/O errors? You have bigger problems than files by Tacvek · 2006-02-12 04:41 · Score: 1

A drive will not remap on a failed read. It will remap on a read that was successfull, but was almost unsucessful, it will also remap on a write that was unsuccessful.

--
Stylish sheet to fix many problems in Slashdot's D3: https://gist.github.com/801524

Not funny - F/OSS ports do exist by Anonymous Coward · 2006-02-12 04:44 · Score: 0

Why'd that get modded funny?

The Lucene/Nutch guys have a port that's usable for some apps.

Since F/OSS ports exist, it seems it was very insightful, and I if there was something funny about it, I missed it.

Delete performance - large files by Steve+Hamlin · 2006-02-12 04:54 · Score: 2, Informative

Others have said good things in general (XFS,JFS,ext3).

I looked into filesystem comparisons in setting up a MythTV box.

My issues were:
(1) efficient use of hard drive space, and
(2) performance.

Efficient use = filesystem settings have a big effect on amount of usable space.

For ext2/3:

-m 0 = setting 'reserved space for root' to 0%. Default is 5%, which can be 10-20 GB these days, all unusable to non-root users

-T ____ = can tell ext2/3 to optimize inodes and byte-per-inode for different size average files. Largefile versus news spools (tons of small files). Because of the way that a file can be spread out and mapped across the filesystem, this has an effect on 'wasted' space, and maybe performance (# of inode entries per file to lookup).

-b, -i - can set total # of inodes and bytes-per-inode directly. Advanced control over filesystem creation

I never got around to looking into this detail for XFS/JFS - they seem have fewer such options.

Performance I'll leave it to others to talk about filesystem performance with largefiles in general.

MythTV takes a lot of writing, and as it turns out, deleting, of large temporary files for the TV features (records, pause, FF/RR). After some reading online, I've found MythTV performance is drastically impacted by filesystem choice due to all of the deleting.

http://www.mythtv.org/docs/mythtv-HOWTO-24.html#ss 24.2

http://www.gossamer-threads.com/lists/mythtv/users /52672 ---SNIP---

> My last reply to myself. Based on a Googled reference, I was able to
> break my XFS 4G file size barrier by formatting the partition 'mkfs.xfs
> -dagsize=4g'. So, here are the complete results:
>
> Time to delete a 10G file, fastest to slowest:
>
> JFS: 0.9s, 0.9s
> XFS: 1.3s
> EXT3: 1.4s, 2.3s
> EXT2: 1.6s
> REISERFS: 6.2s
> EXT3 -T largefile4: 5.9s, 10.2s
>
> After running the XFS test, there didn't seem to be any point in
> reformatting the partition again, so I left it on XFS, but I think I
> would be happy with JFS, XFS, or EXT3 w/o '-T largefile4'.
>>>>
wepprop at sbcglobal
Feb 8, 2004, 2:33 AM
Post #21 of 22 (4121 views)
Re: Changing filesystems? [In reply to]
Robert Kulagowski wrote:

> Interesting. If others care to weigh in, I can either re-write the
> "Advanced Partitioning" section in the HOWTO, or whack it completely.
>
> William, can you give some background on the hardware used for your
> tests? I'd be curious if this data holds up across various drive types,
> LVM, etc. (Without trying to exhaustively test all the possibilities,
> that is)

It appears, based on my personal experience alone, that file deletes are
the only system operations that can stress the hard drive enough to
produce dropped frames. Unfortunately, as others have pointed out,
recordings and deletions go together in Myth. So, unusual as it may be,
it does make at least some sense to take file deletion performance into
account when deciding which filesystem to use for a video partition,
especially for people with multiple tuners.

The really ironic result from my personal perspective is that it would
appear that using the '-T largefile4' setting for ext3, which I was so
pleased with because it give me an extra 2G of storage, may well have
been responsible for all those recordings I had ruined by frame drops.

Assuming it works out, though, I could really get to like this XFS
filesystem because it appears to give me slightly more storage space
than ext3 w/ '-T largefile4' did and it has pretty fast deletes as well.
---SNIP---

Re:So just poo poo all the options then ask for on by Anonymous Coward · 2006-02-12 04:58 · Score: 0

Just another vote of confidence for XFS.

In a scientific application, I stream data from ADC boards to an XFS partition at a rate of 40-60 MB/s (depending on hard drive and write location). The data goes into a single file which eventually fills the partition (typically 250-300 GB) in a few hours. The data is then scanned through a few dozen times during analysis. No XFS problems yet.

I also experimented a bit with ReiserFS and JFS before deciding to stick with XFS.

PFS by JamesTKirk · 2006-02-12 05:04 · Score: 2, Funny

You're obviously looking for a filesystem optimized for porn. I'm impressed that you've managed to accumulate hundreds of gigs of the stuff. Perhaps there is a Porn File System out there somewhere?

Re:PFS by JDWTopGuy · 2006-02-12 11:41 · Score: 1

I can't believe the first porn joke (visible at 1+ anyway) was almost at the end of the page.

--
Ron Paul 2012

Re: ext3 works fine, did you try it? by jZnat · 2006-02-12 05:05 · Score: 2, Informative

Honestly, since I submitted this about a couple months ago, I just formatted the disks to ext3 and it's worked quite well since then, but any better ideas are always welcome.

--
'Yes, firefox is indeed greater than women. Can women block pops up for you? No. Can Firefox show you naked women? Yes.'

LOL by rainer_d · 2006-02-12 05:54 · Score: 2, Funny

> usually backed up from other mediums such as CDs and DVDs

This is a very euphemistic way of saying:
"I download moviez, mp3 and porn via P2P all day and even though I usually don't view any movie twice, I still don't want to throw away anything, because I just can't delete anything".

How that could get an "ask slashdot"-posting is left as an exercise to the reader.

--
Windows 2000 - from the guys who brought us edlin

Re:LOL by Anonymous Coward · 2006-02-13 23:13 · Score: 0

Kudos :-)

Not if you like data integrity. by Anonymous Coward · 2006-02-12 05:58 · Score: 0

I tried the ZFS beta on my 2TB raid array and it sucked ass big time. Combined with the myriad problems I had getting OpenSolaris to work properly without crashing I've since switched back to FreeBSD.

Re:Not if you like data integrity. by Anonymous Coward · 2006-02-12 16:51 · Score: 0

Probably hardware driver issues specific to your system.

I'll probaby get flamed for this by austad · 2006-02-12 06:24 · Score: 2, Interesting

But, after trying just about every FS under the sun for my backups, on Linux, FreeBSD, and OS X, I finally settled on Mac HFS+ with journaling and case-sensitivity enabled. I have a 900GB RAID with it on it, and I'm storing some files that are 7GB+. I haven't had any issues with it at all.

Yep, it means you will probably need a mac, but Linux does have HFS support (I don't know how good it is). But everything is working out great, and supposedly has some sort of auto-defrag, but I'm too lazy to actually verify this.

--
Need Free Juniper/NetScreen Support? JuniperForum

Re:HFS+, Journaled by CableModemSniper · 2006-02-12 06:32 · Score: 1

Fix permissions has little to do with the file-system. Yes, its dumb, but not because of the file-system. You need to fix-permissions when apps (Carbon-based usually) that were written without awareness of FS permissions (since they were sitting on an OS whose FS didn't have fs permissions) update important files that need to be read by a host of programs. When these ignorant apps write these files with their default umasks, it wreaks all sorts of havoc. You don't repair fs permissions because the filesystem loses permissions from time to time (since it doesn't), you do it because some programs are not forwards-compatible and inadvertantly change the permissions.

--
Why not fork?

AIX does not run on Mainframe or on Linux by raftpeople · 2006-02-12 06:38 · Score: 1

AIX ran on mainframe from 1990 to 1993.

Linux IS an OS and thus AIX does not run ON Linux.

These inaccuracies bring your post into question.

Which distro for XFS? by rsax · 2006-02-12 08:42 · Score: 1

I see a lot of people advocating XFS. Which distro supports XFS the best?

Re:Which distro for XFS? by dwater · 2006-02-12 12:55 · Score: 1

> Which distro supports XFS the best?

I guess it would have to be SGI's Linux or IRIX 6.5.whateveritsuptonow. They're bound to have it all working hunkey-dory.

However, I use it on fedora (I tried it on Ubuntu too, but I gave up on ubuntu because I couldn't get a 32-bit v3.3 toolchain to work (libraries were the problem)).

--
Max.

FMWORM by davecb · 2006-02-12 10:22 · Score: 3, Informative

Also spelled FM-WORM, a filesystem which looks like anormal NFS server but knows intimately whaqt needsto be done to deal with WORM disks.

It's a commercial product from Siemens, which I used years ago for Sietec's large-scale imaging product.

There is probably a Linux port: We ran it on almost everything in existance (;-))

--dave

--
davecb@spamcop.net

How about a JUKEBOX by Anonymous Coward · 2006-02-12 12:04 · Score: 0

Journaled Universal Keeper for Entries of Binary Object eXtensiblly, or JUKEBOX for short.

Re:Keep it simple. ext2 or fat32. by Anonymous Coward · 2006-02-12 12:41 · Score: 0

No, NTFS with it's mystical magical spare set of data, somehow manages to stay dead. It's garbage, the only upside is it dies a lot less than ext2/3.

I only wish I were trolling... as so much of my lost data would attest if it weren't lost.

Venti from Plan 9 is what you want by CondeZer0 · 2006-02-12 12:51 · Score: 1

For WORM oriented storage Venti is really good, for the details see the paper "Venti: a new approach to archival storage"

It was originally developed for Plan 9 as the replacement of the dedicated fs kernel that used WORM jukeboxes, but now you can use it under Linux, BSD and other Unix systems because it's part of the Plan 9 from User Space" port of Plan 9 tool to Unix systems.

--
"When in doubt, use brute force." Ken Thompson

Re:So just poo poo all the options then ask for on by joel48 · 2006-02-12 13:06 · Score: 1

I have also been most impressed by XFS *except* for one very annoying issue that makes me not trust it. When editing a file and the machine goes down, the contents of that file are zero-padded, and not at the place of editing or such, but the file gets filled with zeros. Any known fixes or resolution for this? Cause?

Nothing wrong with ext3... by Wdomburg · 2006-02-12 14:26 · Score: 1

What I'm looking for is a WORM-optimized FS that also has good journaling methods to prevent data loss due to some natural disaster while data is being shifted around.

Erm, Write Once Read Many would imply data isn't shifted, period.

I feel that Ext3 is not optimal for this

You don't provide a reason for dismissing it out of hand. It's a nice solid mature filesystem (thanks to being ext2 + additional features) that's widely supported. And if you want to use it for large files, you just need to tune it appropriately, either by manually increasing block size and blocks per inode, or using the -T flag to use a preset like one of the other posters suggested.

I'm curious about the issues reported by some of the other posts. I've been dealing with terabytes of data across hundreds of filesystems, including data with high turnover (e.g. mailspools, log servers, etc) with no data loss that wasn't attributable to hardware, like RAID controllers without battery backup that were left in write-back mode. I don't care what filesystem you're running, it won't be able to recover data that was in volitile cache during a power event.

ZFS by Jozer99 · 2006-02-12 15:46 · Score: 1

ZFS has 128 bit file tables. To writw a file that would take up that much space to an hd would require more energy than has reached earth from the sun in the last few eons.

BFS by yuktar · 2006-02-12 17:12 · Score: 1

Why not take a look at BFS (or BeFS, depending who you ask)? It supports several petabytes of data, and it is specifically designed to handle large media files. Journaling is built in, as well as the handy metadata database (like Spotlight), if your OS can support that feature.

Of course, I can't guarantee there's a filesystem driver available for whatever OS you may be running, though the internet shows many hits for "bfs linux". You may be able to find something about that from haiku-os.org, or elsewhere on the internet.

Re:BFS by Slashcrap · 2006-02-12 21:41 · Score: 1

Why not take a look at BFS (or BeFS, depending who you ask)?

Because only BeOS can write to it.

Because it hasn't been developed for many years.

There are many, many other reasons I could list why it's not feasible but I think I'll just summarise as follows :

You are an idiot. And probably a BeOS zealot. But I repeat myself.

XFS Performance. by sudog · 2006-02-12 18:52 · Score: 1

Most of the performance numbers I've seen out there unfortunately don't take into account parallel accesses to large files by highly concurrent processes--the exact same kind of access patterns that enterprise-scale storage requirements demand. Instead, they concentrate on single-thread access patterns. This seems to be unrealistic.

Given enterprise-class accesses (extremely large files (hundreds of GB,) hundreds or thousands of accesses to those files per second, and tens or hundreds of processes accessing them simultaneously) XFS appears to excel. In terms of delete performance, XFS appears to do well there also.

I personally know of dozens of *large* companies and financial institutions who trust nothing *but* XFS as their FS of choice.

Your I/O errors indicate other issues, and are more likely not related to the design of the FS.

Make mine XFS. I just wish XFS were extended out to the *BSDs.

Attention, citizen by Phreakiture · 2006-02-13 01:52 · Score: 1

Please remain where you are. The RIAA will be arriving shortly to collect a copyright fee (99 Cents) and a convenience fee ($2000). There may also be unspecified interest and late charges.

--
www.wavefront-av.com

XFS and USB drives by Anonymous Coward · 2006-02-13 03:12 · Score: 0

I found one of the Fedora Core kernels had problems with using external USB deives, MD (RAID 1 and 5) and XFS. One of the drives would go offline and corrupt the XFS file system.

Database Snapshots by Zan+Lynx · 2006-02-13 03:57 · Score: 1

If your database can't handle recovery from a snapshot, then it's junk. Get a new DB.

The POINT of all those transaction log files, DB journals, and all the other things that make ACID guarantees real, is to recover from failure. A filesystem snapshot looks exactly like a system that just had a power loss. Your database center may have redundant power backups and all that shiznit, but I bet the datacenter still has a Big Red Button, so your DB had better be able to do recovery ANYWAY.

Some DB backup tools are designed to work with filesystem snapshots. They create a fast recovery point and stop updating the main DB files until after the snapshot. Otherwise the DB has to do basically the same thing as a snapshot, holding open an hours long transaction while doing the backup. When the filesystem or block layer can do a better, more efficient job than the DB, it makes more sense to let it take the load.

Just trying to make you see that snapshots exist for good reason, they do the job very well, and you should look into using the feature; some day you might need it.

Re:Database Snapshots by Nutria · 2006-02-13 08:43 · Score: 1

The POINT of all those transaction log files, DB journals, and all the other things that make ACID guarantees real, is to recover from failure.

Sure, and they work very well.

You'd have to split every file system that every database file and recovery file and roll-forward journal are on, all at the same instant.

On our systems, that's 60 filesystems. And if one is off by a second, you get possible inconsistencies.

Better to have a highly parallel RDMBS backup utilitiy. Which is what we have.

--
"I don't know, therefore Aliens" Wafflebox1
Re:Database Snapshots by Anonymous Coward · 2006-02-13 09:38 · Score: 0

Wow. You don't know anything.
Re:Database Snapshots by Nutria · 2006-02-13 13:14 · Score: 1

Wow. You don't know anything.

I know enough to manage TB-sized databases.

--
"I don't know, therefore Aliens" Wafflebox1
Re:Database Snapshots by Anonymous Coward · 2006-02-15 08:42 · Score: 0

clearly not ;\

Off topic... by toadlife · 2006-02-13 14:17 · Score: 2, Interesting

[completely offtopic]
I always caught shit from other mp3'ers back in the late 90's because of my 'huge' 256kb songs. People that would download from me would frequently complain that my files were too big and that there was no use encoding them at bitrates that high because "128kbps was already CD Quality".

It was also really easy to start flamewars by bringing up the topic. You could just go into an mp3 IRC channel, make an offhand comment like "128kbps mp3 files sound like crap; 192-156 is really needed to approach true CD Quality", and people would immediately start arguing with you - probably in a subconscious effort to justify the fact that they had spent the last three months encoding their entire CD collection at 128kbps.

--
I don't always use unix-like operating systems; but when I do, I prefer FreeBSD.

Something's wrong with your system by Bombcar · 2006-02-13 15:40 · Score: 1

XFS shouldn't barf like that. I've run terabytes of data though it without a single hiccup. Currently I've about a terabyte of data on my home system. Even perfect software can't handle intermittent hardware failures, just warn you about them. I'd get your system checked out.

--
Fellowship 9/11

re: A Good Filesystem for Storing Large Binaries by rar42 · 2006-02-13 19:31 · Score: 1

Well my preferred file system is this...

I keep my Cd's in a large loose leaf folder. I keep my vinyl on top of the cupboard. I buy Cassette Tapes at the cheapest price I can. My partner records tapes of what ever we want for travelling. My books are semi-randomly arranged on shelves.

The web is my storage for text files and I use spreadsheets or programming languages to solve problems.

--
rgds,
Richard Rothwell
"All that is required for evil to triumph is that the good keep silent"

How about Tar? by grumbel · 2006-02-14 09:35 · Score: 1

If its really a WORM-style of doing things I would skip regular filesystem completly and go with the most simple thing available, which would be good old Tar files. They are just file header and raw data, so their is not much that can go wrong in terms of filesystem integrity and even if it does they are reasonably easy to recover. They don't come with any build in ways to validate them but that might probally be add-able on top of them. There should be somewhere a tarfs floating around that would allow them to be mounted as you can a normal filesystem.

Sun StorEdge Performance and Utilisation Suite by PhunkySchtuff · 2006-02-14 09:41 · Score: 1

Using Sun's QFS - which is a SAN optimised file system, it's great for storing large and small files - for instance disc images and readmes. You can have a variable cluster size, where the first n blocks of a file are a small number of blocks, like say 1k, and then the rest of the file is stored in clusters of up to 64M.

Oh, you wanted something free?

Then give ZFS a go - it's free and is available in Solaris and OpenSolaris - which you can run on both SPARC and x86.

--
Specialist Mac support for creative pros, Melbourne

Slashdot Mirror

A Good Filesystem for Storing Large Binaries?

214 comments