Google Switching To EXT4 Filesystem
An anonymous reader writes "Google is in the process of upgrading their existing EXT2 filesystem to the new and improved EXT4 filesystem. Google has benchmarked three different filesystems — XFS, EXT4 and JFS. In their benchmarking, EXT4 and XFS performed equally well. However, in view of the easier upgrade path from EXT2 to EXT4, Google has decided to go ahead with EXT4."
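For anyone wondering what the "easier upgrade path" looks like in practice: an ext2 volume can be converted in place by enabling the ext4 features with tune2fs and then forcing an fsck. Below is a minimal sketch that only builds the commands and does not run them. The device path is a placeholder, the feature list follows the commonly documented conversion recipe, and nothing here is claimed to be Google's actual procedure.

```python
def ext2_to_ext4_commands(device):
    """Return the commands for an in-place ext2 -> ext4 conversion.

    Nothing is executed here. The filesystem must be unmounted before
    running these by hand, and a backup beforehand is strongly advised.
    """
    features = ["has_journal", "extents", "uninit_bg", "dir_index"]
    return [
        # Enable the journal plus the ext4 on-disk features.
        ["tune2fs", "-O", ",".join(features), device],
        # A forced check is needed after changing features; -D also
        # re-optimizes directories.
        ["e2fsck", "-f", "-D", device],
    ]


if __name__ == "__main__":
    for argv in ext2_to_ext4_commands("/dev/sdXN"):  # placeholder device
        print(" ".join(argv))
```

Files that already exist keep their old block-mapped layout after a conversion like this; only newly written files use extents.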
I guess now is as good a time as any to go through my Gmail and Google Docs and make local backups. I'm sure my info is safe, but I have been through these types of 'upgrades' at work before and every once in a while... well, let's just say backups are never a bad idea.
I judt got a nre Kinesis keybiartf so please excusr ant egregiou typos.
Looks like Digitizor already melted.
Eats, shoots and leaves. Read it.
News for nerds. Stuff that matters.
Not that I RTFA or anything, but I find it interesting that XFS and EXT4 both appear to be equally impressive with benchmarks, and it's implied they are both better than JFS. You must not be a nerd.
I went to eat some animal crackers and the box said, "Do not eat if seal is broken." I opened the box and sure enough..
I managed to ease a pageview out of it. That said, the /. summary says all they say, and you're all better served by the source they point to, which is what SHOULD have been in the article summary instead of the Digitizor site.
See http://lists.openwall.net/linux-ext4/2010/01/04/8
SIG: HUP
They have Ted Ts'o of Linux filesystem fame working for them now.
"Oppression and harassment is a small price to pay to live in the land of the free." -- Montgomery Burns.
I guess they didn't consider btrfs ready enough for benchmarking yet.
It's interesting that ReiserFS wasn't even an option here. I myself even ended up using Ext4 when I set up a new box not too long ago. It's a real shame that just because the creator of the filesystem committed a crime, people are drawn to treat the technology itself as somehow dishonored.
The main advantage of EXT3 over EXT2 is that, with journaling, if you ever need to fsck the data, it goes a LOT quicker. It's interesting to note that Google never felt it needed that functionality.
Additionally, I was under the impression that Google used massive numbers of commodity consumer-grade harddrives, as opposed to high-grade stuff which I presume is less likely to err. Couple this fact with the massive amount of data Google is working with and there has got to be a lot of filesystem errors, no?
Can anyone else with experience with big database stuff hint as to why Google would not need to fsck their data (often enough for EXT3 to be worthwhile)? Is it cheaper just to overwrite the data from some backup elsewhere at this scale? How do they know the backup is clean without fscking that?
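One common answer to "how do they know the copy is clean" at this scale is checksumming above the filesystem rather than relying on fsck: the GFS paper linked elsewhere in this thread describes chunkservers keeping a checksum for each block of a chunk and re-fetching from another replica on a mismatch. A minimal sketch of that idea follows. The use of CRC32, the function names, and the in-memory layout are assumptions for illustration, not Google's code.

```python
import zlib

BLOCK_SIZE = 64 * 1024  # assumed block size for this sketch


def block_checksums(data, block_size=BLOCK_SIZE):
    """Return one CRC32 per fixed-size block of data."""
    return [
        zlib.crc32(data[offset:offset + block_size])
        for offset in range(0, len(data), block_size)
    ]


def corrupt_blocks(data, expected, block_size=BLOCK_SIZE):
    """Return the indexes of blocks whose checksum differs from expected."""
    actual = block_checksums(data, block_size)
    return [i for i, (a, e) in enumerate(zip(actual, expected)) if a != e]


if __name__ == "__main__":
    original = bytes(200 * 1024)          # 200 KiB of zeros, i.e. 4 blocks
    sums = block_checksums(original)
    damaged = bytearray(original)
    damaged[70 * 1024] ^= 0xFF            # flip bits inside the second block
    print(corrupt_blocks(bytes(damaged), sums))  # prints [1]
```

With checks like this on every read, a bad replica can be detected and replaced from a good one without ever running fsck on the underlying disk.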
"A witty saying proves nothing." - Voltaire
Did they fix that nasty "if you have files > 512MB kiss them goodbye" bug ?
I want to delete my account but Slashdot doesn't allow it.
From TFA:
In their benchmarking, EXT4 and XFS performed, as impressively as each other.
WTF kind of retarded sentence is that?! Did Rob Smith help you write that article?!
In their benchmarking of EXT4 and XFS, EACH performed as impressively as THE OTHER.
We are still using ext2 on servers. Now I have an argument; if Google is still using ext2 maybe we aren't so foolish. We might update some day but it is not yet a priority. With UPS and proper failover and backup procedures in place, I can't remember when a journaling file system would have helped us in any way. They seem great for desktops/laptops, though.
Everything I write is lies, read between the lines.
I too have abandoned using ReiserFS but it's not about the horrible crime Hans committed. It's about the fact I don't think the company that he owned (who developed ReiserFS) has a great future, so I foresee maintenance problems with that filesystem. Sure, somebody else can continue their work but I'm not going to hold my breath.
I've used XFS on a RAID1 setup with SATA drives, and found the performance of the delete operation extremely dependent on how the partition was formatted.
I saw times of up to 5 minutes to delete a Linux kernel source tree on a partition that was formatted XFS with the defaults. Have to use something like sunit=64, swidth=64, and even then it takes 5 seconds to rm -rf /usr/src/linux. I've heard that SAS drives wouldn't exhibit this slowness. Under Reiserfs on the same system, the delete took 1 second. Anyway, XFS is notorious for slow delete operations.
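For reference, mkfs.xfs takes sunit and swidth in 512-byte units, so the numbers above translate into a stripe size in bytes. Here is a small worked calculation. Reading sunit=64, swidth=64 as a 32 KiB stripe unit with a single data-bearing disk is my assumption, since the post does not state the array geometry, and the function name is made up for this sketch.

```python
SECTOR_BYTES = 512  # mkfs.xfs expresses sunit/swidth in 512-byte units


def xfs_stripe_geometry(stripe_unit_bytes, data_disks):
    """Return (sunit, swidth) in 512-byte sectors for a striped array."""
    if stripe_unit_bytes % SECTOR_BYTES:
        raise ValueError("stripe unit must be a multiple of 512 bytes")
    sunit = stripe_unit_bytes // SECTOR_BYTES
    swidth = sunit * data_disks
    return sunit, swidth


if __name__ == "__main__":
    # 32 KiB stripe unit, one data disk: the values quoted in the post.
    print(xfs_stripe_geometry(32 * 1024, 1))   # prints (64, 64)
    # 64 KiB stripe unit across four data disks, for comparison.
    print(xfs_stripe_geometry(64 * 1024, 4))   # prints (128, 512)
```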
Intellectual Property is a monopolistic, selfish, and defective concept. It is "tyranny over the mind of man"
I thought Google had their own file system, named the Google File System.
http://labs.google.com/papers/gfs.html
Might this prompt someone at Google to make an installable file system driver for Windows for EXT4? Right now, there is none, because of differing inode sizes and some extra features over EXT2 that EXT4 demands, I think.
I didn't read the funky article because it's been slashdotted, but I'd like to see the benchmarks properly.
Gee, I hope they're not using Ubuntu 9.10 by any chance: http://www.ubuntu.com/getubuntu/releasenotes/910
The damn bug is STILL not fixed apparently. Some people get the corruption, and some don't. Scares me enough to not even try using ext4 just yet, and I'm still surprised Canonical was stupid enough to have ext4 as the default filesystem in Karmic.
Then again, perhaps Google knows what they're doing.
When does black become white?
#CCCCCC or #888888
Is there overlap with Flamebait?
When does an otherwise 'troll' moderation-worthy comment lose out on status that could validate 19 responses, with 50% scoring +2?
Sometimes a troll is a troll, but sometimes it's just a shadow.
"In their benchmarking, EXT4 and XFS performed, as impressively as each other."
Welcome to 2001, subby. Glad you could make it this decade.
I completely understand them not jumping to XFS, though. I'd never want to convert exabytes of data from one FS to another.
what about all the people who don't even bother to log in to post as AC?
Is this why Google was down for about 30 minutes today? Did anyone else even experience this or was it a local issue?
Kill them with fire?
There is a war going on for your mind.
I just hope Ted Ts'o hasn't been cooking the ext4 benchmarks again by making data notoriously less safe with a lot of retarded default settings. With data integrity restored, ext4 should perform on a par with ext3, but should do far better on filesystems of hundreds of gigabytes or many terabytes. XFS has reigned there for many years, so I take the article with a pinch of salt.
Or, you could stop being lazy and go tweak your preferences, thereby saving the rest of us from your whining.
Linux, you magnificent bastard, I read the fucking manual!
When you run data centres around the world that are collectively the most powerful supercomputer known to man, you too can get a front page story on /. announcing your upgrade.
Until then, STFU.
Anyone who loves or hates any language, platform, or manufacturer, doesn't know what they're talking about.
The data path from program to disk is loooong. On a system with heavy CPU load, benchmarks on a well-tuned XFS system can fall to the same level as ext2 with defaults. Even multi-core doesn't help XFS under load; running Folding@Home at nice +19 still sucker-punched it.
JFS? It fails to scale on disk-saturated systems. However, it does have some optimizations specific to database workloads. Populating a sparse file ran fastest on my system, where XFS was a total fail.
ext3 under heavy CPU load showed degradation that appeared in the benchmarks, but was noticeable on the desktop only if I was watching for it. And ext4 (formatted, not converted from ext2/3) under load is faster than ext3 without load, when using "elevator=noop" at boot.
N.B.: The above benchmarks on my system all used external journals, except ext2 natch.
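If you want to confirm which I/O scheduler a disk is actually using after booting with "elevator=noop", the kernel exposes it in sysfs with the active one shown in square brackets. A minimal sketch of reading that line follows. The default device name "sda" is a placeholder, and newer kernels list different scheduler names than the ones in this thread.

```python
from pathlib import Path


def active_scheduler(scheduler_text):
    """Return the scheduler shown in [brackets], or None if none is marked."""
    for name in scheduler_text.split():
        if name.startswith("[") and name.endswith("]"):
            return name[1:-1]
    return None


def read_scheduler(device="sda"):
    """Read the active scheduler for a block device (Linux sysfs only)."""
    path = Path("/sys/block") / device / "queue" / "scheduler"
    return active_scheduler(path.read_text())


if __name__ == "__main__":
    print(active_scheduler("noop deadline [cfq]"))  # prints cfq
    print(active_scheduler("[noop] deadline cfq"))  # prints noop
```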
You can configure a higher threshold; 1 should be enough to filter most ACs.
Dilbert RSS feed
I assume you mean Increase the signal-to-noise ratio. Did you mean reduce the noise floor?
Here ya go.
Quack, quack.
(See first post)
SIG FAULT: Post index out of bounds.
BREAKING NEWS:
Google switches to new softer 2-ply toilet paper to reduce employee chafing.
BeauHD. Worst editor since kdawson.
Their *what*, man, their *what*???
:)
Cheers,
I've seen huge performance leaps for large files and directories after reinstalling my system on an ext4 partition. Ext3 was very slow to list directories containing large numbers of files, and deleting very large files took tens of seconds, during which the filesystem was hung. I couldn't remove large files while recording TV, otherwise the recording would hang and skip several seconds. No longer the case now I'm on ext4.
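Anyone who wants to compare this on their own partitions can time the two operations directly. Here is a rough sketch that lists a directory full of small files and then deletes one large file. The function name, file counts, and sizes are arbitrary choices of mine, and the page cache will flatter the numbers unless caches are dropped between runs.

```python
import os
import shutil
import tempfile
import time


def time_listing_and_delete(root, n_files=1000, big_file_bytes=8 * 1024 * 1024):
    """Return (entries_listed, list_seconds, delete_seconds) for a scratch dir."""
    workdir = tempfile.mkdtemp(dir=root)
    try:
        # Populate the directory with many empty files, then time a listing.
        for i in range(n_files):
            open(os.path.join(workdir, f"f{i:06d}"), "wb").close()
        start = time.perf_counter()
        names = os.listdir(workdir)
        list_seconds = time.perf_counter() - start

        # Write one large file to disk, then time its removal.
        big_path = os.path.join(workdir, "big.bin")
        with open(big_path, "wb") as big:
            big.write(b"\0" * big_file_bytes)
            big.flush()
            os.fsync(big.fileno())
        start = time.perf_counter()
        os.remove(big_path)
        delete_seconds = time.perf_counter() - start
        return len(names), list_seconds, delete_seconds
    finally:
        shutil.rmtree(workdir, ignore_errors=True)


if __name__ == "__main__":
    # Point root at a directory on the filesystem you want to measure.
    print(time_listing_and_delete(tempfile.gettempdir()))
```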
Wasn't there an article here recently regarding ext3 vs. ext4 and power failures? The claim was that ext4, while compliant with a white paper, was not doing due diligence on the journaling stuff (as ext3 did) and should not be considered a true, hardcore journaling filesystem. I recall a tiny uproar regarding ext4 not being an upgrade path for ext3 users if you care about your data. (I realise Google is going ext2 > ext4.)
The article claims that the project leader and developers think they are in the right by sticking to the flawed white paper.
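As I understand that debate, the trouble hit applications that rewrote a file and renamed it into place without calling fsync first, which under delayed allocation could leave an empty file after a crash. The usual application-side pattern is write to a temp file, fsync, then rename. A minimal sketch, assuming a POSIX system; the helper name is mine and this is not taken from any of the articles discussed.

```python
import os
import tempfile


def atomic_write(path, data):
    """Replace path with data so a crash leaves either the old or new file."""
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    try:
        with os.fdopen(fd, "wb") as tmp:
            tmp.write(data)
            tmp.flush()
            os.fsync(tmp.fileno())  # force the data to disk before the rename
        os.replace(tmp_path, path)  # atomic rename on POSIX filesystems
    except BaseException:
        # Clean up the temp file if anything failed before the rename.
        if os.path.exists(tmp_path):
            os.unlink(tmp_path)
        raise
    # Also sync the directory so the rename itself is durable.
    dir_fd = os.open(directory, os.O_RDONLY)
    try:
        os.fsync(dir_fd)
    finally:
        os.close(dir_fd)


if __name__ == "__main__":
    target = os.path.join(tempfile.mkdtemp(), "settings.conf")
    atomic_write(target, b"version=1\n")
    atomic_write(target, b"version=2\n")
    with open(target, "rb") as handle:
        print(handle.read())  # prints b'version=2\n'
```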
What's the fun in that, how would you know if somebody flames you? Half the time I get flamed, the initiating post ends up modded to +5
Apocalypse Cancelled, Sorry, No Ticket Refunds
>>I miss LFS.
Along with your many other idolatries...
No, no - He meant what he wrote, he just has no idea of what "signal to noise ratio" means, and doesn't care that he looks like an idiot to those that do.
Hello moderators?
I live in a zone where power failures are very common. While I was using EXT3 I lost data several times due to power failures, and there was even a time a disk got corrupted. After I switched to JFS the data loss has been minimal and I have never had a corrupted disk. Another thing I enjoy in JFS is that it's really quick to fsck a disk after a power failure. So is it safe to switch to EXT4?
Google has their own proprietary file system called GFS (and now GFS2). Who came up with this rubbish?
They have a special file system because of their design demands and the inherent flaws in most file systems when you cluster vast numbers of computers together.
What does the writer of this post think he will accomplish by sending out this garbage is what I want to know!