IBM's High Performance File System

Nothing new here. Move along. by kperrier · 2006-03-10 05:21 · Score: 3, Informative

There is nothing new about GPFS. Its been around for years.

Re:Nothing new here. Move along. by Mes · 2006-03-10 05:34 · Score: 2, Informative

I was working on this 5 years ago, and Im sure its been around much longer than that.
Re:Nothing new here. Move along. by MoxCamel · 2006-03-10 05:37 · Score: 4, Informative

Agreed. We've been using GPFS for 2 1/2 years. The long and short of it is that it's much more stable on AIX than it is on Linux. It's getting better on Linux, but it's still got a ways to go.
Mox
Re:Nothing new here. Move along. by Kadin2048 · 2006-03-10 05:43 · Score: 5, Informative

I think the "news" is the transfer rate, not the file system.

According to this article, the idea was just to see how fast a sustained transfer rate they could achieve. That rate was 102 GiB/s, which apparently is a record. The purpose of the project apparently has something to do with reducing the bottlenecking in parallel-computing interconnects. The machine they used, ASC Purple (a weapons-research system at Lawrence Livermore Labs) has about 10,000+ processors, so that's their obvious application.

The filesystem itself doesn't seem to be anything new -- I have no idea why the poster fixated on that, since it's kind of a minor footnote in most of the articles I've read about this today.

--
"Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."
Re:Nothing new here. Move along. by Illbay · 2006-03-10 05:43 · Score: 1

Is this generally available? Anywhere I could get it?

--
Any technology distinguishable from magic is insufficiently advanced.
Re:Nothing new here. Move along. by Anonymous Coward · 2006-03-10 06:20 · Score: 0

The size of the file system is also astonishing at 1.6 petabytes

1.6 * 1024 * 1024 * 1024 / 1.44 = 1193046471.1111112

so i need only 1.2 billion good old floppies to store one filesystem ...

Please insert disk #67093424 and press return to continue the boot ...
Re:Nothing new here. Move along. by Mostly+a+lurker · 2006-03-10 07:00 · Score: 1, Interesting

If it is available on AIX, does this mean that The SCO Group will claim IBM had no right to make it available for Linux in the first place? I am not joking (at least, not deliberately). If I understand tSCOg's derivative works theory, they claim that any code that has touched SYSV is automatically a UNIX derivative and under their control.
Re:Nothing new here. Move along. by soft_guy · 2006-03-10 07:12 · Score: 1, Funny

SCO would claim you have to pay them for the air you breathe because Darl McBride farted once.

--
Avoid Missing Ball for High Score
Re:Nothing new here. Move along. by einhverfr · 2006-03-10 09:02 · Score: 3, Funny

I tried to install GPFS on Windows 98 and I keep getting GPFs... Is this supposed to happen?

--

LedgerSMB: Open source Accounting/ERP
Re:Nothing new here. Move along. by wealthychef · 2006-03-10 16:52 · Score: 1

Yes, just click here.

--
Currently hooked on AMP
Re:Nothing new here. Move along. by Illbay · 2006-03-10 23:02 · Score: 1

Why, thank you, you're too kind. I hope that I might be of equal service someday.

--
Any technology distinguishable from magic is insufficiently advanced.
Re:Nothing new here. Move along. by Anonymous Coward · 2006-03-11 13:09 · Score: 0

1.6 PetaBytes is a lot!!! for 7nth-order magnitude of optimization: forUVLSI circuits like Pentium-M Core Solo, reduction of 100 tr-millions to 50 tr-millions, ...
PuffffffFfFFfFffFFFfff!!!

Well.... by GoMMiX · 2006-03-10 05:22 · Score: 2, Funny

Atleast someone can make a new filesystem... *cough* Microsoft *cough*

Re:Well.... by ackthpt · 2006-03-10 05:42 · Score: 5, Funny

Atleast someone can make a new filesystem... *cough* Microsoft *cough*
Oh, come now. They just finished winning their latest legal round on FAT
Give them a moment to catch their breath, will you?
introducing OrigamiFS, you write it out on paper then fold it in half as many times as you can

--

A feeling of having made the same mistake before: Deja Foobar
Re:Well.... by Ostsol · 2006-03-10 06:21 · Score: 1

Yeah, it'd be nice if NTFS received some updates other than just in new versions of the Windows. An easy way to use junctions would be nice. Symbollic links have been easily usable in Linux for a long time. . .
Re:Well.... by garaged · 2006-03-10 06:21 · Score: 1, Interesting

introducing OrigamiFS, you write it out on paper then fold it in half as many times as you can

Thats about 8 times ? or do they have that girl that can do it 12 ??

--
I'm positive, don't belive me look at my karma
Re:Well.... by Firewalker_Midnights · 2006-03-10 06:26 · Score: 3, Informative

"introducing OrigamiFS, you write it out on paper then fold it in half as many times as you can"

Apparrently it can only be folded 12 times, at most. Unless M$ has created a new form of highly (unstable) foldable OS :D

--
I Lost My Virginity While Waiting for BSD to Compile.
Re:Well.... by ivan256 · 2006-03-10 09:03 · Score: 1

A breakthrough in stretchy paper technology would break that 12 fold barrier.
Re:Well.... by GoMMiX · 2006-03-10 13:50 · Score: 1

Actually I was just referring to the fact that Vista's main feature was supposed to be a new file system - which eventually got dropped off the feature list.

Apparently, though, some Microsoft employees with moderation points took offense to that. Which actually makes me glad I posted it. :)

Personally, I use windows to play games - and only because I have no choice. I could care less about any changes. I'm just estatic to hear Vista will not be backwards comopatible with older software written for other versions of Windows. Talk about shooting themselves in the foot! If they keep that crap up, I may not [b]have[/b] to use Windows for games anymore!(Yeah, I know - wine/linux - sorry - too much trouble for me)
Re:Well.... by Ostsol · 2006-03-11 05:23 · Score: 1

Me too. If it weren't for Galactic Civilisations 2, I'd be using my Linux partition -all- the time.

Well.. by include($dysmas) · 2006-03-10 05:22 · Score: 0, Redundant

bah!, i want my 25mb hdd from my Amiga 500+ back, at least i undestood the TLA's with that...

;)

10 Tbytes? by kyouteki · 2006-03-10 05:23 · Score: 3, Funny

But what kind of performance does this give on relatively small ( 10Tbytes) file systems? Petabyte arrays are still kind of out of reach for most.

--
A slashdotter who didn't build his own computer is like a Jedi who didn't build his own lightsaber.

Re: 10 Tbytes? by KDan · 2006-03-10 05:39 · Score: 5, Funny

You puny geekling. It's been years since I migrated my enormous collection of pr0n to my petabyte array...

Running out of space too... maybe I should build a beowulf cluster of them.

Daniel

--
Carpe Diem
Re: 10 Tbytes? by Tester · 2006-03-10 05:45 · Score: 2, Interesting

they have 104 servers... that's almost 1GB/s/server ... that's a lot.. and they have 4 raid controlers per server.. that means each raid controler does around 250 mb/s.. (which normal for a high end raid controler) and they are connected with a 10gb/s interconnect (probably infiniband or 10G ethernet). So the whole thing is not that hard to do if you use your servers properly.

But they have 1000 clients.. so its only 100MB/s/client.. so 1Gbps/s/client.. so the clients are probably gigabit ethernet... Otherwise they could do much more... I've seen other cluster file systems do 600MB/s/client, its not that impressive. It only shows that IBM has a huge budget and they can afford lots of hardware.

This is like saying, NASA builds huge rocket for many many billions of dollars. Its just of matter of cash, not of great technical prowness.

If we do a dollar count.. lets say 10k$/server * 104 = 1M$ + 25k$/storage controler w/ disks * 416 = 10M$ + 2k$/client * 1000 = 2M$, plus the switches etc... give me 30M$ and I can do the same thing.

Btw, I work on a cluster filesystem, that performance is not that hard to achieve if you have that kind of hardware.
Re: 10 Tbytes? by chris_eineke · 2006-03-10 06:01 · Score: 4, Insightful

relatively small ( 10Tbytes) file systems
Seagate recently released a 500GB hard-drive. It costs $431.99CAD. 2 of them makes 1 TB. 2000 makes 1 PB. (Yes, that's overly simplified because it doesn't take into account interconnection cost, cooling, hydro, &c.)

2000 x 431.99 = $863,980CAD

I don't think that that's a lot of money for a petabyte raid. Hell, you might even get a 20% discount. Now think back about 20 years. That sum of money could have bought you 1 GB - that is an order of magnitude less in hard drive space. But here is the kicker:
Approx. 20 years down the road you will get at least two magnitudes more for the same amount of money (wo/ inflation). Why? Because approx. 30 years ago, that sum of money bought you 1 MB of space.

Ray Kurweil calls it the "Law of Accelerating Returns". 20 years down the road I will call it my petaporn array . Or maybe better not. ;)

--
"All you have to do is be fragile and grateful. So stay the underdog." Chuck Palahniuk, Choke
Re: 10 Tbytes? by kyouteki · 2006-03-10 06:10 · Score: 1

Fair enough. But I made $21,000USD last year. I have 5Tb of storage in my house already (got some good deals on 250GB drives). I was speaking more from a more-storage-than-I-know-what-to-do-with perspective (though Bittorrent might make that point moot soon enough) than an enterprise perspective.

--
A slashdotter who didn't build his own computer is like a Jedi who didn't build his own lightsaber.
Re: 10 Tbytes? by Kadin2048 · 2006-03-10 06:11 · Score: 4, Informative

From the articles I've read, this was accomplished using (some subset of) ASC Purple, which is full of a lot of either custom or IBM-proprietary stuff (or else stuff that nobody but IBM seems to be using).

According to the published/unclassified spec sheet:

"Purple has 2 million gigabytes of storage from more than 11,000 Serial ATA and Fibre Channel disks. ... Each login node has eight 10-gigabytes-per-second network connections for parallel file transfer protocol and two 1-gigabyte-per-second network connections for network file systems and secure shell protocol. The system has a three-stage 1,536 port dual plane Federation switch interconnect ..."

I think that it was this last thing, the Federation interconnect, that they were pushing the data over in this test, since it forms the backbone of the machine and links the storage nodes to the login node controllers, which then connect to the login nodes themselves (of which there are apparently over 1,400 of, according to this). I couldn't find much information on Federation, as it seems to only be used in a few systems, of which Purple is the most notable. One reference I found seems to put it at 1.49 GB/sec (11.92 Gbit/s) bandwidth, although it's not clear if that's "dual plane" Federation or not. 4X SDR Infiniband is around 10 Gbit/sec, IIRC, so Federation's a little faster.

It does sound a little like it was a case of "hey, what can we do with $230M worth of hardware? I know, let's break some records." So they did. I'm not sure that there's anything there that anyone else couldn't do, with different technologies, given the same investment of capital -- it's just a matter of who else wants to, and has the capability.

--
"Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."
Re: 10 Tbytes? by Anonymous Coward · 2006-03-10 06:43 · Score: 2, Funny

I recently purchased a 1,024 TB SAN array to hold all my porn. Now there are *TWO* peta-files in my house!
Re: 10 Tbytes? by Anonymous Coward · 2006-03-10 06:43 · Score: 0

If they use windoz they get 1B/minute like the rest of us.

OPENBSD FTW
Re: 10 Tbytes? by Anonymous Coward · 2006-03-10 06:44 · Score: 0

I migrated my enormous collection of pr0n to my petabyte array...

Are you perhaps confusing this with your pedobyte array?
Re: 10 Tbytes? by Anonymous Coward · 2006-03-10 06:59 · Score: 0

No one needs that many pictures of goatse.
Re: 10 Tbytes? by chris_eineke · 2006-03-10 07:10 · Score: 1

I was speaking more from a more-storage-than-I-know-what-to-do-with perspective
Oh, don't worry. There is never enough storage(TM). Movie encoding quality will increase, games will get more immersive (maybe movies too), more detail, more of this, more of that. If transmission speeds increase quality will go up, if quality goes up transmittion speeds will have to increase. Mix in new technologies at any point and the more-storage-than-I-know-what-to-do-with-dept. won't close anytime soon ;)

--
"All you have to do is be fragile and grateful. So stay the underdog." Chuck Palahniuk, Choke
Re: 10 Tbytes? by xquark · 2006-03-10 09:52 · Score: 1

why use the term array anymore? I'm guessing it will just be a
petabye hdd, an all in one enclosed unit.

what might be called an array in the future might be a zettabyte
array or something similar in size.

Arash

--
Arash Partow's Philosophy: Be a person who knows what they don't know, and not a person who doesn't know.
Re: 10 Tbytes? by mfrank · 2006-03-10 10:33 · Score: 1

Looking at my hardware purchases since 1990, I'd say that hard drive density per dollar goes up an order of magnitude every 4 years. So:

2010: 5 terabyte
2014: 50 terabyte
2018: 0.5 petabyte

The main problem is, what are people going to use them for?
Re: 10 Tbytes? by Watson+Ladd · 2006-03-10 11:14 · Score: 1

Heavy Petabyte that is.

--
Inventions have long since reached their limit, and I see no hope for further development.-- Frontinus, 1st cent. AD
Re: 10 Tbytes? by oliverk · 2006-03-10 11:46 · Score: 2, Funny

You puny geekling. It's been years since I migrated my enormous collection of pr0n to my petabyte array...
Great, but you only ever watch 7 minutes at a time! That's like 100 billion years of pr0n!!

--
---- Please be nice in case my Slashdot karma ~= my real life karma.
Re: 10 Tbytes? by soldack · 2006-03-10 12:34 · Score: 1

"I think that it was this last thing, the Federation interconnect, that they were pushing the data over in this test, since it forms the backbone of the machine and links the storage nodes to the login node controllers, which then connect to the login nodes themselves (of which there are apparently over 1,400 of, according to this). I couldn't find much information on Federation, as it seems to only be used in a few systems, of which Purple is the most notable. One reference I found seems to put it at 1.49 GB/sec (11.92 Gbit/s) bandwidth, although it's not clear if that's "dual plane" Federation or not. 4X SDR Infiniband is around 10 Gbit/sec, IIRC, so Federation's a little faster."

Did some research and found the following:
http://www.llnl.gov/asc/platforms/purple/configura tion.html
http://www.redbooks.ibm.com/abstracts/sg246978.htm l - Info on the switch used

It appears to be an IBM thing that is only used on these big ASC platforms. The other parts of the company are using InfiniBand quite a bit though.
-Ack

--
-- soldack
Re: 10 Tbytes? by Anonymous Coward · 2006-03-10 13:51 · Score: 0

From the articles I've read, this was accomplished using (some subset of) ASC Purple, which is full of a lot of either custom or IBM-proprietary stuff (or else stuff that nobody but IBM seems to be using).

According to the published/unclassified spec sheet:

"Purple has 2 million gigabytes of storage from more than 11,000 Serial ATA and Fibre Channel disks. ... Each login node has eight 10-gigabytes-per-second network connections for parallel file transfer protocol and two 1-gigabyte-per-second network connections for network file systems and secure shell protocol. The system has a three-stage 1,536 port dual plane Federation switch interconnect ..."

I think that it was this last thing, the Federation interconnect, that they were pushing the data over in this test, since it forms the backbone of the machine and links the storage nodes to the login node controllers, which then connect to the login nodes themselves (of which there are apparently over 1,400 of, according to this). I couldn't find much information on Federation, as it seems to only be used in a few systems, of which Purple is the most notable. One reference I found seems to put it at 1.49 GB/sec (11.92 Gbit/s) bandwidth, although it's not clear if that's "dual plane" Federation or not. 4X SDR Infiniband is around 10 Gbit/sec, IIRC, so Federation's a little faster.

Federation was the project name. HPS is the official product name. It was originally supposed to be a NUMA interconnect, but IBM changed their mind on NUMA. IBM's performance report is here:

http://www-03.ibm.com/servers/eserver/pseries/hard ware/whitepapers/pseries_hps_perf.pdf

The theoretical max banwidth is 2 gigabytes/sec for send and 2 gigabytes/sec for receive for each adapter, simultaneously. With the big Squadrons box, 16 adapters can be used to achieve a max of 64 gigabytes/sec/machine. Federation GA 1 was a rather bulky zero copy protocol stack. GA 2 was a return to the FIFO packet mode that was the basis of SP2 (colony/corsair) (yes, to save our asses because the zero copy architecture was horrible wrt latency). GA 3 added RDMA capabilities to the FIFO packet mode stack, which provided low latency __and__ high bandwidth.

"dual plane" means that each node is connected to 2 different Federation networks. They built the machine with 2 separate networks to reduce the number of hops that it takes to cross the network. Even with 2 planes it still can take 3 hops (they seem to always be chopping the machine up...).

There is more info here:

http://www.redbooks.ibm.com/redbooks/pdfs/sg246978 .pdf

Hitachi's SR11000 machines use POWER5, AIX5L, and Federation.

http://www.hitachi.co.jp/Prod/comp/hpc/SR_e/11ktop _e.html
Re: 10 Tbytes? by rubycodez · 2006-03-10 14:25 · Score: 1

wannabie. my little petabyte array is just for my money-shot collection.
Re: 10 Tbytes? by mdielmann · 2006-03-10 17:45 · Score: 0

The main problem is, what are people going to use them for?

I'm guessing about half will go to the latest version of Windows and Office.

--
Sure I'm paranoid, but am I paranoid enough?
Re: 10 Tbytes? by blaksaga · 2006-03-10 18:25 · Score: 1

Shit. For that amount of money I will just buy a time machine on ebay and bring back $50 petabyte hard drives from the future.
Re: 10 Tbytes? by mrhartwig · 2006-03-11 12:44 · Score: 1

I think that it was this last thing, the Federation interconnect, that they were pushing the data over in this test, since it forms the backbone of the machine...

The Federation switch is just the last incarnation of IBM's SP switch, which was the high-speed, low-latency, redundant, leaps-tall-buildings-in-a-single-bound, interconnect between the nodes of their Scalable Processor (SP) systems. Said systems were the basis for a bunch of the "we're at the top of the Supercomputing 500 list" IBM systems in the 90s, and probably into the current decade.

SPs were useful for highly parallel scientific processing, and were also great for some commercial applications like Oracle parallel (when you could get it to stay up). The high-speed switch beat the crap out of hooking systems together with Ethernet...

Originally, SP nodes were modified RS/6000 systems; they had the switch hardware tied very closely to the CPU/memory bus, and therefore had pretty fast data transfer around the system. Later, IBM started building cards to put into more normal HW, and (here's were I stopped playing with them) started to be able to connect other systems into a SP complex.

SPs used proprietary racks (called "frames") that had power distribution & the switch HW in the bottom, and up to 16 nodes in the frame. Part of the switch connections could go to other frames, so you could hook a bunch of them together. IBM officially supported some number of nodes (128? 256?) but you could go higher; that's what some of the high-end stuff in Sandia, etc., was.

Each node had up to 4 paths through the switch to get to any other node, so you had a lot of bandwidth on a continuous basis. You could run either TCP/IP or some other (don't remember what it was called) protocol over the switch.

GPFS was originally designed to handle multi-media files (video). When I used it (5 years ago) it was an additional layer that allowed SP nodes to access data from disks attached to other nodes with only a 7-instruction penalty. All of the communication between nodes went over the switch, so you really didn't care which node the data actually was on.

Of course all of this is from ancient memory, but it's probably close, and I don't feel like doing the research (this is /. after all...) http://www.ibm.com/ if you want.

*NIX Integration by Anonymous Coward · 2006-03-10 05:23 · Score: 2, Interesting

Are there open source drivers for this FS that can perhaps be integrated into Linux or the *BSD projects?

Re:*NIX Integration by Anonymous Coward · 2006-03-10 05:55 · Score: 0

Yuo fail it. (it is trolling)

Who will be the first... by Anonymous Coward · 2006-03-10 05:24 · Score: 1, Funny

to say that we can put a lot of porn in 1.6 petabytes ?

Re:Who will be the first... by Anonymous Coward · 2006-03-10 05:57 · Score: 0

We can put a lot of porn in 1.6 petabytes...

There, I said it. ...as long as it isn't HD porn.
Re:Who will be the first... by Anonymous Coward · 2006-03-10 05:57 · Score: 0

You can really put a lot of porn in 1.6 petabytes!
Re:Who will be the first... by AndroidCat · 2006-03-10 06:11 · Score: 1

If you don't stop watching pr0n at 102 Gigabytes per second, you'll go blind.

--
One line blog. I hear that they're called Twitters now.

How many by Anonymous Coward · 2006-03-10 05:24 · Score: 0

How many times is a group going to use G F and S in their filesytem name.

Theres GFS, GFS, GPFS, etc.

dammit.

Re:How many by RandoX · 2006-03-10 05:29 · Score: 1

Don't forget GFS.
Re:How many by Mr+Z · 2006-03-10 06:57 · Score: 1

And then there's GFS!

--
Program Intellivision!

Can I use it? by ShieldW0lf · 2006-03-10 05:24 · Score: 3, Interesting

Is this stuff available in a fashion where we might see it ported for use on standard x86 hardware? Is it GPL'd? I want this in my living room!

--
-1 Uncomfortable Truth

Re:Can I use it? by deviantphil · 2006-03-10 06:55 · Score: 1

A quick google search seems to indicate it is not GPL. I have not found any hard evidence it is GPL and some evidence to support it is not.
Re:Can I use it? by Anonymous Coward · 2006-03-13 02:24 · Score: 0

It isn't GPL it is a proprietary product. Refer to http://lwn.net/Articles/72894/

Fast Stuff by britneysimpson · 2006-03-10 05:27 · Score: 4, Funny

Wow that"s fast stuff, plus with the ability to slow light to save energy IBM should have some great new systems coming out!

--
Britney Simpson

So what about JFS? by kalpol · 2006-03-10 05:27 · Score: 1

What's going to happen to JFS? I was hoping it would get some attention - I am using it for my multimedia server and have been very pleased with the way it handles large DVD image files, not to mention power failures.

--
12:50 - press return.

Re:So what about JFS? by Tester · 2006-03-10 05:36 · Score: 3, Informative

GPFS is a cluster file system.. its in a completely different category.

Bad Article Title by Anonymous Coward · 2006-03-10 05:28 · Score: 5, Funny

I thought this article was going to be about IBM's HPFS from OS/2.

I'm Surprised by Nom+du+Keyboard · 2006-03-10 05:28 · Score: 4, Funny

I'm surprised that the content industries (read **AA) let them release this. After all, everyone knows that the only reason for large amounts of writable storage is to store stolen content and deprive artists of their just rewards. All things considered, I'm also surprised that IBM doesn't have to close a non-existent Analogue Hole, nor implement a Broadcast Flag to prevent the storage of infringing materials.

That aside, how do I get one for my TiVo?

--
"It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."

Re:I'm Surprised by Anonymous Coward · 2006-03-10 06:12 · Score: 0

You can't. It's a file system, not a storage device.
Re:I'm Surprised by MindStalker · 2006-03-10 06:53 · Score: 1

Shall I slap you with a trout again???
Re:I'm Surprised by Nom+du+Keyboard · 2006-03-10 08:25 · Score: 1

Shall I slap you with a trout again???
I'd rather have you fetch me a shrubbery.

--
"It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."
Re:I'm Surprised by jZnat · 2006-03-10 11:13 · Score: 1

Well, IBM is quite a mammoth of a company, so they wouldn't take that sort of shit from Double A.

--
'Yes, firefox is indeed greater than women. Can women block pops up for you? No. Can Firefox show you naked women? Yes.'

Translation: by PFI_Optix · 2006-03-10 05:30 · Score: 1, Funny

"That's nice, but will Linux run it?"

--
120 characters for a sig? That's bloody useless.

Re:Translation: by slackaddict · 2006-03-10 05:41 · Score: 2, Informative

Yes:
GPFS supports the current releases of AIX 5L and selected releases of Red Hat and SUSE LINUX Enterprise Server distributions. See the GPFS FAQ1 for a current list of tested machines and also tested Linux distribution levels.

--
ConsultingFair.com
Re:Translation: by PFI_Optix · 2006-03-10 08:48 · Score: 1

Oh wow someone with mod points needs a sense of humor.

--
120 characters for a sig? That's bloody useless.

since the /. blurb doesn't explain it... by frankie · 2006-03-10 05:31 · Score: 5, Informative

...let's see if I can, never having heard of GPFS before 10 minutes ago:

GPFS is not new; GPFS 1.0 dates to 1998
IBM is touting its latest point update, v2.3
analogy: desktop PC is to BlueGene as RAID is to GPFS cluster

It's basically data striping across 1000 disks. I suppose the hard part is coordinating all of that parallelism.

So, could someone who actually knows this stuff tell me how well I did?

Re:since the /. blurb doesn't explain it... by Amouth · 2006-03-10 05:37 · Score: 5, Funny

root@ibm$rm - r *

humm that was quick

--
'...if only "Jumping to a Conclusion" was an event in the Olympics.'
Re:since the /. blurb doesn't explain it... by dow · 2006-03-10 06:24 · Score: 2, Funny

Shouldn't that be:

root@ibm# rm -rf *

And as always on storage/bandwidth topics: the pr0n/ogg/divx potential of that thing... *sorry*
Re:since the /. blurb doesn't explain it... by cluening · 2006-03-10 06:24 · Score: 1

I don't profess to actually know much about gpfs, but I do use it on a daily basis. But, I can say that you are mostly right. Two additions: GPFS's original name was MMFS, putting it much older than 1998 (I believe). 2.3 is indeed the latest release, but we've been using it for around a year now and are up to patch level 10.

When I started playing with gpfs on our linux machines about a year ago, I got pretty angry at it pretty often (mostly because we, with other people, were making it do things that the developers were still implementing for us). It's been pretty stable for the last four to six months though and definitely lets us do some great stuff:

tg-login2:~> df -h /gpfs
Filesystem Size Used Avail Use% Mounted on /dev/gpfs 210T 137T 74T 66% /gpfs

--
Posted from the wireless couch.
Re:since the /. blurb doesn't explain it... by TRS-80 · 2006-03-10 06:24 · Score: 2, Informative

You missed the fact that GPFS is non-Free (tm):
The prices for GPFS for AIX 5L, GPFS for Linux on POWER, and GPFS for Linux on Multiplatform are based on the number of processors active on the server where GPFS is installed.
Re:since the /. blurb doesn't explain it... by cluening · 2006-03-10 06:27 · Score: 1

(Darn. Should have previewed.)
tg-login2:~> df -h /gpfs Filesystem Size Used Avail Use% Mounted on /dev/gpfs 210T 137T 74T 66% /gpfs

--
Posted from the wireless couch.
Re:since the /. blurb doesn't explain it... by Amouth · 2006-03-10 07:29 · Score: 1

i only login to my linux box about once a year as that is all that is required

and i never use that command

and i am not going to login and check it

--
'...if only "Jumping to a Conclusion" was an event in the Olympics.'
Re:since the /. blurb doesn't explain it... by flaming-opus · 2006-03-10 08:37 · Score: 2, Informative

yeah, except substitute 1000 disks with 10,000 disks. They almost certaintly are stiping across a bunch of mid-range IBM raids, each with ~100 disks, and probably getting around 1-2 GB/s.

It's also striping across many machines in a cluster. Each of those nodes maxes out at 'only' 15 GB/s of I/O, so they wire up all the nodes to a bunch of fibre channel cards, and plug them all into the raids, to distribute the I/O access to the nodes. GPFS also lets you do the I/O over the cluster interconnect, but then your interconnect bandwidth usable by the application has to compete with the filesystem traffic.

As for coordinating all the parallelism, there's a metadata node (actually a failover pair of nodes) that does the metadata operations (create, rename, remove, link) and each cluster node does file I/O directly to disk. Typically, each of the nodes write to seperate files , to avoid having to do concurrent I/O. You can have all the nodes write to different byte ranges within the same file, but you have to use special flags to enable this, and the application has to written to legitimately write to very distant parts of the file. Often it's simplest just to write to different scratch files for intermediate results, and then combind the output at the end of the run.
Re:since the /. blurb doesn't explain it... by dow · 2006-03-10 11:39 · Score: 1

The syntax police are worse than the spelling cops. Its one command I've never issued either, but thinking about it I really want to now... rm -rf /* I'm gonna have to set up a box just for the pleasure of executing that one, and being able to say "I've done it."
Re:since the /. blurb doesn't explain it... by AdamTheBastard · 2006-03-10 12:25 · Score: 1

Check out User Mode Linux, they even list trying `rm -rf /` as one of the things people use the application for.

GPFS Information and links by Anonymous Coward · 2006-03-10 05:39 · Score: 5, Informative

GPFS FAQ - http://publib.boulder.ibm.com/infocenter/clresctr/ index.jsp?topic=/com.ibm.cluster.gpfs.doc/gpfs_faq s/gpfs_faqs.html

GPFS Whitepaper - http://www-03.ibm.com/servers/eserver/pseries/soft ware/whitepapers/gpfsprimer.pdf

"GPFS is a cluster file system providing normal application interfaces, and has been available on AIX® operating system-based clusters since 1998 and Linux operating system-based clusters since 2001. GPFS distinguishes itself from other cluster file systems by providing concurrent, high-speed file access to applications executing on multiple nodes in an AIX 5L cluster, a Linux cluster or a heterogeneous cluster of AIX 5L and Linux machines. The processors supporting this cluster may be a mixture of IBM System p5(TM), p5 and pSeries® machines, IBM BladeCenter(TM) or IBM xSeries® machines based on Intel® or AMD processors. GPFS supports the current releases of AIX 5L and selected releases of Red Hat and SUSE LINUX Enterprise Server distributions. See the GPFS FAQ1 for a current list of tested machines and also tested Linux distribution levels. It is possible to run GPFS on compatible machines from other hardware vendors, but you should contact your IBM sales representative for details.

GPFS for AIX 5L and GPFS for Linux are derived from the same programming source and differ principally in adapting to the different hardware and operating system environments. The functionality of the two products is identical. GPFS V2.3 allows AIX 5L and Linux nodes, including Linux nodes on different machine architectures, to exist in the same cluster with shared access to the same GPFS file system. A cluster is a managed collection of computers which are connected via a network and share access to storage. Storage may be shared directly using storage networking capabilities provided by a storage vendor or by using IBM supplied capabilities which simulate a storage area network (SAN) over an IP network.

GPFS V2.3 is enhanced over previous releases of GPFS by introducing the capability to share data between clusters. This means that a cluster with proper authority can mount and directly access data owned by another cluster. It is possible to create clusters which own no data and are created for the sole purpose of accessing data owned by other clusters. The data transport uses either GPFS SAN simulation capabilities over a general network or SAN extension hardware.

GPFS V2.3 also adds new facilities in support of disaster recovery, recoverability and scaling. See the product publications for details2."

Re:GPFS Information and links by miller701 · 2006-03-10 07:54 · Score: 1

I thought it was one less than HPFS!

Re:Other uses by Anonymous Coward · 2006-03-10 05:39 · Score: 0

It could also allow users to access their porn collections with much greater speed and efficiency.

There's a reason they call them _peta_bytes...

GPFS is not new by flaming-opus · 2006-03-10 05:39 · Score: 2, Informative

GPFS is one of the more entrenched parallel cluster filesystems available. (others include the classic vax cluster fs, Tru64 cfs, redhat gfs, adic stornext, lustre, Sanergy, polyserve, others) GPFS has been running on IBM's high performance clusters for a decade or more. I've used it, and it's as robust as any of the others I listed above.

I'll caution everyone that you can get 100GB/s of throughput, only if you have a hundred million dollar collection of computers and disks like Livermore has.

I'd guess 10-20 GB/sec by Anonymous Coward · 2006-03-10 05:40 · Score: 0

Not too shabby.

But then some noob application programmers will do something stupid like use C++ streams to do IO and give it all back.

You can't drive a system at its design limits without coding to the hardware design.

So will this mean cheaper storage costs by zenst · 2006-03-10 05:41 · Score: 2, Interesting

Will this mean that you can share storage more easily, maybe. It certainly seems to reduce sharks/ESS into an expensive interface for attaching discs (but there again there just a load of discs with a AIX box or 2 and SSA adapters to conenct the discs anyhow).

Given the managment/maintenance levels of discs wil be more intergrated and distrubutable with this I cant help but think that OS/features and the trend in (and rightly so) resiliance,easy and sharing resources approach towards what Plan 9 was setout to be.

The more we move on the more we seem to get towards the lego-type appraoch to IT were you can just buy another box of bricks and add on and keep your older bricks instead of throwing the whole lot out and/or hacksawing the end of a brick of and gluing it onto the side of....

Storage wise this is a nice step forwards and having worked on AIX and its many filesystems and managment tools and the ease of getting the job done with the option to get clever if you wish (you chose and not forced) this looks funky albeit its RAID for SAN's in a way.

What I realy want is a FS that will propergate automaticaly and resiliantly in a way that accomodates network diversaty already and I still come down to me wanting, what is all intent a filesystem sat on a database sat on a p2p network, alas atm performance would suck, least today but you know how long code takes to get right and how fast hardware moves - remember alot of code in windows XP has origins to when it was written on a humble 386 cpu if not lower.

What this does show is how netowrk/storage interfaces have moved forward and I/O requests dont hammer CPU's as much as they used to, getting there :).

Re:So will this mean cheaper storage costs by Apparition-X · 2006-03-10 07:44 · Score: 1

Will this mean that you can share storage more easily, maybe. It certainly seems to reduce sharks/ESS into an expensive interface for attaching discs.

Well, it is one thing to send output from thousands of nodes to thousands of nodes, and achieve very high performance, as seems to be the case here. It is another to send output from a single node to a single node as you describe, where one node is a storage controller (ESS, whatever) and the other is a large database server that is probably not running a file system at all, and may be generating as many (small) I/O requests as hundreds of those nodes in the first case are. Add on top of that resiliency, replication, caching, etc. and there is a great difference between a massively (horizontally) scaleable GPFS and a masssively (vertically) scaleable like a Shark. All of which is not necessarily to say that the two approaches won't merge at some point, but I suspect that point is quite far off--say 10 or 20 years--given the very different requirements that the two approaches attempt to meet.
Re:So will this mean cheaper storage costs by zenst · 2006-03-10 08:17 · Score: 1

ESS/Sharks have multi pathed access to discs (least if configured correctly) and general resiliantly cabled to for all effect AIX box;s inside the unit (2 generaly). As such there already multi-noded internaly and just present a single node with regards to storage. You are right with regards to cacheing but (prolly have CACHEFS on the shark/ESS's aix box's running :) that can be accomodated and indeed negated with regards to thruput of access if the base FS more than keeps up with the app, or localised caching.

Its all a means to an end, just nice to see the end take more shape in the distance ;).

If anything this will help shark/ESS/Storage array sales as you get nice support contracts on those and managers like not relying upon internal resources for potential shitfan situations were possible.

Besides they can also moti hone to various nodes and ironicaly not seen a ESS/shark not attached to at least two external application box's, nor do I expect to. But they are out there.

But FS managment short/long and now term on kit may just get better and I'm sure even the M$ junkies are drowling over the prospects of not having to archive there PST files due to any form of FS restrictions :D.

You can have mine by LunaticTippy · 2006-03-10 05:42 · Score: 0, Offtopic

It's only 5mb, but it was plenty for my ST.

SCSI interface, you might be able to upgrade it to 25.

I'm not using it anymore...

--
Man, you really need that seminar!

If only Google supported the community by Anonymous Coward · 2006-03-10 05:43 · Score: 0

They would have released their Google File System.

nah..... by Anonymous Coward · 2006-03-10 05:44 · Score: 0

Doesnt seem revolutionary if its going to be proprietary like that. I'm more impressed with XFS. At least its usable for all Linux users. Oh, and it rocks, too.

Tech details by MasterC · 2006-03-10 05:48 · Score: 2, Informative

The article, as usual for news stories, are lacking juicy tech details. Here's some I found:

The article says 102 GB/s transfer. This PDF about the ASC Purple says they have 11,000 SATA & fiber channel disks (amongst other neat stats). So cursory math says that's about 10 MB/s from each disk.

My question is how useful is that transfer? Pulling in at 102 GB/s is fast and all, but if you can't consume it then it's just ego boosting. What kind of useful data transfer can you do on it? Surely it's for parallel processing (ASC = Advanced Simulation & Computing) of some kind so can this parallel app handle 102 GB/s collectively?

--
:wq

Re:Tech details by MasterC · 2006-03-10 05:50 · Score: 1

Crap. Teach me for not scouring my preview before submitting. Here's the PDF I intended to link to:

http://www.llnl.gov/asc/platforms/purple/sc2005-pu rple.pdf

--
:wq
Re:Tech details by Anonymous Coward · 2006-03-10 06:04 · Score: 0

I think the big usage for this is for supercomputer-type stuff, like simulations and experiments. You may have some high-energy physics experiment that generates data at 100GB/s, which will have to be stored, then analyzed by your supercomputer. In this case you will probably only spend a few minutes collecting the data, but then next few months analyzing it.

dom
Re:Tech details by Helios1182 · 2006-03-10 06:13 · Score: 2, Insightful

I think your last sentence hit it. There are groups producing huge amounts of data that needs to be stored then processed. What is the point in having 10,000 CPUs crunching numbers only to have the system I/O bound by the hard disk? Memory is still a couple orders of magnitude behind hard drives in size so they have to cache data on the disk at some point.
Re:Tech details by AchilleTalon · 2006-03-10 06:15 · Score: 1

I'm pretty much sure this filesystem capacity can be consume easily by pushing an order of magnitude or two some nifty simulations. So, that's not just ego boosting.
Just to give an idea, when LHC will turn into operation at CERN, it will produce data at a rate equivalent to the whole EU telephony/data network capacity. And, this only part of the story. Since you have to analyze data, compute, compare, etc. You need to be able to move it fast between processors.
Imagine nuclear weapons simulations, hurricanes simulations, global warming model simulations, etc.

--
Achille Talon
Hop!

unit correction by psbrogna · 2006-03-10 05:52 · Score: 2, Informative

petabyte !== 1,024 terabytes

petabyte == 1,000 terabytes

ref: http://en.wikipedia.org/wiki/Petabyte

Kibibytes is just so much more fun to say. Especially when it leads to "kibbles & bits."

Re:unit correction by Anonymous Coward · 2006-03-10 06:06 · Score: 0

petabyte !== 1,024 terabytes
petabyte == 1,000 terabytes

You're just a tool for the evil misleading hard drive industry! ... and the USB thumbdrive industry ... and the network card industry ... and the DVD industry ... and the CD industry ... and half of the floppy industry (go find out what "1.44 megabytes" actually means -- it's hilarious) ...

You know, now that I think about it, pretty much the only places where the "good, churchgoing, non-evil, right" definition is used is memory and operating systems. Hmm.
Re:unit correction by Anonymous Coward · 2006-03-10 08:08 · Score: 0

Dude this is Computer Science
1024 bytes = 1 Kilobyte

KB at least forms a decent approximation of 1000 bytes.

Work on the more horrendous parts of CS nomenclature before messing with some trivial part.

Try fixing Computer Science (doesn't even remotely resemble a science) or Software Engineering (programming software isn't an engineering discipline) first.
Re:unit correction by Anonymous Coward · 2006-03-10 09:11 · Score: 0

Yes - totally screw that!!!!

Engineers use powers of 2
Marketing fucks uses powers of 10 to make their packages look bigger!!!

Re:Other uses by Anonymous Coward · 2006-03-10 05:53 · Score: 0

that would help with porn; If I can't get it to slideshow through enough pictures within 10 seconds, it's over. Less time if i already have my pants down.

It's not whether it runs Linux... by 1.21GW · 2006-03-10 05:53 · Score: 0

but does it run DOOM?

GPF - S? Peta-files by Anonymous Coward · 2006-03-10 05:54 · Score: 0

What is GPFS -- is it a General Protection Fault System? Wasn't that fixed with Windows 95?

Does the system that stores petabytes of data store them in Petafiles or Pedafiles?

binary prefixes by Lord+Ender · 2006-03-10 06:03 · Score: 4, Insightful

The submitter and editors need to learn their numeric prefixes. Come on! This web site is supposed to be for people who understand computer technology!

A petabyte == 1000 terrabytes
A pebibyte == 1024 terrabytes

Please see the NIST definition page:
http://physics.nist.gov/cuu/Units/binary.html

--
A slashdotter who didn't build his own computer is like a Jedi who didn't build his own lightsaber.

Re:binary prefixes by Surt · 2006-03-10 06:20 · Score: 1

The baffling question is: did the article submitter get it wrong, slashdot get it wrong, IBM get it wrong, did you mean 1024 tebibytes, or possible 1024 terabytes, or .... ?????

The plethora of SI prefixes gets more and more confusing. And remember, not everyone has or is in any way bound to adopt the NIST convention, after all megabyte = 1024 kilobyte was around and in use long before nist got into the act!

--
"Who is the Journal of Quantum Physics going to believe?" --Stephen Hawking
Re:binary prefixes by Anonymous Coward · 2006-03-10 06:25 · Score: 0

It's *so* fun to correct nitpickers!

A pebibyte is 1024 TEBIbytes, not terabytes.

Do everyone a favor and call them 2^50-bytes. Scientific notation is used with good reason, there are just too many words for powers of thousands/kibibytes to keep them straight.
Re:binary prefixes by Richard+Steiner · 2006-03-10 06:30 · Score: 4, Informative

The new SI prefixes are nice and all, but there are three or four decades of prior usage that have to be unlearned before some of us will use them intuitively. Or at all. :-)

Context-sensitive conversion of SI prefixes isn't all that difficult. Really. It's commonly understood that data is stored in powers of 2, and the subject is only relevant if (1) you're a sales type, or (2) you are being overly pedantic about an unwanted and unneeded SI standard.

--
Mainframe/UNIX Bit Twiddler and long time Windows/Linux Hobbyist.
The Theorem Theorem: If If, Then Then.
Re:binary prefixes by Anonymous Coward · 2006-03-10 06:33 · Score: 0

Someone please give this man a beer.
Re:binary prefixes by hswerdfe · 2006-03-10 06:45 · Score: 1

yes well
1000 Meter = 1 KiloMeter

was around long before 'megabyte = 1024 kilobyte'
the prefix's mean something.
just cause some CS tard decided that 1024 is close enough, and that base 2 is easier then base 10
does not make it correct

--
--meh--
Re:binary prefixes by Surt · 2006-03-10 06:54 · Score: 1

Likewise, just because some self righteous officiant at the BWM thought that doing everything in multiples of 10 would be a great idea doesn't make it 'right' either. No side is 'right' on this issue, it's all opinion.

--
"Who is the Journal of Quantum Physics going to believe?" --Stephen Hawking
Re:binary prefixes by slavemowgli · 2006-03-10 07:04 · Score: 1

Ouch. First of all, it's "tera", not "terra"; and second, 1 pebibyte = 1024 tebibyte, not 1024 terabyte (and if you think that that's a difference that doesn't matter, why are you complaining about the confusion of pebibyte and petabyte?).

--
quidquid latine dictum sit altum videtur.
Re:binary prefixes by Anonymous Coward · 2006-03-10 07:05 · Score: 0

According to your website, a terabyte is spelled with one 'r' and 1 pebibyte is 2^50/10^12 = 1125.9 terabytes.

Have a nice day.
Re:binary prefixes by Mr+Z · 2006-03-10 07:09 · Score: 1

It also doesn't help that the new binary prefixes make the speaker sound like an imbecile if you try to say the units aloud, at least to my American ears.

--
Program Intellivision!
Re:binary prefixes by Anonymous Coward · 2006-03-10 07:31 · Score: 0

Centuries of prior use trumps decades of misuse.
Re:binary prefixes by Kjella · 2006-03-10 07:43 · Score: 1

A petabyte == 1000 terrabytes
A pebibyte == 1024 terrabytes

ROFL. Not to mention "terra" is Earth, "tera" is SI. One of many issues with the new names is that they sound like complete and utter crap. I'll never ever move away from mega-, giga- and terabytes. I abbriviate them correctly with the i's (MiB,GiB,TiB) and for anal people I'd specify it as "decimal *-byte" and "binary *-byte" or just give it in raw bytes. But those names.... OMG what nerd came up with those?

--
Live today, because you never know what tomorrow brings
Re:binary prefixes by Anonymous Coward · 2006-03-10 08:09 · Score: 0

Funny, to me Americans sounds like imbeciles when they measure something in miles. SI was using powers of 10 before you were born.
Re:binary prefixes by Anonymous Coward · 2006-03-10 08:11 · Score: 0

Meh. The British still use miles also.
Re:binary prefixes by Anonymous Coward · 2006-03-10 08:19 · Score: 0

Just because you don't want to follow standards doesn't mean you have the right to change what they mean. It's commonly understood that kilo translates into x10^3.
Re:binary prefixes by duerra · 2006-03-10 08:43 · Score: 1

But those names.... OMG what nerd came up with those?

Yeah, I agree. "bi"? Maybe the guy was confused.

It would have been so much easier just to change the "a" to an "i" - petibyte instead of petabyte.

--
A community-oriented lyrics site
Re:binary prefixes by Lord+Ender · 2006-03-10 09:50 · Score: 1

You're right. I misspelled "tera." That doesn't change my point at all. But thanks for pointing that out.

--
A slashdotter who didn't build his own computer is like a Jedi who didn't build his own lightsaber.
Re:binary prefixes by Richard+Steiner · 2006-03-10 11:39 · Score: 1

The SI standard didn't exist even ten years ago, so who is changing whose existing practices?

It is commonly understood that a "kilobyte" refers to 1024 bytes in all but a hard disk sales context. No offense intended, but if that isn't obvious to you, you must be very VERY new to the computer industry...

--
Mainframe/UNIX Bit Twiddler and long time Windows/Linux Hobbyist.
The Theorem Theorem: If If, Then Then.
Re:binary prefixes by DeathFromSomewhere · 2006-03-10 11:56 · Score: 1

Are you kidding me? SI is only 10 years old? I suggest you read the history section.

--
-1 overrated isn't the same thing as "I disagree".
Re:binary prefixes by vidarh · 2006-03-10 13:14 · Score: 1

He presumably referred to using "SI-style" prefixes to "byte". However that isn't part of the SI standard at all, as byte isn't an SI unit.
Re:binary prefixes by Richard+Steiner · 2006-03-13 04:30 · Score: 1

I was referring to the base-2 SI-style prefixes being discussed, of course. Sheesh. Pay attention to context.

--
Mainframe/UNIX Bit Twiddler and long time Windows/Linux Hobbyist.
The Theorem Theorem: If If, Then Then.

Google File System by TubeSteak · 2006-03-10 06:05 · Score: 1

We have designed and implemented the Google File System, a scalable distributed file system for large distributed data-intensive applications. It provides fault tolerance while running on inexpensive commodity hardware, and it delivers high aggregate performance to a large number of clients.
...
... The largest cluster to date provides hundreds of terabytes of storage across thousands of disks on over a thousand machines, and it is concurrently accessed by hundreds of clients.

If it's scalable, there's no reason it couldn't scale up to 1.6 petabreads. And the fact that it runs on commodity (cheap) hardware means that you don't need "a hundred million dollar collection of computers and disks like Livermore has".

If not... what's the key difference between the two?

--
[Fuck Beta]
o0t!

Re:Google File System by InsaneGeek · 2006-03-10 06:23 · Score: 2, Insightful

If not... what's the key difference between the two?

When you care about throughput as well as capacity.
Re:Google File System by Wesley+Felter · 2006-03-10 07:46 · Score: 1

GoogleFS does not support Unix semantics, so if you mounted it (which you can't anyway) some apps would not behave correctly. Also, GoogleFS uses smart storage server nodes, while GPFS runs on a block-based SAN. Also, you can buy GPFS.

1.6 petabytes isn't that big a deal by jm91509 · 2006-03-10 06:05 · Score: 4, Informative

ZFS from Sun is 128-bit. According to this guy
thats a whole load of data:

"Although we'd all like Moore's Law to continue forever, quantum mechanics imposes some fundamental limits on the computation rate and information capacity of any physical device. In particular, it has been shown that 1 kilogram of matter confined to 1 liter of space can perform at most 1051 operations per second on at most 1031 bits of information [see Seth Lloyd, "Ultimate physical limits to computation." Nature 406, 1047-1054 (2000)]. A fully-populated 128-bit storage pool would contain 2^128 blocks = 2^137 bytes = 2^140 bits; therefore the minimum mass required to hold the bits would be (2^140 bits) / (10^31 bits/kg) = 136 billion kg.

That's a lot of gear."

Re:1.6 petabytes isn't that big a deal by mungtor · 2006-03-10 06:30 · Score: 1

Yeah, but it's from Sun which means that it's evil and won't get a front page mention here.
Re:1.6 petabytes isn't that big a deal by Bromskloss · 2006-03-10 06:31 · Score: 1

1 kilogram of matter confined to 1 liter of space can perform at most 1051 operations per second on at most 1031 bits of information

It's an interesting subject you bring up. (Not being up to speed with it yet, but the knowledge maniac within will make me at some point.) It seems to me like very few operations and very few bits, doesn't it? Would energy supply have to be included? (Like you can lower the entropy of a system if you add external energy.) Even with that, couldn't a battery powered computer within those constraints beat that, or am I totally of scale here?

--
Swedish plasma phys. PhD student; MSc EE; knows maths, programming, electronics; finance interest; seeks opportunities
Re:1.6 petabytes isn't that big a deal by Anonymous Coward · 2006-03-10 06:33 · Score: 0

Your quote is poorly formated. The article mentions that computer, in the current engineering practices, should have a maximum rate of 5.4258e50 operations per second. Here is a better formatted quote:

... If, in addition, we make use of the natural electromagnetic interactions between nuclei and electrons in the matter to perform logical operations, we are limited to a rate of 10^15 operations per bit per second, yielding an overall information processing rate of 10^40 operations per second in ordinary matter. Although less than the 10^51 operations per second in the ultimate laptop, the maximum information processing rate in 'ordinary matter' is still quite respectable. Of course, even though such an 'ordinary matter' ultimate computer need not operate at nuclear energy levels, other problems remain -- for example, the high number of bits still indicates substantial input/output problems. At an input/output rate of 10^12 bits per second, an Avogadro-scale computer with 10^23 bits would take about 10,000 years to perform a serial read/write operation on the entire memory. Higher throughput and parallel input/output schemes are clearly required to take advantage of the entire memory space that physics makes available.

taken from http://www.nature.com/nature/journal/v406/n6799/fu ll/4061047a0_fs.html
Re:1.6 petabytes isn't that big a deal by Anonymous Coward · 2006-03-10 06:45 · Score: 0

My computer is faster than 1khz! My computer breaks physics!
Re:1.6 petabytes isn't that big a deal by rkww · 2006-03-10 07:00 · Score: 2, Informative

1051 operations per second on at most 1031 bits

That'll be 10^51 and 10^31...
Re:1.6 petabytes isn't that big a deal by Cyno · 2006-03-10 13:50 · Score: 1

By those calculations your logic is flawed. The quote, "1 kilogram of matter confined to 1 liter of space can perform at most 1051 operations per second", is talking about computational limits, not storage densities.

That's like assuming you're going to store the entire capacity of ZFS on a 136 billion kg stack of multi-core CPUs.

Brilliant!

I only know what IBM have published by jd · 2006-03-10 06:07 · Score: 1

But you sound right to me. Having said that, I would have absolutely no objection to IBM porting support for ultra-parallel RAID to Linux. In fact, there are probably a number of areas in the kernel that they could use their experience in parallel architectures to tighten up on.

Since GPFS is basically RAID on speed, it should be easy for IBM to write a wrapper for Linux that would allow you to read/write GPFS, without needing to port GPFS per-se. As IBM sells Linux-based machines, being able to access GPFS partitions would seem "obvious", but I could understand them wanting to keep the best-of-the-best for systems they make more money off of.

--
It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)

Re:I only know what IBM have published by Shisha · 2006-03-10 06:20 · Score: 1

But you sound right to me. Having said that, I would have absolutely no objection to IBM porting support for ultra-parallel RAID to Linux. In fact, there are probably a number of areas in the kernel that they could use their experience in parallel architectures to tighten up on.

NOOOO!!!! You've just finally provided SCO with the evidence it needed! Filesytems were used in UNIX and SCO owns everything UNIX related. Now they know that IBM could maybe consider integating, ehm, UNIX technologies, we mean UNIX source code into Linux.

Comparisons to other Parallel/Clustered FS? by soldack · 2006-03-10 06:08 · Score: 1

It would be nice to see comparisons to RedHat/Sistina's GFS, Lustre (backed by HP), and others listed here.

Also how does this compare to clustered storage that is not run on the hosts themselves like NetApp new Spinnaker based clustering. You also have folks like Isilon, Panasas, and Terrascale.

Anybody have an good data on this?
-Ack

--
-- soldack

Re:Comparisons to other Parallel/Clustered FS? by Rheingold · 2006-03-10 07:21 · Score: 1

For that matter, how does it compare with Tivoli TotalStorage SAN Filesystem, which seems to be another shared-storage filesystem from IBM/Tivoli? Trying to read IBM's descriptions is an exercise in marketing-fluff cryptography.

--
Wil
wiki
Re:Comparisons to other Parallel/Clustered FS? by hitchhacker · 2006-03-10 08:13 · Score: 1

don't forget about the Parallel Virtual File System (PVFS)

-metric
Re:Comparisons to other Parallel/Clustered FS? by dwater · 2006-03-10 13:06 · Score: 1

...hrm...I seem to remember a company called Silicon Graphics, or was it SGI...they had a great filesystem back when they were still in business - cxfs or something, IIRC. Oooh, they left their web site running - must be by mistake since I'm sure they went out of business a while ago...

http://www.sgi.com/products/storage/tech/file_syst ems.html

"...architected to address single files as large as 9 million terabytes..."

"...and filesystems as large as 18 million terabytes...".

the only performance numbers I could find are :

http://www.vets.ucar.edu/Reports/CXFSPerformance/i ndex.html

--
Max.

Windows Crash Error by c0d3r · 2006-03-10 06:10 · Score: 1

Sounds like a windows crash error:

GPF Fault Error.

I don't get it by TheSkyIsPurple · 2006-03-10 06:12 · Score: 2, Funny

So, the important question: How many Libraries of Congress is that per second?

We need a common benchmark by Linker3000 · 2006-03-10 06:12 · Score: 4, Funny

Typical porn movies per hour (TPMH)??

--
AT&ROFLMAO

Re:We need a common benchmark by sunwukong · 2006-03-10 08:17 · Score: 1

How does fast forwarding through the "dialogue" affect the benchmark?
Re:We need a common benchmark by noz · 2006-03-10 14:01 · Score: 1

"Typical porn movies per hour (TPMH)??"
The "insightful" mod of parent is scary, but not unexpected.

SCREW THAT!!! ;-) by Ossifer · 2006-03-10 06:12 · Score: 3, Insightful

Do you even read your own links?

the exact number in common practice could be either one of the following:

1,000,000,000,000,000 bytes -- 1000^5, or 10^15.
1,125,899,906,842,624 bytes -- 1024^5, or 2^50.

Real geeks use powers of two; powers of ten we're only introduced for marketing purposes, which real geeks eschew.

Re:SCREW THAT!!! ;-) by Anonymous Coward · 2006-03-10 06:50 · Score: 0

Real geeks reserve powers of two for subjects that are inherently powers of two, and use the proper names for them: KiB, MiB, etc.

Most things, even in computers, are not naturally powers of two. Such as disk capacities. Real geeks understand that the metric prefixes have meant powers of ten since shortly after the French Revolution. Real geeks have been exposed to science and engineering, where you learn the proper terminology for units.

Hacks who write for magazines and AOL chatroom moderators get confused by this stuff, though.
Re:SCREW THAT!!! ;-) by hswerdfe · 2006-03-10 06:52 · Score: 1

base 10 comes from SI units.
base 2 was used because it was easier to count with.
and some jerkwad decided that 1024 is close enough.

--
--meh--
Re:SCREW THAT!!! ;-) by Hydrophobia · 2006-03-10 07:49 · Score: 1

Base 2 was used because that is the language computers speak in, its called binary. Welcome to Computer science.
Re:SCREW THAT!!! ;-) by Ossifer · 2006-03-11 07:55 · Score: 1

Most things, even in computers, are not naturally powers of two. Such as disk capacities.

Au contraire, mon frère... Most things in computers ARE involving powers of two, especially disk capacities. Specifically, they are multiples of powers of two, or directly powers of two themselves. In the celebrated case of the hard drive, the sector size (the smallest division of the disk) is usually 512 or 2048 (2^9 or 2^11). My so-called 160 gigabyte hard drive really stores 160041885696 bytes, which is a multiple of 512, it's sector size. It's really a 149+ gigabyte drive, but hey, who wants a 149 gig drive when you can get a 160 gig'er for the exact same price? (for the exact same drive...)

Darn! by tomcres · 2006-03-10 06:17 · Score: 0, Redundant

I thought this was going to be about OS/2's HPFS! You don't see too many technical articles on OS/2 anymore... bummer! :-(

Re:Darn! by Richard+Steiner · 2006-03-10 06:36 · Score: 1

Me too, but HPFS was a Microsoft filesystem, anyway. So why did they drop it in favor of crap like FAT32 and NTFS? :-)

--
Mainframe/UNIX Bit Twiddler and long time Windows/Linux Hobbyist.
The Theorem Theorem: If If, Then Then.

Well, ... by wasatched · 2006-03-10 06:19 · Score: 2, Funny

... 1.6 PB ought to be enough for anybody.

Available now. by Kadin2048 · 2006-03-10 06:20 · Score: 1

GPFS (apparently -- I know only of what I've learned in the last few hours) is available for Linux, from IBM, right now.

Some people further up in the discussion have warned however that it's not as stable on Linux as it is on AIX, which is really its native platform.

From IBM's page on GPFS:

"GPFS is available as:
* GPFS for AIX 5L on POWER(TM)
* GPFS for Linux on IBM AMD processor-based servers and
IBM eServer® xSeries®
* GPFS for Linux on POWER"

--
"Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."

Bad Experience with GPFS by localman · 2006-03-10 06:31 · Score: 3, Interesting

We used GPFS in our production environment for about 9 months in 2004/2005. We chose it specifically because it allowed several machines to share the file system (like NFS) but with file locking. It was also supposed to be very fault tolerant with no single point of failure. We set it up using a fiberchannel SAN.

Unfortunately we had a lot of problems with it. For one, performance was quite bad in ceratin cases... doing an ls in a large directory would take a very long time. Doing finds would take a very long time. Once you had a specific file you wanted, opening and reading it was reasonable (though all disk ops were still on the slow side), but multi file operations lagged on the level of 10s of seconds or more. I think it was having to issue network checks to every machine in the set for each file or something.

Also, the CPU usage was very high across all our machines, primarly from lock manager communications. It really taxed the system. And perhaps worst of all, it would caused crashes sometimes. A single machine in the set would die (usually a GPFS assert), and though that didn't break the set permanently, a multi-minute freeze on all disk reads would take place until the set determined the machine was unavailable. We spoke with IBM about all this stuff... provided debugging output and everything, we used the latest patches. But we never got the issues resolved. It was a very rough few months indeed. I probably averaged 4 hours sleep per night.

When I say "slow" what am I comparing it to? In the end we switched to NFS and we came up with a somewhat clever way to avoid the need for file locking. NFS used the same SAN hardware, but had a single point of failure: the head server. We doubled up there with warm failover. The load on all servers dropped dramatically (I'm talking from ~40 load to ~.1 load). Disk operations were orders of magnitude faster. And we've not had a single NFS related lockup or failure in the past year and a half *knocks on wood*.

Anyways -- GPFS probably has some good uses. But I would not recommend it for a very high-volume (lots of files, lots of traffic) mission critical situation. Unless they've made some major improvements.

Cheers.

Re:Bad Experience with GPFS by isj · 2006-03-10 07:03 · Score: 1

From my limited knowledge of GPFS my guess is that GPFS is slow at metadata operations (opening files, listing directories, updating last-changed-date, ..) but lightening fast for I/O once you have the file open.
Re:Bad Experience with GPFS by localman · 2006-03-10 11:42 · Score: 1

Yeah, that lines up pretty well with our experience. So it might work great for a shared disk at a video editing shop (few large files) but for our application it was a problem. The lockups were still a concern in any case, but perhaps this was something that only showed up at high transaction rates too.

Cheers.

Function of Purple by Kadin2048 · 2006-03-10 06:31 · Score: 2, Informative

The intended purpose of ASC Purple is nuclear weapons simulations.

Since they can't actually do tests, either aboveground or below, by treaty anymore, they do simulations instead. I assume these have something to do with modeling how radioactive decay affects the weapons' usability and yield over time (since I don't think they're really in the business of designing new toys, but who knows really), so that you know that a bomb is going to go "pop" instead of "fizzle" when you want it to.

I'd imagine that those kinds of simulations could easily produce tera- and petabytes of data, when run with the sort of precision and initial conditions that LLNL probably wants to use.

I think BlueGene/L (No. 1 on the list of top supercomputers, Purple is 3) is used for the same purpose. Or at least, that was their reason/excuse for purchasing it; exactly what they do with it every day is anybody's guess.

--
"Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."

Technical details by Anonymous Coward · 2006-03-10 06:46 · Score: 0

Much better info here (pdf).

No, the limits are much higher than that by FreeUser · 2006-03-10 06:50 · Score: 4, Informative

"Although we'd all like Moore's Law to continue forever, quantum mechanics imposes some fundamental limits on the computation rate and information capacity of any physical device. In particular, it has been shown that 1 kilogram of matter confined to 1 liter of space can perform at most 1051 operations per second on at most 1031 bits of information

Um, no, that's wrong.

Bremmermann's Limit is the maximum computational speed in the physical universe (as defined by relativity and quantum mechanical limitations) and is approximately 2 x 10^47 bits per second per gram (or, for those who prefer sexagesimal, one jezend, 60^11, bits per second per gram).

Bousso's covariant entropy bound also called the holographic bound is a theoretical refinement on the Bekenstein Bound that may define the limit of how compact information may be stored, based on current understanding of quantum mechanical limits, and is theorized to be equal to approximately one yezend (60^37, or ~10^66) bits of information contained in a space enclosed by a spherical surface of 1 sq. cm.

Given this, 1 kg of matter can perform approximately 2 x 10^50 bit operations per second per kilogram, in a space much smaller than 1 liter of space. Of course, other physical constraints (non-quantum related) probably limits us to a couple of orders of magnitude less computation, in a couple of orders of magnitude more space, but of course what those limits might be is very speculative

--
The Future of Human Evolution: Autonomy

Re:No, the limits are much higher than that by Cyno · 2006-03-10 14:13 · Score: 1

I have never heard of sexagesimal, or babylonian numerals, those links were fascinating..

but some corrections, qazend: 60^11 is 3e19, jezend: 60^26 is 1.7e46, lezend: 60^27 is 1e48
Re:No, the limits are much higher than that by Kaenneth · 2006-03-10 18:27 · Score: 1

Yes you have...

How many Seconds in a Minute?

How many Minutes in a Hour?

1.6 petabytes is overkill ... by DrJimbo · 2006-03-10 06:54 · Score: 2, Funny

640 terabytes should be enough for anybody.

--
We don't see the world as it is, we see it as we are.
-- Anais Nin

The marketing geniuses at IBM strike again!! by The_REAL_DZA · 2006-03-10 06:59 · Score: 1

Surely I'm not the only one who sees "GPF" and thinks "General Protection Fault"?!

First it was OS2 (OS 2 ? does the "2" stand for 2nd rate? Is it your 2nd attempt? Is it just a big piece of "#2"?) and now it's this. Don't get me wrong, I think their products are great, but I really think they'd have a hard time marketing air on the Moon!
(Slightly more) seriously, IBM could stand to hire the same marketing folks the beer companies hire...Especially since their markets overlap so much.

--

This space intentionally left (almost) blank.

Re:The marketing geniuses at IBM strike again!! by Mostly+a+lurker · 2006-03-10 07:20 · Score: 1

... they'd have a hard time marketing air on the Moon!
The question is, given the somewhat limited short term potential client base, would they try?
Re:The marketing geniuses at IBM strike again!! by Rheingold · 2006-03-10 07:31 · Score: 1

I ran into this the other day trying to search for discussions of it... GPFs is overwhelmingly used as the plural for General Protection Fault...

--
Wil
wiki

Petabreads? Or is that pita bread? by Mr+Z · 2006-03-10 07:04 · Score: 1

Stop it. You're making me hungry for Mediterranian food.

--
Program Intellivision!

Sign of the times by Indigo · 2006-03-10 07:06 · Score: 1

Ok, color me cynical. The first thing that came to mind when I saw this was gee, just what the NSA needs to help them process their enormous collection of data on the day-to-day lives of law abiding American citizens...

Re:Sign of the times by Anonymous Coward · 2006-03-10 07:57 · Score: 0

Nobody is an innocent law abiding citizen anymore.
They've set up such complicated overlapping and generalized or vague laws that it's impossible to not be breaking something anymore.
Heck, even that b.s. with 'suspect' whatever is trouble for anyone they like.
"Why did you take nanna away in that black van?"
"She's a suspected terrorist."
"Huh? Why is she suspected?"
"Because I find her suspicious."
"When will she be back?"
"When and if we release her from the undisclosed holding area for her to either go to trial or home, whichever we decide."
"When will that be?"
"When we decide to."
"I'm gonna call daddy and tell him."
"No you're not little girl, it is illegal for you to discuss this with anyone. Hmmm, come to think of it, you were discussing it with me before I told you it was illegal. You are under arrest!"
"I didn't know! I'm only 5 years old!"
"Ignorance of my law is no excuse, neither is age! Central, we need another raven-taxi out here, with extra small restraints..."

Anyway, back to the main thread, if you take a mega freaking ultra box and make it do something faster than everyone else, so what. That's like saying a jetboat can go faster than a guy in a peddlepaddle. I just don't find it impressive. Now get those performance numbers on an 'average' small business rig, then I'll be drooling.

Chuck Norris by City+Jim+3000 · 2006-03-10 07:06 · Score: 3, Funny

Chuck Norris penis is so big that 1.6 petabyte can only store 4 seconds of Chuck Norris porn.

Re:Chuck Norris by geekoid · 2006-03-10 07:15 · Score: 3, Funny

Which is more then any of us deserve to see.

--
The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect

I speek for thousands of nerds when I say by geekoid · 2006-03-10 07:11 · Score: 1

Fuck you.

--
The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect

Re:I speek for thousands of nerds when I say by Anonymous Coward · 2006-03-10 07:18 · Score: 0

In that case, you made thousands of nerds look like idiots by spelling speak incorrectly.

if GPFS is so good why doesnt IBM use it? by Anonymous Coward · 2006-03-10 07:22 · Score: 0

If GPFS is so great my did IBM put Luster on Blue Gene and not GPFS.

my gpfs problem by krismon · 2006-03-10 07:22 · Score: 2, Informative

We ran GPFS for about 10 months. It's great for it's primary purpose, and it was pretty stable on Linux, though we had a crash or two... but the biggest problem we ran across was with large number of files. We had > 150 million small files in 10000 directories, and gpfs couldn't handle the load. I'm sure with a smaller number of files, our experience would have been very different. Waiting 10 minutes for an ls in a directory wasn't really what I considered fun. :)

Re:my gpfs problem by Anonymous Coward · 2006-03-10 11:05 · Score: 0

So, uh, why didn't you come up with a slightly more intelligent way to lay out your directory structure? 150 million+ files in 10,000 directories == clusterfuck. I can't imagine a conventional filesystem handling that....
Re:my gpfs problem by Rick+Genter · 2006-03-11 11:31 · Score: 1

Actually, at one point I did an experiment on OpenBSD 3.3 where I created a directory structure of 9 million small files (each 1K). The structure used GUIDs for the filenames with the first 3 digits of the GUID determinining the directory into which the file was stored. Thus, if the file was named 12345678-9ABC-DEF0-1234-567890ABCDEF, the file was stored in directory 1/2/3.

It took a 1.47GHz Athlon with 128MB of RAM 17 minutes to delete the hierarchy. The HDs were basic Maxtor ATA-100s (60GB, as I recall) in a RAID-1 configuration.

I did a similar test on Windows XP Professional using NTFS on a single HD (again, a Maxtor ATA-100 - I think it was a 100 GB). It took a 2.7GHz Celeron with 1GB of RAM 48 hours to delete the hierarchy.

Draw your own conclusions.

--
Don't underestimate the power of The Source

Most of all... by drinkypoo · 2006-03-10 07:31 · Score: 2

...the title of the story submission is INCREDIBLY STUPID. Why? Because the Filesystem in OS/2 is called HPFS, which stands for "High Performance File System". Anyone who knows more than what they read this week knows this already, and was expecting an article on HPFS from the title (until they saw the blurb.)

Further evidence that "editor" is a misnomer 'round these parts.

--
"You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"

Re:Most of all... by arodland · 2006-03-10 13:30 · Score: 2

Not to mention that the summary text is really nothing than some numbers taken out of context with absolutely no practical meaning. Really, what the fuck, Zonk? This is your idea of a "story"? It's more like a fourth grader's idea of a "science report".

NTFS by Jaime2 · 2006-03-10 07:34 · Score: 2, Informative

NTFS has supported 16 exabytes since 1993. That's about 10,000 larger than this new system. I'm not saying that NTFS is great or that IBM's accomplishment is small. But the submitter really shouldn't have said that a 1.6 petabyte filesystem is anything to write home about. Most likely every modern filesystem is at least 64 bit(16 exabytes).

Re:NTFS by SecurityGuy · 2006-03-10 09:21 · Score: 1

There's a large difference between a filesystem format that can support a lot of data and having a collection of spinning disks that comprise a 1.6 petabyte filesystem.
Re:NTFS by Arimus · 2006-03-10 10:49 · Score: 1

There is a world of difference between the technical file size limit (ie based on the bit size) and the actually maximum anyone has used... 1.6 petabytes while being well short of the theoretical limits of a 64bit address system is still an impressive feat...

--
--- Users are like bacteria -> Each one causing a thousand tiny crises until the host finally gives up and dies.
Re:NTFS by jbsmurray · 2006-03-12 12:06 · Score: 1

NTFS didn't support that much space until Server 2003 SP1 and I'm not sure if it does even now. I tried to span a bunch of SAN drives together and was maxing out around 10TB. After that point, adding a disk to make the volume larger would cause an integer overflow that wasn't properly trapped so the disk would just stay the same size but not include the newly added LUN. Before 2003 SP1 you couldn't even import a single LUN of larger than 2TB.

1 petabyte = 1000 terabytes, not 1024. by cepler · 2006-03-10 07:47 · Score: 1

1 petabyte is 1000 terabytes, not 1024 terabytes. Please read and understand SI units for binary prefixes.

Re:1 petabyte = 1000 terabytes, not 1024. by Anonymous Coward · 2006-03-10 08:33 · Score: 1, Insightful

No thanks, I hate that SI stuff. Please understand the context, and if it's not "marketing", then use 1024, just like we've always done.
Re:1 petabyte = 1000 terabytes, not 1024. by The_REAL_DZA · 2006-03-10 08:43 · Score: 1

Yeah, it's not like 24 terabytes is a significant amount of storage... Let's just all say "about a thousand terabytes" and leave it at that.

--

This space intentionally left (almost) blank.
Re:1 petabyte = 1000 terabytes, not 1024. by Anonymous Coward · 2006-03-10 09:33 · Score: 0

"Please read and understand SI units for binary prefixes."

Sorry, I've got to wash my hair.
Re:1 petabyte = 1000 terabytes, not 1024. by vidarh · 2006-03-10 13:09 · Score: 1

Except that a "byte" isn't an SI unit in the first place, so what those prefixes means in the context of SI units have absolutely no relevance whatsoever.
Re:1 petabyte = 1000 terabytes, not 1024. by TheStonepedo · 2006-03-10 14:58 · Score: 1

SI units are not used for storage. Electrical and computing applications have prefixes corresponding to increases of 2^10 rather than 10^3 as they have had for years.

--
I'll be your candy shop of infinite deliciousity if you'll be my discotheque of endless rump-shaking.
Re:1 petabyte = 1000 terabytes, not 1024. by cepler · 2006-03-10 17:38 · Score: 1

http://physics.nist.gov/cuu/Units/binary.html

Try six orders of magnitude by irritating+environme · 2006-03-10 08:01 · Score: 2, Informative

Unless I forgot, a single order of magnitude is 10x, not 1000x.

Peta = 1 000 Tera = 1 000 000 Giga = 1 000 000 000 Mega

--

Hey, I'm just your average shit and piss factory.

Re:Try six orders of magnitude by Anonymous Coward · 2006-03-10 14:54 · Score: 0

Perhaps he meant an order of geeky prefixitude.

TheStonepedo

I'm confused.... by fdawg · 2006-03-10 08:05 · Score: 1

I RTFA and the most I could glean off of it was some jazz about parallel reads. Now hold on, I thought a RAID's bread and butter were about parallel reads. Am I missing something?

Do they mean Parallel Reads off of a NAS setup? A bunch of NAS boxes, with some IBM magic, that shows up as a single volue?

They mention that the machine they are using is some massively parrallel monstrosity with multiple raids per unit. Does this FS aggregate all of them into a single volue? (If you haven't noticed, I'm looking for a clustering filesystem that won't fall appart.)

Re:I'm confused.... by rlh930 · 2006-03-10 20:09 · Score: 1

Ummm - the thing you're missing is that this was from 2000 separate RAIDs at the same time. Sounds easy. It's not.

You *like* JFS? by Junta · 2006-03-10 08:10 · Score: 1

I tried JFS, and it handled power interruptions very poorly.

Essentially, I liked philosophically that the act of mounting and journal replay are separated, it really makes sense. Journal replay should be more an fsck option, thought that was neat. And when you mount read-only, you *mean* read-only, no journal reply or anything even on a 'dirty' filesystem.

However, I found all too frequently that after power failures, it would replay the journal and think everything was fine, until a few hours of usage later when it figures out that it left something in an inconsistant state and remounts read only all of a sudden. Then you fsck and watch lost+found get a few more files. As long as I could recognize the files, I could put them back fairly easily, but I haven't had issue with ext3 yet. Have had similar issues to this with XFS, and, admittedly, far worse with Reiser.

Anyway, returning to topic, GPFS is a filesystem for shared-storage SANs and for aggregating individual node storage into a potentially fault tolerant filesystem (or filesystems). Since they ditched the RSCT stuff a while back, I've found them to be fairly robust and not overly difficult to configure (Lustre I found significantly harder than new GPFS, but lustre is easier than old GPFS to get running). It is not suitable for desktop systems.

--
XML is like violence. If it doesn't solve the problem, use more.

simple filesystem clustering for a SMALL LAN? by Anonymous Coward · 2006-03-10 08:21 · Score: 1, Interesting

Feh. That's all well and good and it SEEMS impressive. I have, however, a bunch of mostly old machines networked. I have them share files via samba (mostly to accomadate my girlfriend's windows machine), sometimes NFS and quite often, ssh. Most of the hard drives are small - together, there is a decent amount of room.

I move, by hand, files hither and fro across my little LAN. I write CDs and DVDs across this system. What I would love would be a simple filesystem that would let me tie a few of these together into one virtual network drive. It should be smart and not cut files, say, unless they are really large so that if a node goes down, you can just access the files directly anyway, and at worst, would need to join a few split files with a simple Linux "join" command. You should be able to give this system an order of preference, ie., try to copy files to node A first, and when node A gets to a percentage of full, start at node B instead, etc.

I don't need redundancy and I don't need striping. I would expect such a filesystem to run at the same speed as any network drives, with a very slight overhead as it shifts gears from one machine to the next. But this would be minor since the system would avoid fragmentation, like I said, except for the biggest files.

If would be no more or less dangerous to my data - right now, if a machine with a certain file goes down, I lose access to that file until I get it back up - this would be no different.

Something like unionFS perhaps, except that the files aren't only written to the system on the "right" or howver that thing works or perhaps something like RAID0, but applied to an abitrary folder, even network folders, rather than just a partition on the same machine.

fred

Vaporware by Anonymous Coward · 2006-03-10 08:22 · Score: 0

99.9% Vaporware. I work for IBM. This ain't flying. The only reason why I give the .01% credit is because a friend (before he was fired for jerking off in his cube) used to work on it, gave it clout in 2004, but its just not there or even CLOSE to release if it is.

You've been Sony'ed by Nom+du+Keyboard · 2006-03-10 08:27 · Score: 0, Redundant

It may be a new file system, but chances are that Sony-BMG has already got it Root-Kitted.

--
"It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."

It has to be said.... by Nonillion · 2006-03-10 09:28 · Score: 1, Redundant

1.6 petabytes ought to be enough for anyone......

--
"I bow to no man" - Riddick

Give it up already by Anonymous Coward · 2006-03-10 10:21 · Score: 0

The standards were already created decades ago as powers of two. I am sick of people trying to revise history just because they don't like they way it is. Hardware manufacturers are the ones who can't get it straight. We can.

Revisionists...

Who mods this $#!+ up? Get over it, revisionist! by Anonymous Coward · 2006-03-10 10:42 · Score: 0

I am so sick of people revising history. Everything relating to computers uses powers of two. That standard was set long ago. Nobody in their right mind will ever adopt these new "standards" of powers of 10, except hard drive manufacturers who only like to inflate the actual numbers.

I have never heard someone say "kibibyte" without being beaten up by yet another nerd.

Re: Plus... by MBMarduk · 2006-03-10 15:49 · Score: 1

It's TERA not TERRA (earth?) The webpage you link to has it in plain sight too.

Re:Who mods this $#!+ up? Get over it, revisionist by Anonymous Coward · 2006-03-10 15:51 · Score: 0

Yeah, revisionism sucks. Use the original meanings of the prefixes, the ones in use for centuries. Oh, and that is standard.

Using metric prefixes for powers of two is sloppy, but understandable in context. I can live with that. But it irritates me when kids label the proper use of the prefixes 'wrong', when it's the other way around.

By the way, everything relating to computers is not powers of two. Addresses have a natural connection to powers of two. Drive capacity does not; the number of platters and cylinders can be anything, and the number of sectors is not only unrelated to powers of two, but varies by cylinder. Communication rates do not. Frequencies do not.

So this is why IBM's R&D expenses are so high! by r_jensen11 · 2006-03-11 17:54 · Score: 1

I was wondering why my accounting book is so fixated on IBM! Well, now I know.

Slashdot Mirror

IBM's High Performance File System

208 comments