Turing Award Winner On The Future of Storage

← Back to Stories (view on slashdot.org)

Turing Award Winner On The Future of Storage

Posted by timothy on Wednesday September 17, 2003 @01:55AM from the even-smaller-bits dept.

weileong writes "Ars Technica highlights an interview at ACM Queue with Jim Gray, a winner of the ACM Turing award *(among other things) by one of the pioneers of RAID (among other things). Many issues touched upon, including: "programmers have to start thinking of the disk as a sequential device rather than a random access device." "So disks are not random access any more?" "That's one of the things that more or less everybody is gravitating toward. The idea of a log-structured file system is much more attractive. There are many other architectural changes that we'll have to consider in disks with huge capacity and limited bandwidth." Actual interview has MUCH detail, definitely worth reading."

14 of 227 comments (clear)

Min score:

Reason:

Sort:

Solid state is the way to go. by caluml · 2003-09-17 01:57 · Score: 4, Interesting

"programmers have to start thinking of the disk as a sequential device rather than a random access device."
I think we'd all be better off when solid state, non-mechanical disks become commonplace.

Is there any reason other than cost why we can't have 100Gb solid-state drives yet?

--
Get your own free personal location tracker
Network speed by CausticWindow · 2003-09-17 02:07 · Score: 3, Interesting

they are part of Internet 2, Virtual Business Networks (VBNs), and the Next Generation Internet (NGI). Even so, it takes them a long time to copy a gigabyte. Copy a terabyte? It takes them a very, very long time across the networks they have

Is this really true? Wasn't there a recent Slashdot story where researchers transfered a gigabyte of data, in fourteen seconds or so, on Internet 2 from California to the Netherlands?

I suppose that disk access times will be limiting factor in both ends if you were to read and write the data from/to a disk.

--
How small a thought it takes to fill a whole life
2 quotes... by leomekenkamp · 2003-09-17 02:18 · Score: 2, Interesting

Two quotes from the article (emphasis mine):

Gray, head of Microsoft's Bay Area Research Center, sits down with Queue and tells us (...)

JG: If it is business as usual, then a petabyte store needs 1,000 storage admins. Our chore is to figure out how to waste storage space to save administration.

MS bashers will have a field day on this one...

--
Wenn ist das Nunstueck git und Slotermeyer? Ja! Beiherhund das Oder die Flipperwaldt gersput.
MRAM saves the day by Markus+Registrada · 2003-09-17 02:39 · Score: 3, Interesting

All the tradeoffs will change radically when MRAM hits the streets. It's potentially denser than disk and DRAM, as fast as static RAM, nonvolatile, doesn't use power when it's not used, and can be made on regular silicon process machinery. Expect it first in cell phones next year, and then everywhere.
This doesn't just affect file storage and virtual memory. It also changes the economics of cache and main memory, and makes deployment of 64-bit CPUs more urgent. It also makes system crashes much less tolerable, because turning the computer off and on doesn't involve long shutdown and boot procedures any more.
3 Terrabytes on a credit card? by polyp2000 · 2003-09-17 02:51 · Score: 4, Interesting

Anyone know what happened to that bloke at keele who
invented a way of cramming 3 Terrabytes on a credit card. Apparently it would have cost about 35 pounds to manufacture. this was a couple of years ago, why hasnt it happened yet?

Surely something like this is the real future of storage ?

Terrabyte on a credit card

--
Electronic Music Made Using Linux http://soundcloud.com/polyp
New File System by Archangel+Michael · 2003-09-17 02:56 · Score: 2, Interesting

What current file systems need is meta data in them. That is that the File system itself stores the MetaData about the file. Think about the Mac File system, with the Meta data contained in the file itself, as the "resource fork". Now imagine a systemized, extensable meta file system, that organized files by what the Meta Data said about them.

Imagine, media files stored in such a way that both random and sequential access was optimized, where the file structure was automagically defragmented and organized behind the scenes.

Imagine a computer that watched what files were used at bootup, and organized them so that the hard drive streamed the bootup data sequentially, straight into memory.

Imagine being able to start PRELOADING applications before you even finish the second of your double clicks on the datafile.

Imagine Database files that were automagically indexed as part of the file system.

Imagine Security and encryption being built into the filesystem beyond today's capabilities, where the security and encryption does not rely upon a master controller or centralized security policies, but rather has the ability to follow the file, seemlessly.

I am sure that I haven't even begun to tap the possibilities.

--
Agent K: A *person* is smart. People are dumb, stupid, panicky animals, and you know it.
1. Re:New File System by Archangel+Michael · 2003-09-17 03:56 · Score: 2, Interesting
  
  That is just a start. That article mentions nothing about Meta Data, which is required to make advanced capabilities of the File System come alive.
  
  For Meta Data to work, there has to be some sort of STANDARDS based way of describing said data.
  
  For instance, a table. How would you describe a table? Is it Tab delimited text, Spreadsheet or a HTML based Table? Does it reference cells and or other tables? Are those available? Is the data from missing tables, available as a static value?
  
  Is the data within the table used in other work, such as a presenation or Brochure? The value is not in the system, but in the interlocking way we use it, and that needs to be described as Meta Data.
  
  --
  Agent K: A *person* is smart. People are dumb, stupid, panicky animals, and you know it.
IDE replaces DVD by G4from128k · 2003-09-17 03:02 · Score: 4, Interesting

With an ever growing collection of digital photos, I've come to the same conclusion as Jim Gray. Hard disks are superior for backups.

I currently have about 100 GB of images and it takes more than 20 4.7 GB DVD-R discs to create a full backup. Although DVD media is still slightly cheaper than new large capacity IDE drives, the added time and hassle factor of burning 20 disks far out weighs any minor costs savings. Moreover a 3.5" drive in a padded anti-static bag takes up less room in the safe deposit box than 20 DVDs (especially if you have the DVDs in protective jewel cases). And if HD-based-backup lets me avoid some future artists tax on burnable media, so much the better.

A Firewire enclosure and a rotating collection of IDE drives is the way to go.

--
Two wrongs don't make a right, but three lefts do.
Interesting Idea... by polyp2000 · 2003-09-17 03:04 · Score: 3, Interesting

Interesting thought popped when i read your post,
there is a current trend towards cramming as much storage into something the size of a 3in Hard drive.

I wonder why they dont make larger harddrives in the physical sense? A hard drive the size of a washing machine using todays technology would store a phenomenal amount of stuff, but whatabout something more reasonable like a hard drive merely twice the physical size of todays. how much more storage could you get just by scaling up the platters? anyone here good at math . Hard drives today must be up to 200-250gb.

--
Electronic Music Made Using Linux http://soundcloud.com/polyp
Re:Wait by Anonymous Coward · 2003-09-17 03:17 · Score: 2, Interesting

There are multiple levels of access within a file system. The sequential versus random decisions they are talking about is at a much lower level than you are thinking. Somewhat simplified:

Now, when software opens a file, it gets a handle to the storage and seeks all over it to get the data it needs and finally write it back. This is particularly true of files that consist of many records. Some software mmaps (memory maps) the file, mapping it into the memory address space and making it appear as a large, slow section of RAM in order to make this easier.

Relatively recently, you see many more programs which open a file, slurp the entire thing into memory, and close the file on disk. When they want to make changes, they open the file again and rewrite it from scratch. You see this more in text editors and word processors. Programming editors will often have some alternate behavior for very large files, although the threshhold for "very large file" is always increasing.

When you do this with record oriented files and or incremental save/autosave, etc, you get into journalling. You write all of the user's changes sequentially to a log file rather than saving the actual file (and re-writing it) repeatedly. This is sometimes what you are seeing when a program has a 'recovery file'. Having only one recovery file or journal for any number of open files means you are consistently writing appends to a single location and avoiding disk seeks.

What the article is getting at is that this sort of behavior will get more and more common, even moving into the FS and OS level. Support for this kind of journalling may move its way into FS handling, for instance. Also, instead of opening individual files, the FS may block transfer a whole directory into RAM at once. We already see this with advanced file systems which store small files directly in the directory inode. We may see the inodes get larger and the definition of 'small file' become steadily larger. When you have GBs of RAM and TB of storage, why not have a 64 MB+ inode?

From this point of view, random seeking within files slowly becomes irrelevent. Rather, the primary operations become streaming and append.
Re:Troll in the article by panurge · 2003-09-17 03:43 · Score: 2, Interesting

I'm going to confess that I have probably misunderstood the point. The precise bit of the article I was referring to was:
The challenge is similar to the challenge we see in the OS space. My buddies are being killed by supporting all the Linux variants. It is hard to build a product on top of Linux because every other user compiles his own kernel and there are many different species. The main hope for Oracle, DB2, and SQLserver is that the open-source community will continue to fragment. Human nature being what it is, I think Oracle is safe.
DP Is MySQL.com trying to be the Red Hat of MySQL?
JG It could be that they will step forward and provide all of those things that IBM, Microsoft, and Oracle provided, and do it for a much lower price. I think the incumbent vendors will have to be innovative to make their products more attractive.
One thing that works in the incumbents' favor is fear, uncertainty, and doubt (FUD). If you base your company on a database, you are risking a lot. You want to buy the best one. People are usually pretty cautious about where they want to put their data. They want to know that it's going to have a disaster recovery plan, replication, good code quality, and in particular, lots and lots and lots of testing.
The thing that slows Oracle, IBM, and Microsoft down is the testing, and making sure they don't break anything--supporting the legacy. I don't know if the MySQL community has the same focus on that.
At some point, somebody will say, "I'm running my company on MySQL." Indeed, I wish I could hear Scott McNealy [CEO of Sun Microsystems] tell that to Larry Ellison [CEO of Oracle].
DP The whole corporation?
JG Right. Larry Ellison announced that Oracle is now running entirely on Linux. But he didn't say, "Incidentally we're going to run all of Oracle on MySQL on Linux." If you just connected the dots, that would be the next sentence in the paragraph. But he didn't say that, so I believe that Larry actually thinks Oracle will have a lot more value than MySQL has. I do not understand why he thinks the Linux problems are fixable and the MySQL problems are not.

I was concentrating on his claims that building a system on top of Linux is particularly hard, and his mentioning Microsoft in the same sentence as IBM and Oracle. Although I make extensive use of MySQL for small systems in our consultancy, I think it is a long way from being ready for the main enterprise RDBMS. In fact, I felt he was trying to tar Linux with the MySQL brush, if you see what I mean.
I now think he probably did not mean it the way I read it. If anyone cares to mod my original post down, feel free. But I do think that, for a long article, there was actually not a lot of real content.

--
Panurge has posted for the last time. Thanks for the positive moderations.
Missing the logical boat by leandrod · 2003-09-17 04:01 · Score: 2, Interesting

> To some extent you can think of Codd's relational algebra as an algebra of punched cards. Every card is a record. Every machine is an operator.
Interesting how the guy literally wrote the book on transactions, yet grossly misrepresents Codd's work, which BTW wasn't simply the relational algebra, but even higher level: the relational model of database management, including the relational calculus.
While the algebra is somewhat procedural, the calculus is set-oriented, and they are fully equivalent. The idea is exactly not looking at records and operators, but describe what you want -- just leave the relational system set the procedures to get that in the most efficient way it can.
Incidentally this has a big impact on all Gray is discussing -- without a fairly simple and powerful data model, so much data is basically a waste. He's thinking too low level, including the object stuff he touts, but we will only find use for so much data the day we get proper relational implementations, and this excludes SQL in general and MySQL in particular.

--
Leandro GuimarÃ£es Faria Corcete DUTRA
DA, DBA, SysAdmin, Data Modeller
GNU Project, Debian GNU/Lin
Re:It's "A station wagon full of..." by AJWM · 2003-09-17 07:23 · Score: 2, Interesting

Certainly 1980s, probably circa 1983 or 1984 at the latest. I came up with the phrase (which may well have been independently coined before me, at the time I was unaware of it) when we were setting up NETNORTH, the Canadian counterpart to BITNET (networks of typically college campus mainframes, not directly part of ARPANET). There was discussion about setting up the HQ at University of Guelph (where I worked at the time - west of Toronto) or Waterloo University.

The highway in question (as in station wagon travelling on) was the Highway (7? it's been a long time) between Waterloo and Guelph (at least part of which I drove every day, since I lived in Waterloo). I don't recall the numbers now, but my calculation of the bandwidth of Hwy 7 was based on a couple of boxes of 2400' reels of 6250 BPI tape (standard IBM mainframe tape size) in a car (or station wagon) travelling at the posted 90 km/h speed limit.

Back in those days, aside from dedicated leased-line networks like BITNET or commercial X.25 packet networks like Tymnet, a 2400 baud dialup modem was considered blazingly fast. (And long distance charges were not cheap, hence the popularity of multi-hop dialup networks using UUCP or like Fidonet.)

--
-- Alastair
Re:Huge disks by Llurien · 2003-09-17 07:43 · Score: 2, Interesting

Interesting point. I guess that's partly because a human collects stuff in a more or less linear fashion. Everything you collect, create or use takes time, and time is a resource that we don't get more of simply because our computers get faster. It is possible to handle one single 4 GB file such as a movie, but it would be impossible to do something meaningfull with 4000 1MB files, it would simply take too much time. Offcourse, you could think of automated tasks operating on large sets of files, but again random access would serve no benefit here. Throughput is important in the case of a program handling a sequence of small files.