Slashdot Mirror


The Design Of The Google File System

Freddles writes "This is an interesting paper (PDF) describing the design approach to Google's file system. The design had to take account of requirements for huge file sizes, a highly responsive infrastructure and an assumption that hardware components will always fail."

50 of 210 comments (clear)

  1. In case you don't like PDF by Brahmastra · · Score: 4, Informative

    Here's the html link

    1. Re:In case you don't like PDF by redJag · · Score: 3, Funny

      In case you hate highlighting as I do:
      try this :)

      I wish I had enough RAM to use as a harddisk. Then I could...well no, I wouldn't do anything useful. It would be cool, in a geeky way.

    2. Re:In case you don't like PDF by Short+Circuit · · Score: 5, Funny

      A ramdisk would make for a great swap partition. :)

    3. Re:In case you don't like PDF by bugnuts · · Score: 2, Funny
      How ironic, that the HTML-ized file on Google is available from Yahoo!...


      Yahoo uses the evil Anti-Google FS. It's the 1's complement called GllgOe. It can store 01111111111 1111111111 1111111111 1111111111 1111111111 1111111111 1111111111 1111111111 1111111111 1111111111 bytes of data.

  2. Thoughtful... by Anonymous Coward · · Score: 5, Funny

    It was thoughtful of the poster to link to google.com for those that have never heard of it.

    1. Re:Thoughtful... by Queuetue · · Score: 5, Funny

      Absolutely - I was about to go look google up on teoma and askjeeves...

    2. Re:Thoughtful... by Anonymous Coward · · Score: 5, Funny

      Last week I had a co-worker ask how to spell it. He is MS cert'd for Win2k Pro. Don't mod this funny, it's sad.

    3. Re:Thoughtful... by Morosoph · · Score: 2, Funny

      In case of Slashdotting

      Take note: "Google is not affiliated with the authors of this page nor responsible for its content."

    4. Re:Thoughtful... by daeley · · Score: 3, Funny

      Last week I had a co-worker ask how to spell it

      I-T. Really now, how hard is that?

      --
      I watched C-beams glitter in the dark near the Tannhauser gate.
    5. Re:Thoughtful... by xoboots · · Score: 4, Insightful

      There's a reason not every search engine is considered the same. Try a simple search for a popular item. I searched for "PHP" on the three sites you mentioned. The top returned results are as follows:

      Google:
      - top result: php.net
      - 2nd place was php.net/downloads

      AllTheWeb:
      - top result: Hands-On PHP Training - 4 days $1695 (also ranked #10 on Turbo10, but not ranked in the top 20 at Google) -- oops, that is a sponsored link, but in AllTheWeb's default view, it looks like a normal link. php.net is actually ranked #1, but it appears 4th in the list of available links.

      Turbo10:
      - will not provide ANY results without Javascript turned on (BOO!)
      - top result: GBF Masonry Cleaning Services..Stone Cleaning
      - php.net ranked 5

      Draw your own conclusions, but meta-search engines existed prior to Google yet even at its launch it excelled over them in terms of provision of relevant links. It appears that it still does. At least for a first pass :)

      I suspect that one of the reasons that Google can bring higher quality links to the forefront is that being #1, they have a wider and more generous revenue base and therefore don't have to be as generous to "paying patrons" *cough cough*.

      Another problem is that meta engines have to mix "high-quality" results (say from Google) with lower quality results (say from some dippy paid for advertising search engine).

  3. Story summary by slash-tard · · Score: 4, Funny

    Google uses MS access as a backend to store all of its cache files. It is redundant by having a batch file setup with the windows "at" command to "xcopy" the data to another backup server.

  4. PDF mirror by Tyler+Eaves · · Score: 4, Informative

    PDF mirror on my server /Feels sorry for the Rochester cs server

    --
    TODO: Something witty here...
  5. Interesting... by petermdodge · · Score: 3, Insightful

    It's an interesting enough read, it certainly is interesting to see how one of the biggest-volume servers out there cope. Now, the question is, what can us little server guys do to implement the ideas therein to our server? What can we take from it?

    --


    Peter M. Dodge,
    Chief Executive Officer,
    LiquidFire Studios

    Platinum Linux - www.
  6. Just to make it clear.. by Doodhwala · · Score: 4, Informative


    Okay, so I read this paper as a part of the SOSP reading group here at school. Just want to make it clear that this is not the file system used by the front end that we all see. It is used by internal dev groups as well as the web spiders that they employ. Their unique usage has definitely led to a number of interesting choices (such as the atomic appends) for the file system design. Read the paper for more details :-)

    1. Re:Just to make it clear.. by Klaruz · · Score: 3, Insightful

      Could you cite your source please? In the first page of the paper linked:

      "It is widely deployed within Google for the generation and processing of data used by our service as well as research and development that requires large data sets."

    2. Re:Just to make it clear.. by Doodhwala · · Score: 3, Informative

      And if you read that statement, it does not mention the front-end. Generation and processing all takes place offline as most of the query results are only updated once a month (the Google-dance). And this question was asked of Howard Gobioff (one of the co-authors) at a presentation on the Google File System (GFS) at Carnegie Mellon.

  7. Hmmm. by Pig+Hogger · · Score: 4, Funny

    I'd like to see a beow...
    Never mind.

  8. Everything's stolen nowdays. by Anonymous Coward · · Score: 2, Funny


    Why the google file system is nothing but a waffle iron with a phone attached.

  9. Only a file system? by jrrl · · Score: 5, Interesting
    Back in the early days at Lycos, Danner Stodolsky, now at Akamai used so many weird little tricks to make things faster that we used to joke that we'd end up with a custom operating system. The supposed name? LycOS.

    Luckily the world was saved from this possibility.

    -John (now, one of those "why, back in my day..." story telling guys... sigh.)

    --
    Self Serving Sig: Hosting Comparison
    1. Re:Only a file system? by FireBreathingDog · · Score: 3, Interesting
      Nice menu: not alphabetized, and "Use a digital camera" appears twice with two different icons. Then there's the inexplicable and unexplained "scribus" menu item, the only item that is neither a phrase nor capitalized.

      Steve Jobs must be shitting in his pants.

  10. Is it open source? by The+Ancients · · Score: 4, Funny

    I need something for my p...err, book collection.

    1. Re:Is it open source? by daeley · · Score: 2, Funny

      book collection

      Ah, yes. You want a new-fangled "ShelFS" system.

      --
      I watched C-beams glitter in the dark near the Tannhauser gate.
  11. Re:You mean FAT don't cut it no more? by Russ+Steffen · · Score: 2, Funny

    I think you mean "WEEEEEEE.EEE." Or possibly "WEEEEEE~1.EEE."

  12. Word processor? by Anonymous Coward · · Score: 2, Interesting

    What word processor/text editor is used to write all of these technical papers? Almost every paper I've seen looks like it's written in the same program.

    1. Re:Word processor? by Saunalainen · · Score: 2, Informative

      The PDF file claims to have been made by dvips, so it was written in Latex. It was then converted to PDF using Distiller.

  13. html version by kaan · · Score: 3, Informative

    thanks to, ehh, Google, here's an html version of the article

    I didn't read the whole article (kinda lengthy) but it seems pretty informative. I found their assumptions interesting, as they reveal some of the essence of what makes Google such a great search tool. Here are a few from the article:

    - The system is built from many inexpensive commodity components that often fail. It must constantly monitor itself and detect, tolerate, and recover promptly from component failures on a routine basis.

    - High sustained bandwidth is more imprtant that low latency. Most of our target applications place a premium onprocessing data in bulk at a high rate, while few have stringent response time requirements for an individual read or write.

    - The workloads primarily consist of two kinds of reads: large streaming reads and small random reads. Successive operations from the same client often read through a contiguous region of a file.

  14. Re:You mean FAT don't cut it no more? by Wumpus · · Score: 4, Funny

    Surely you mean "WEEEEE~1.EEE".

  15. Various hardware life expectancies? by The+Ancients · · Score: 3, Interesting
    ...and an assumption that hardware components will always fail.

    I think perhaps this is something we could all take a little more seriously. Part of me realises this is a comment on the sheer data being manipulated, but then something else that sprung to mind is the gradual reduction of warranties on HDDs, for example. I wonder what sort of stats an operation of this size could gather on various hardware components, and their varying propensities to wither and die.

    1. Re:Various hardware life expectancies? by forevermore · · Score: 2, Interesting

      Gradual reduction of hard drive warranties? Didn't Maxtor just bump up the warranty on their drives to 5 years? And WD and Seagate both have 3 year warranties on their drives. Granted, I'm talking about the "good" (SATA, 8 meg cache, etc.) drives, not the cheap ones that most of us users are using rebates to get for really-cheap.

      --
      Do you really need reason for beer? Wingman Brewers
  16. Re:great. now, deal with the spam issue by winkydink · · Score: 4, Funny
    how many times have you searched for something on google, only to find that the search engine spammers have taken over almost every top 10 result?

    Ummm... not very many. Then again, I try not to search on "teen panties" very often. :)

    That reminds me of the winter I spent in Chicago. I needed some galoshes to protect my shoes and keep my feet dry. Back in New England, we called them "rubbers" (I am not making this up). Needless to say, a google search on "buy rubbers" did not yield the intended results.

    --

    "I'd rather be a lightning rod than a seismometer." -Ken Kesey

  17. Everyone still uses Latex in university. by Anonymous Coward · · Score: 2, Funny

    Just for covering their penis, not reading papers.

  18. Fabulous Insights by dolo666 · · Score: 4, Informative

    I really enjoyed that read about the file system Google uses. The fact that they usually append to their files, is of special note. By appending data you only need to know a simple pointer address. Seems quick enough. Add a bunch of threaded concurrent writes and you could get into trouble on other systems... The "atomic append" seems interesting because of the use of multiple machines to append simultaneously (hazard free).

    64meg chunk size is pretty huge, but I'm guessing that's blocked out based on continual threads of data, not typical files.

    At first glance, this file system seems fairly wasteful. But hey, Google likely require speed and reliability over cost. Right?

    This reminds me of the discussions about not-so-far-off database filesystems coming to an OS near you.

  19. When will it be in the kernel? by caluml · · Score: 3, Funny
    I hope they're going to release it to us mere mortals. I mean, they're probably the only people that need millions of gigabyte+ files floating around thousands of machines, but it would be nice to see

    [ ] Google File System.

    in the kernel config.

    Must be 12pm - the updatedb script it running.

  20. And starting with Linux 2.7... by JessLeah · · Score: 4, Funny

    ...the Linux kernel will have googlefs support. It will be marked (EXPERIMENTAL), though, and will only run on 10,000-node Babelfish clusters...

    1. Re:And starting with Linux 2.7... by Anonymous Coward · · Score: 3, Interesting

      Actually this sounds exactly like the sort of file system that would be useful in a render farm.

      How long before ILM or Weta has a GFS disk array?

  21. they published it ... by trick-knee · · Score: 5, Interesting

    ... which may not have happened from just any company of google's prominence. I mean, they have highly successful business and technical infrastructure models and they didn't HAVE to share it with anyone.

    I wonder what they believe will protect their business from poaching of these ideas?

    1. Re:they published it ... by Anonymous Coward · · Score: 2, Insightful

      The catch up Law.

      Basically it says that if you spend all your time playing catch up you never be first.

      If the other Search engines use the GoogleFS then you know they aren't the leader. Sort of like if kernal.org was running windows 2003 or if www.msn.com was running on linux.

      Now if they go and create a FS so they can be the same as google then they are just catching up. Once they catch up to Google, Google will be somewhere else.

      The other thing is they're are lots of Clustered file systems around so it's not like they have the only one. They've just optimsed one for their needs.

      Basically if the other companies copy the idea it would take them at least a year to get it working by then the Google FS will have more features or they may have another bootle neck eg Google NUMA or the like.

    2. Re:they published it ... by hankaholic · · Score: 4, Insightful

      I wonder what they believe will protect their business from poaching of these ideas?

      Perhaps the fact that it's taken many very smart people a good amount of time to implement and tune the original design, even after having come up with the basic layout?

      Go take a look at the ReiserFS Future Vision page -- you'll see some more interesting discussion of filesystem design, and overall direction. There are a few solid developers working full-time on the concepts discussed in the Reiser docs, and they still have enough work to keep them busy for years to come.

      Google releasing information regarding the structure of their systems is a bit like John Carmack discussing the structure of his graphics engines: there's a hell of a distance between a conceptual description and a fine-tuned, tested, working implementation.

      Given Google's history, I'd also imagine that they're on the lookout for up-and-coming young researchers. As such, if some grad student takes their work and extends it, they can certainly benefit.

      --
      Somebody get that guy an ambulance!
  22. RAIC?? by More+Karma+Than+God · · Score: 3, Interesting

    Could we call Google a Redundant Array of Inexpensive Computers?

    What else can it be programmed to do? Could this become the basis for a personal computer where you just add computers seamlessly when you need more power?

    --
    Go here to create your own Slashdot dis
  23. In case you don't like links at all by Bingo+Foo · · Score: 2, Funny

    In case you don't like reading stories and links before posting, remember this is Slashdot.

    --
    taken! (by Davidleeroth) Thanks Bingo Foo!
  24. Google cache by Skreech · · Score: 5, Funny

    In case Google gets slashdotted, here is the Google cache for Google.

    1. Re:Google cache by CableModemSniper · · Score: 3, Funny

      Best part is the disclaimer at the top:

      Google is not affiliated with the authors of this page nor responsible for its content.

      --
      Why not fork?
  25. GFS and GWS? by cpopin · · Score: 2, Funny

    They designed their own file system as well as Web server? Did they design their own receptionists? If so, I want to work there!

    --
    -=- Many seek good nights and lose good days.
  26. Prevayler anyone? by 12357bd · · Score: 2, Informative

    The in-memory master behaviour described in the paper ressembles a lot the Prevayler software.

    --
    What's in a sig?
  27. GooFS? by hajejan · · Score: 2, Funny

    Yeah, that'll definitely sell.

    --
    The Mini Repository - more links
  28. PC #1782563 by can56 · · Score: 2, Interesting

    See Verity Stobs article -- Cold Comfort Server Farm -- in the August/2003 edition of Dr. Dobb's Journal, for the sad truth about Googles' server farm. Sniff ;-(

  29. LaTex is not a word processor by maxmg · · Score: 3, Informative

    It's more of a "text compiler" where you concentrate on writing the content and leave all of the formatting to a template that is responsible for transofmring the content into (normally postscript) output. Anybody who has worked with LaTex and then moved to Word, only to have that stupid piece of sh*t bunch all images in a document together, on top of each other, on the first or last page of their document will appreciate the LaTex workflow. And LaTex absolutely rocks when it comes to formulas.

    That being said, LaTex comes with a siginificant learning curve, and due to its nature misses some of the features that are important in a business environment (most notably changes tracking). There are some pseudo-wysiwig frontends for LaTex, such as Lyx, but they are firmly targeted at an academic audience. Most scientific papers require submissions in .ps format, processed with a speified LaTex templates (at tleast they did when I was at Uni).

    --
    I asked for a refund - and got my monkey back.
  30. Is there still a Google dance? by harmonica · · Score: 2, Interesting

    I thought the Google dance was history, and the index is now being updated more continuously (how exactly, I don't know)?

  31. People, people by Epistax · · Score: 2, Funny

    The question really on all our minds is can you play doom on it?

  32. What a waste.... by abramsh · · Score: 2, Insightful

    Should have just bought one of these: SGI SAN 3000 It would be easier and cheaper to manage, scales better, and you wouldn't have to spend the money to create and maintain the file system.