Slashdot Mirror


Large File Problems in Modern Unices

david-currie writes "Freshmeat is running an article that talks about the problems with the support for large files under some operating systems, and possible ways of dealing with these problems. It's an interesting look into some of the kinds of less obvious problems that distro-compilers have to face."

6 of 290 comments (clear)

  1. Re:Why large files by Big+Mark · · Score: 5, Insightful

    Video. Raw, uncompressed, high-quality video with a sound channel is fucking HUGE. Look how big DivX files are, and they're compressed many, many times over.

    And compressing video on-the-fly isn't feasible if you're going to be tweaking with it, so that's why people use raw video.

    -Mark

  2. data warehouse, and any database for that matter by CrudPuppy · · Score: 5, Insightful

    my data warehouse at work is 600GB and grows at a rate of 4GB per day.

    the production database that drives the sites is like 100GB

    welcome to last week. 2GB is tiny.

    --
    A year spent in artificial intelligence is enough to make one believe in God.
  3. Re:Wrong point of view. by KDan · · Score: 4, Insightful

    Two words:

    Video Editing

    Daniel

    --
    Carpe Diem
  4. Have you ever seen some people's email? by alen · · Score: 4, Insightful

    On the Windows side many people like to save every message they send or receive to cover their ass just in case. This is very popular among US Government employees. Some people who get a lot of email can have their personal folders file grow to 2GB in a year or less. At this level MS recommends breaking it up since corruption can occur.

  5. Re:Wrong point of view. by cvande · · Score: 5, Insightful

    In a world everything is small and manageable. Unfortunately, some databases need tables BIGGER than 2gb. Even splitting that table into multiple files still finds you with files larger than two gb. Try adding more tables? OK. Now they've grown to over 2gb and the more tables the more complicated everthing gets. I still need to back these suckers up and a backup vendor that I won't name can't help me because their software wasn't large file (for Linux) ready. So let's get into the game with this and make it the default so we don't need to worry about these problems in the future. Linux IS an enterprise solution.....(my $.02)

  6. Re:Wrong point of view. by costas · · Score: 4, Insightful

    Maybe in your problem domain that's true. I work with retailer data mines and we've hit the 2GB file limit, oh, 4-5 yrs ago? We've been forced to partition databases causing maintainance issues, scalability issues, and the like, just because of the size of a B-tree index.

    True, it looks like the optimal solution is lower-level partitioning, rather than expanding the index to 64bits (tests showed that the latter is slower), but that still means that the practical limit of 1.5-1.7 GB per file (because you have to have some safety margin) is far too constraining. I know installations who could have 200GB files tomorrow if the tech was there (which it isn't, even with large file support).

    I am also guessing that numerical simulations and bioinformatics apps can probably produce output files (which would then need to be crunched down to something more meaningful to mere humans) in the TB range.

    Computing power will never be enough: there will always be problems that will be just feasible with today's tech that will only improve with better, faster technology.