Slashdot Mirror


The Future of Databases

gManZboy writes "Ever wonder where database technology is going? This is something that Turing award winner Jim Gray from Microsoft has given a lot of thought to. He recently published an article in which he looks at the many forces pushing database technologies forward, and what those new technologies will look like. Gray writes, 'the greatest of these [research challenges] will have to do with the unification of approximate and exact reasoning. Most of us come from the exact-reasoning world -- but most of our clients are now asking questions that require approximate or probabilistic answers.'"

10 of 315 comments (clear)

  1. Turing award winner? by Anonymous Coward · · Score: 5, Funny

    As in, he passed the Turing test?

  2. Why complicate things so much? by bigberk · · Score: 5, Interesting

    How many times have we heard of huge sites going down because databases become corrupt or unrecoverable, or of the huge resource strain (memory and CPU) from a large database?

    In my opinion, the future of databases is nothing so complicated as pitched here -- but rather a move to simpler, more reliable back ends where the filesystem is the database. This is certainly the vision pitched by Hans Reiser and reiserfs, which aims to put more database like intelligence within the filesystem. So you eliminate extra unnecessary layers that just eat up resources and create fragile databases.

  3. A real problem comes full circle by Anonymous Coward · · Score: 5, Interesting

    Data is data. Just data. Save it, read it, sort it how you like. Efficient results mean having rapid, low-latency access to data.

    Add code to it, and you have data+code.

    OF course, code is data, and thus data can be treated as code, and handled by other code. LISP does this moderately well.

    But you can't avoid the fact that, as it stands, databases are just engines for keeping your data structures outside your code, or when you add code to them, engines for reading your data structure for you so that you don't have to think about how to do it. ... except that you still do, because SQL isn't a way of avoiding logical errors. ... and that they still don't save time. At best, they allow for some parallelism, external access to the data, and a separation of concerns.

    I'm getting rather tired of the fad that databases should be tacked on to everything, ranging from a shopping list to guidance systems. When did adding overhead become the mark of skill?

  4. Great Article by Spaceman40 · · Score: 5, Insightful

    The requirements for a database today aren't too much different from those twenty years ago - except for what we want to get out of them.

    Now that data mining is a $[insert large number here]million industry, databases are being asked to do a lot more processing with this data than before. For example: old database query = get these attributes from tuples that match this pattern. New database query = determine how likely a user who has accessed 30 or more times this last month is to subscribe to the second-level pay service within the next ninety days, with or without an email advertising said service.

    --
    I [may] disapprove of what you say, but I will defend to the death your right to say it.
  5. In other words ... by Daniel+Dvorkin · · Score: 5, Insightful

    ... MBA's want the magic glowy box to do their thinking for them.

    Fortunately, Microsoft will be there to take their money.

    --
    The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
    1. Re:In other words ... by YrWrstNtmr · · Score: 5, Funny
      I don't know how many times I've heard that thought process over the years.

      [MBA tool]"I want to come in in the morning, push a button, and have the program distribute all my stuff."

      [me]"If I could make it do that, I could make it push its own button, and the company wouldn't need you anymore."

      [MBA tool]"Oh."

  6. $article_title by $blowhard. by Anonymous Coward · · Score: 5, Funny

    $techology is dying. It will be replaced with $replacement. Insert 4000 more words sprinkled with $random_buzzwords. I am so smart! The end.

  7. Bioinformaticists (and spies) use this a lot by Dioscorea · · Score: 5, Informative

    most of our clients are now asking questions that require approximate or probabilistic answers

    Bioinformatics databases are a good example of this. DNA and protein sequence databases are often searched by approximate string-matching algorithms based on "dynamic programming" to hidden Markov models and other stochastic grammars.

    Historically, drug target-hunters in Big Pharma created a market for accelerated hardware to facilitate dynamic programming searches, some of which (e.g. Paracel's Fast Data Finder chip) was originally marketed to government agencies who, um, shared an interest in approximate string-matching ;)

  8. I want clustered databases for high-availability by SpecialAgentXXX · · Score: 5, Interesting

    The "next great advancement" in databases will be when I can setup 2 or more linux servers and have them act as a single database server. Our database server is the most expensive item in our datacenter because it's an N-way IBM server.

  9. Sure... by Anonymous Coward · · Score: 5, Funny

    Could someone summarize it without using the letter 'e'?

    Sure.

    Th Futur of Databass
    Postd by timothy on Monday May 02, @08:12PM
    from th your-flight-status-is-'mayb' dpt.

    gManZboy writs "vr wondr whr databas tchnology is going? This is somthing that Turing award winnr Jim Gray from Microsoft has givn a lot of thought to. H rcntly publishd an articl in which h looks at th many forcs pushing databas tchnologis forward, and what thos nw tchnologis will look lik. Gray writs, 'th gratst of ths [rsarch challngs] will hav to do with th unification of approximat and xact rasoning. Most of us com from th xact-rasoning world -- but most of our clints ar now asking qustions that rquir approximat or probabilistic answrs.'"

    Hmmm, I kind of like 'databass'.