The Future of Databases

← Back to Stories (view on slashdot.org)

Posted by timothy on Monday May 2, 2005 @12:12PM from the your-flight-status-is-'maybe' dept.

gManZboy writes "Ever wonder where database technology is going? This is something that Turing award winner Jim Gray from Microsoft has given a lot of thought to. He recently published an article in which he looks at the many forces pushing database technologies forward, and what those new technologies will look like. Gray writes, 'the greatest of these [research challenges] will have to do with the unification of approximate and exact reasoning. Most of us come from the exact-reasoning world -- but most of our clients are now asking questions that require approximate or probabilistic answers.'"

5 of 315 comments (clear)

Min score:

Reason:

Sort:

Great Article by Spaceman40 · 2005-05-02 12:23 · Score: 5, Insightful

The requirements for a database today aren't too much different from those twenty years ago - except for what we want to get out of them.

Now that data mining is a $[insert large number here]million industry, databases are being asked to do a lot more processing with this data than before. For example: old database query = get these attributes from tuples that match this pattern. New database query = determine how likely a user who has accessed 30 or more times this last month is to subscribe to the second-level pay service within the next ninety days, with or without an email advertising said service.

--
I [may] disapprove of what you say, but I will defend to the death your right to say it.
In other words ... by Daniel+Dvorkin · 2005-05-02 12:26 · Score: 5, Insightful

... MBA's want the magic glowy box to do their thinking for them.

Fortunately, Microsoft will be there to take their money.

--
The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
Re:Why complicate things so much? by dioscaido · 2005-05-02 12:34 · Score: 4, Insightful

This random example just server to clarify what you mean -- How implement a airline database that has entries for 1,000,000 customers, 150,000 flights a year, and 12,000,000 reservations a year? and what would a query look like to find an open flight on a particular date range, and register a reservation? And how would doing all this on a ReiserFS be any less prone to data corruption than an often backed up database?
Re:A real problem comes full circle by zappepcs · 2005-05-02 12:52 · Score: 4, Insightful

I think I agree with the parent. Databases are methods of storing and retrieving data. Trying to make queries fuzzy, or less structured is just wrong.

If you want to be able to ask probablistic type queries of a database, you need to add some code between you and the database.

More to the point, the fuzzier your logic is, the higher the probability that your database will not contain all of the answers on its own, and you will have to cross reference your data to the data owned by someone else or gathered from a different disparate source.

It sounds like M$ is going to try to re-invent data warehousing? and then of course, patent it.

Trying to make the database do everything is not right and simply doesn't make sense. The code that accesses the data for you needs to do the fuzzy probablistic stuff.

P.S. I have no faith that M$ (no matter who they hire) can effectively provide the code required to make it work in the idealistic manner spoken of... mostly because they would have to patent accessing other people's data before they could do it.

Just my thoughts

--
Support NYCountryLawyer RIAA vs People
Re:The future of databases is... no Database at al by rossifer · 2005-05-02 16:53 · Score: 4, Insightful

[misc drivel] Read more at:
http://www.prevayler.org/

Oh my dear god. You've never actually used Prevayler have you? Prevayler isn't nearly as useful on actual data problems as Prevayler's worshippers would have you believe.

I know this because I tried to use it. If you'd ever tried to use it, you'd know how unbelievably poorly it performed when attempting to implement real world queries. You have to implement every query in Java, and Java is a particularly poor implementation choice for creating complex queries.

What if I said that this can be as fast as 8000 times faster than Oracle?

This "performance comparison" that the Prevayler group trots out is particularly funny as their test uses a single ArrayList of objects as in-memory "storage" and then "queries" it by index. Not exactly a realistic problem. Try a query across four classes with a few million instances of each class and you'll quickly discover what relational databases are good for.

Regards,
Ross