A New Data Model for the Web

*sigh* by Anonymous Coward · 2005-07-26 21:20 · Score: 3, Interesting

Do we take two steps back every week in this industry or what? RSS is a text file format. It's not a "data model".

What are the operators for manipulating this data? What is the type system? How is integrity guaranteed? How do I build a distributed database system with it?

There is only one complete data model: the relational model. Demonstrate to me how this "new" data model is not either 1) some subset of the relational model or 2) a bunch of nonsense, not a data model at all.

He's got one thing right: XQuery (return to the hierarchic databases of yesterday) and RDF (return to the network model, but with a fixed 3-value schema) are nothing to waste your time on.

To me his assertions are like saying, for example, the fundamental theorems of electromagnetism no longer apply to cell phones because they can now play MP3s, or something. Makes no sense.

Unfortunately, there is nobody left in this industry that has any clue about databases.

Re:*sigh* by tenco · 2005-07-26 23:01 · Score: 1

Do we take two steps back every week in this industry or what? RSS is a text file format. It's not a "data model".
I think "data model" refers to distribution and the scalability of distribution in this case.
Re:*sigh* by malachid69 · 2005-07-27 02:23 · Score: 2, Interesting
There is only one complete data model: the relational model.
I have an issue with this statement. It could be because I *hate* SQL, but, let's see what other's think... According to http://www.google.com/search?hl=en&lr=lang_en&c2co ff=1&oi=defmore&q=define:data+model there are a few definitions for data model... among them are:
- A data model is a collection of descriptions of data structures and their contained fields, together with the operations or functions that manipulate them.
- A data model is the structure in which a computer program stores persistent information.
- A method for describing data structures and a set of operations used to manipulate and validate that data
- A data model is a description of the organization of data that is stored in a computer system.
- A description of a specification and representation paradigm for data. (I really like this one personally)
- The logical means of organization of data for use in an information system.
That's probably enough definitions... Based on those, it would seem that RDF, XML (when using an XSD or DTD), SQL, even the filesystem itself would qualify as data models. I think your assumption that only relational databases are valid is a bit presumpuous. That is the reason why most XML databases are not true XML databases -- because many of them store their data via SQL in the background. Some ( http://www.techworld.com/applications/features/ind ex.cfm?featureid=910 ) would even suggest that relational databases are NOT a good or optimal solution.
--
http://www.google.com/profiles/malachid
Re:*sigh* by Anonymous Coward · 2005-07-27 05:57 · Score: 1, Informative

SQL has very little to do with the relational *model*. Put it out of your mind completely when discussing database theory. I'm not talking about database *products*.

The relational model specifies a set of relational operators, much like mathematical operators. SQL looks nothing like this, just to get the value of a single relvar you have to type "SELECT * FROM Foo" instead of the simpler and clearer "Foo". That's just the tip of the iceberg..

Anyway there are at least two definitions of "data model". One is a model for a particular set of business rules (customers, orders, line-items), the other definition is "a general system for the storage, manipulation, and retrieval of data". We're talking about the second definition. When we say a "complete" data model, we mean one that describes all possible cases, and also describes what each piece of data means and how it relates to the real world.

SQL is not a complete data model in this sense, it doesn't specify how to connect the data in the database to the real world for instance (relational model says: use predicate logic). SQL databases do have some underlying model of course, but who knows what it is, I don't think anybody has described it exactly, and it can probably be shown to reduce to a subset of the relational model. For instance, SQL allows duplicate "rows", so you'd need to introduce a hidden "row id" attribute to map them to relational tuples (which are based on sets, no dupes).

XML is a text file format. Even if you "retrocon" a data model out of it, it will be a hierarchic data model, which is what we had in the 60's before relational. Why did we reject hierarchic data? It's not general enough. Sometimes "customer" is the root of the tree, but sometimes you want "order item". Sometimes you want many-to-many relationships, which XML only supports via a different mechanism (pointers, essentially). Sometimes you want a "theta join", which could be an arbitrary relationship between data. Also XML contains multiple types of data (nodes, attributes, implicit node ordering) which is just a smelly design. In the relational model ALL data is in relations, including ordering, nesting, and anything else that XML makes implicit.

I suggest reading some basic database theory books and then revisiting your ideas. Anything by Chris Date is excellent, and he has a title on O'Reilly now so maybe people won't be afraid of reading his work.

Here's a hint, in order to demonstrate that relational is not the only complete data model, you have to introduce a data model that is at least as complete as the relational model, and then prove that it is not the relational model in disguise (in other words, that there's no way to automatically map your operators to relational operators, and so on). I'm not sure what such a model would look like.
Re:*sigh* by Godeke · 2005-07-27 08:06 · Score: 1

Some would even suggest that relational databases are NOT a good or optimal solution.

Actually, the point behind Tutorial D, Rel, etc is that the current *implementation* of relational databases are broken ... from a mathematical view point. The relational concepts have been deviated from and that is the downfall of SQL as an implementation of relational databases.

Of course, SQL survives (although broken and inconsistent) because it works today and it puts bread on the table... as opposed to a theoretical set of "pure relational" languages which are not yet mature enough to deliver. Additionally, the base article makes the point that HTML worked when SGML didn't because it mixed concerns and operated at a level that "rank and file" people could operate on. I have played with the "pure relational" languages and I have to say that SQL may be ugly, but I have taught non programmers to devise simple queries. These new languages are less "intuitive" to those not trained in the arts (although pretty to my eye as a programmer), and perhaps reflect a dichotomy similar to HTML vs SGML.

--
Sig under construction since 1998.
Re:*sigh* by pete6677 · 2005-07-27 09:47 · Score: 1

Strictly in the buzzword sense, RSS is becoming the new XML, in that people make it out to be a much bigger deal than it really is. Right around 2000-2001, business people were raving about XML being the "dawn of a new era", where technical people were thinking "it's just a damn text file with some markup". RSS is much the same, non-technical types think it's an entire new technology instead of what it is: a new way of doing essentially the same old stuff.
Re:*sigh* by malachid69 · 2005-07-28 04:03 · Score: 1

XML is a text file format.
While true, that argument really has little merit. That is like saying MDB is a binary format. So what.
Sometimes "customer" is the root of the tree, but sometimes you want "order item".
Yes, and anyone who has worked with XML or object-oriented databases will show you that is very simple to do. You have a Customer schema, and an Order schema. Big deal. You do the same thing with relational tables.
Sometimes you want many-to-many relationships
When you move away from the idea that ALL of the data has to be maintained in a single XML file, this is very VERY simple to do. And again, relational databases do the same (multiple tables).
Also XML contains multiple types of data (nodes, attributes, implicit node ordering) which is just a smelly design. In the relational model ALL data is in relations, including ordering, nesting, and anything else that XML makes implicit.
You've lost me here. In XML, you have nodes that contain nodes and have attributes. In relational databases, you have tables with rows and columns, and the tables and columns have properties. Also, XML easily supports relationships when you understand my previous comment. As far as ordering goes, that is determined by the schema. Also, I don't see how SQL datatypes are any more relational than the XML data types (which can include SQL datatypes at a whim).
I suggest reading some basic database theory books and then revisiting your ideas.
I have used many forms of databases... Relational, associative, XML-based, Object-oriented, etc. I have even written a few. I have provided feedback on FAQs that dealt with them. What it really comes down to is that your exposure to XML is very limited, and you fail to see that relational databases have drawbacks that other types of databases fix. That is extremely obvious by your comment that the relational data model is the only complete one. My guess is that you use something like MySQL very often and have learned to like it.

--
http://www.google.com/profiles/malachid
Re:*sigh* by museumpeace · 2005-07-29 03:03 · Score: 1
Yeah, I came away with a similar impression: Bosworth, or at least those touting his speech, are ignoring the truely significant differences between web searching and DB queries in these areas
- performance...he talks about scaling as if DB size and number of users were the only dimensions of scaling issues...time is an important dimension as well, espcially when you speak of concurrent update attempts.
- databases support writing, not just searching, records and that entails levels of privelege beyond what the web easily supports.
- API: SQL is an api and supports programmatic use of databases. THAT CANNOT BE SLOPPY Users have the intelligence to sort out relevant responses to a query, programs do not and the precision of a SQL response from a properly organized database IS mandatory. Can you imagine a "sloppy" access of your medical records that left a doctor with picture of your health history that was actually a composite of several persons who matched your search parameters to tolerances "close enough" for search engines? Sheesh!
- Current RDB and SQL are just barely adequate in supporting a few levels of access privelege...ultimately access authorization is damn hard to distribute in the way Bosworth seems to pine for
--
SLASHDOT: news for people who can't concentrate on work or have no life at all and got tired of yelling back at the TV.

Not really by Linus+Torvaalds · 2005-07-26 21:20 · Score: 4, Interesting

He is emphatic that RSS 2.0/Atom are the next big thing and represent the new data model for the web.

Here's the thing: RSS 2.0 and Atom really don't have a revolutionary data model. They are just file formats that list short descriptions, in a sequential order, with a bit of meta data, that get polled on a regular interval. That's all.

They are only popular because the use pattern is different to normal web pages. The tech itself is pretty mundane. Internet Explorer 4.0 has something similar with "channels", way back in the 90s.

You could have done the same thing with a subset of HTML 2.0 in the 90s. The main reasons people didn't is because they didn't think of it and the need wasn't as great.

The Semantic Web, on the other hand, is doing new stuff. Some of it we don't know how to do yet. Some of it is immediately practical, some of it isn't. The Semantic Web is more of an idea than a tangible product.

By saying that RSS and Atom somehow "beat" the Semantic Web, he's comparing apples to oranges. It just doesn't make sense.

The reason the web took off so well was because it was built from a few simple principles that could be generalised. Resources that could be addressed. Simple, text-based markup. Simple, text-based protocol.

The Semantic Web will probably take off in the same way, with various bits already being used to varying degrees of success (e.g. Mozilla already uses RDF). But it's a much bigger problem, so expecting it to take off just as quickly is naive.

Re:Not really by MoonFog · 2005-07-26 21:44 · Score: 3, Informative

Semantic web is perhaps best described as a framework. I totally agree that it's a pointless comparison. RDF/RDFS and OWL build upon XML so it would make more sense to say that RSS could be a building block for further extensions on the semantic web using for example OWL to represent data.

One of the reasons it appears to move along so slowly now is that the research is handling a lot of issues and as van Harmelen has said, they're afraid to enter the same pitfalls as the research in artificial intelligence where there has been a lot of buzz, but not many concrete results. That's not to say that there aren't any issues with the semantic web, but it's still coming along. OWL is being extended with OWL-S and OWL-QL and the issues of security and privacy are being looked at. Besides, even though ontologies are a new development on the web, they are nothing new overall, something I guess AI researches can testify to.

Recommended book for those who want to extend your knowledge on SW A Semantic Web Primer
Re:Not really by bblfish · 2005-07-27 03:08 · Score: 1

In any case there is a lot of work going on in creating OWL models for Atom. The two are not at all incompatible - and with a little more work they could have been nearly indistinguishable.
Another really good book that covers all the bases is Service Oriented Computing which gives a very good view as to how the Semantic Web, Agents, Web Services and RESTful apis fit together. This is a really serious book, but it helps get an understanding of the problems that are attempting to be solved.
Re:Not really by rho · 2005-07-27 03:23 · Score: 1

Therein lies the problem: the SW is a framework, which means what, exactly? Sounds good; nerds love it. What does it do for me?
RSS/Atom is a product. I can see immediately that it is, or is not for me. The SW is just ideas. Good ideas, but nothing in the sack.

--
Potato chips are a by-yourself food.
Re:Not really by MoonFog · 2005-07-27 03:41 · Score: 1

Therein lies the problem: the SW is a framework, which means what, exactly? Sounds good; nerds love it. What does it do for me?

Currently? Not much, it's still fairly new and being developed all the time. XML is an integral part of the semantic web and you use that no?

RSS/Atom is a product. I can see immediately that it is, or is not for me. The SW is just ideas. Good ideas, but nothing in the sack.

Unfortunately, it's still in the starting blocks, but the plans have always been to take it step by step. The latest step to be filled in was a full fledged ontology language (OWL) and that became a recommendation in February last year. Not exactly a lot of time to mature.
Re:Not really by AuMatar · 2005-07-27 06:36 · Score: 1

XML has nothing to do with the semantic web, except as one possible implementation. The semantic web is the idea of putting machine parsable meta-data into documents, so that machines can parse it out and understand what type of content is in the data. XML is a way of doing this. But calling it an integral part of the SW is like calling HTTP an integral part of it. Its a technology it can use, but is just one way of implementing it.

As for using XML- actually, I don't think I've looked at any XML in well over a year. XML is like the emperors new clothes of technology. It sounds nice and people make it seem amazing, but it doesn't really do anything. Parser generators have existed since the 60s. The hard part of text analysis isn't parsing, its interpreting the resulting tokens. XML doesn't help with that whatsoever. So why use an ugly and way too heavyweight markup like XML over other plain text formats?

I do think SW is an idea doomed to failure though. The problem is- metadata doesn't work. Metadata cannot be machine implemented. This means a human needs to sit around and markup the data. This takes time, effort, and is bug prone. People by and large have better things to do. The end result is that they won't due it, and without widespread use the SW won't ever come to exist. Working on search technology is far more important. Search gets much of the same data, without requiring human generated metadata. Google already offers me 95% of the benefits of the semantic web, with no effort. Until you can get a way for a machine to fill in metadata for arbitrary documents, the idea is dead. And doing that pretty much requires an AI capable of human level abstraction. In 50 years we haven't come close. Don't expect that to happen in the next half century either.

--
I still have more fans than freaks. WTF is wrong with you people?

A war between the humans and computer scientists by Felonius+Thunk · 2005-07-26 21:38 · Score: 3, Insightful

I'm downloading the speech now, but if it's anything like this great speech he gave last year, it will be well worth listening to. That one changed my mind about what great things might look like. I've realized the great and wonderful content management system that my group is building is utterly doomed, for example, and I already have a new job in hand. It's all about the sloppiness.

Just listened to the whole thing by QuantumG · 2005-07-26 22:26 · Score: 0

Good presentation. It reminded me of an email I got the other week on my local LUG mailing list. Someone was complaining about how strict XML processing is vs HTML processing. If you miss a tag in HTML, yeah, no problem, the parser will forgive you. Miss a tag in XML, sorry, no rendering today. The result? No-one writes XML by hand (unless they're a masochist) and that means your average Perl, Python, PHP coder will actually have to read some docs or a specification to remember how to output this stuff so they just won't bother. Bosworth says that's why RSS 2.0 beats the pants off RSS 1.0, anyone can create these files and the freely available libraries that handle this stuff are really really fault tolerant. He says a lot of stuff about scalability and other stuff, but you can just listen to the mp3 if you wanna hear what he said.

--
How we know is more important than what we know.

Re:Just listened to the whole thing by Anonymous Coward · 2005-07-27 00:18 · Score: 3, Insightful

Open tag - close tag - encode amper's and, greater-than, less-than.

Use appropriate character-encoding and -decoding at I/O-borders.

Finished.

Everyone who is not able to do these things correctly by hand or to make his script output correct XML should continue flipping burgers and does not belong in this industry.

What kind of Kindergarten is IT turning into?

Fuck.
Re:Just listened to the whole thing by Maian · 2005-07-27 00:22 · Score: 1

Well, XML was primarily meant to be machine-read. It just has the added benefit of being human-understandable and human-writable.
Re:Just listened to the whole thing by Linus+Torvaalds · 2005-07-27 00:23 · Score: 4, Insightful

Miss a tag in XML, sorry, no rendering today. The result? No-one writes XML by hand

Actually, it works the other way around. Because syntax errors are immediately obvious when writing XML, it's a lot easier to write by hand, because when you make a mistake, you notice it straight away.

The reason why so many people use libraries with XML is because it's a standard format with libraries for practically every language. Using a library often saves time compared with writing stuff by hand.

that means your average Perl, Python, PHP coder will actually have to read some docs or a specification to remember how to output this stuff so they just won't bother.

Rubbish. They'll do exactly what they did to learn how to generate HTML - look at a few examples and make their own that looks like the example. <?php echo('<foo>My XML Document</foo>'); ?> is no harder than <?php echo('<h1>My HTML Document</h1>'); ?>

Bosworth says that's why RSS 2.0 beats the pants off RSS 1.0, anyone can create these files and the freely available libraries that handle this stuff are really really fault tolerant.

Both RSS 1.0 and RSS 2.0 use XML syntax and have freely available libraries anybody can use. But didn't you just say that nobody will bother using XML formats because people won't read the documentation that tells them how to use such libraries?
Re:Just listened to the whole thing by snorklewacker · 2005-07-27 03:11 · Score: 1

> Miss a tag in XML, sorry, no rendering today.

My own XML parser has error reporting and recovery (if hardly anything else). Sorry to hear that yours is so broken. Try a different one.

Java dies on you if you miss a curly brace, yet people keep on using it.

--
I am no longer wasting my time with slashdot

Heh heh... by Anonymous Coward · 2005-07-26 22:26 · Score: 1, Funny

Heh heh heh... He said 'seminal'...

Content, Availablity... by Vo0k · 2005-07-26 22:56 · Score: 3, Insightful

There's way more to successful formats than the structure. But let me name two essentials.
What use is a format of data if the data itself is useless?
How can a format take off when only few have access to publishing in it?
That's the way Gopher went. Only admins could add pages. Meantime, most of people with access to the net, were able to create their own ~/public_html
Now RSS is the big thing. People add RSS to everything. Where are MSIE's "channels"? Spamvertisment available to the chosen few. Revolutionary video tape technologies competetive to VHS: None in shops, few movies available. And so on, and so on...

--
Anagram("United States of America") == "Dine out, taste a Mac, fries"

oblg. extra linkage by Anonymous Coward · 2005-07-26 23:05 · Score: 0

login details for this site are:

u:newidea
p:ideaman

via BugMeNot.

This is the speech in mp3 and the speech in AAC/M4B (for iTunes/iPod).

Re:oblg. extra linkage by whatever3003 · 2005-07-26 23:15 · Score: 1

... if you enjoyed the talk (and perhaps even if you didn't) do what the site says and ..
Put your money where your ears are... and consider donating. Registration for IT Conversations is free it seems, but bandwidth isn't.

--
"Those who do not want to imitate anything, produce nothing." -- Salvador Dali
Re:oblg. extra linkage by Anonymous Coward · 2005-07-27 18:43 · Score: 0

Well, that's f'ing stupid. Why don't you just click the "continue without registration" button?

an aggregation model by ear1grey · 2005-07-26 23:27 · Score: 3, Interesting

The slashdot story mis-sells the content of the speech. For me it was just AB talking about how it would be useful to have a simple system of aggregation that goes beyond subscribing to an RSS feed.

It's not a new data model & the semantic has not failed, in fact, it's more important when considering how to work with the diverse resulting data.

--
boakes.org

Really enjoyed, but not sure I buy by astrashe · 2005-07-26 23:36 · Score: 5, Insightful

This is a great talk, and I really enjoyed it, but I'm not sure I buy it.

I haven't really digested the talk, so maybe that's why. But this is my gut reaction against what he's saying.

I don't think that geeks fully acknowledge the role of what I think of as bibliography in the web ecosystem.

I was an English major. Let's say that you want to learn about Faulkner. If you go to the card catalogue, and search for books about Faulkner, you get a lot of hits -- more books than you could ever read. It's essentially useless.

What you really need is a bibliography -- something written by a Faulkner scholar who says "these are the really important and groundbreaking books about Faulkner." That's one of the cool things about Encyclopedia Brittanica -- at the end of their articles, they tend to give you a run down of some of the key books on the subject.

So if you want to read a biography of George Washington, EB will let you find the right one. That's important, because there are so many biographies of George Washington out there.

That's my key point. If you go to a university library and use the catalogue to do a mechanical search for books about George Washington, the results aren't very useful. But if you read the bibliography at the end of the Encyclopedia Brittanica article, it's extremely useful.

I'm trying to draw a distinction between mechanical searches, on one hand, and selections based on human judgement on the other.

Google is useful in larege part, I think, because page rank lets you find what are essentially good bibliography pages. You use a dumb mechanical search to put you in touch with people who know their subjects and who have good judgement (hopefully).

The other day, for example, I was thinking about an old programming language called APL. I searched for it, and found a couple of pages that seemed to have collected just about everything APL -- anecdotes, personal histories, tutorials, implementations, pictures of the goofy APL keyboards, etc.

The Google powered web is cool because it combines the mechanical and the bibliographic so well. Google gets me to the bibliography -- it pulls that needle out of the haystack. But it's the bibliography that lets me drill down.

This is important. The really good stuff I read about APL didn't come directly from the actual google result page. There was a link in between -- the google result page took me to the APL bibliography page, and from there I was able to hit the meat of the matter.

We've seen, over the past decade, an explosion in which mechanical searching can do. Because it's been getting so much better so quickly, it's dominating the way we think about how we find information. It's causing us to give bibliography -- the judgement of experts -- short shrift.

But bibliography is absolutely key to the google ecosystem.

My problem with attempts to impose more structure on data is that it always breaks things. It's beefing up mechanical searches, which are already very good, and it does it at the expense of bibliography.

I buy the argument in this lecture more than the guy making it does. He complains about heavier structures, and how the complexity will prevent people from producing and consuming information. I think that almost any move away from what we have now will do the same thing. The more you structure information, the harder it is for people to provide bibliography.

The point is that the ideal medium for bibliogrphy is free form -- one person saying, "this is what I think" to another.

The genius of google is that page rank gives you a mechanical way to uncover the best bibliographies. The best ones tend to show up at the top of the results.

In the old days, there was alta vista, and there was yahoo. Yahoo used human beings to categorize data manually. They'd put sunglasses next to the best sites in many categories -- flag something as a "cool site". Alta vista was pure mechanical searching, with no human judg

Re:Really enjoyed, but not sure I buy by Maian · 2005-07-27 00:33 · Score: 1

I wholeheartedly agree.
Honestly, I don't know much about the Semantic Web, but I have my doubts. In addition to its mechanical nature, I suspect the Semantic Web may eventually be plagued by abuse: search engine optimization. HTML is presentable so even if it's being abused with SEO, the human can verify whether it's crap or not. Can the machine do that?
That's where the search engine comes in. Something that sifts thru the available data and presents the tidbits that are ripe for picking.
I predict that this very act of picking relevant data is going to be biggest technology of tomorrow. Imagine a search engine-powered message board. I'm not talking about a message board that has a search feature; no, I'm envisioning a message board that's powered by a search engine: posts are linked automatically to relevant posts, and relevant posts of a topic are automatically presented to viewers. Something like /. except even better, since it would get rid of most redundant posts.
That's the future I'm talking about.
Re: Really enjoyed, but not sure I buy by gidds · 2005-07-27 00:37 · Score: 2, Insightful

Don't you think that Google itself is functioning like a bibliography? The important pages, the ones most worth seeing, are likely to be the most linked-to, and so appear at the top of the list. The rating is done by every web site creator, and the collation by Google; doesn't that make PageRank effectively a bibliographic tool?

--
Ceterum censeo subscriptionem esse delendam.

Great by Mensa+Babe · 2005-07-26 23:38 · Score: 3, Funny

A new data model?

Couldn't we please focus on implementing the old data model correctly first?

--
Karma: Positive (probably because of superiour intellect)

a case of capt. obvious? by tod_miller · 2005-07-26 23:58 · Score: 4, Insightful

or is it just me? I know it is hard to predict the way technology is going, the only reason HTML still is around is because it works, and was widely adopted, and nothing else gives any [real] benefits (for now).

as far as I am concerned, however you split up content, style, updates, 'sitefiles' (my collective analogue for rss and related technologies) the fact is one coherent, styled document must be the end result.

Too much is being read into content management and RSS. Yes RSS is cute, I use it to have a BBC and CNN link in my firefox, and I just one click to read articles, not go to the site.

RSS and podcasting is the worst combination of not-new hype ever. Downloading a file through the web, wow new! :-)

Seriously, pod casting should be renamed downloading audio.

--
#hostfile 0.0.0.0 primidi.com 0.0.0.0 www.primidi.com 0.0.0.0 radio.weblogs.com

Re:a case of capt. obvious? by Tablizer · 2005-07-29 15:57 · Score: 1

or is it just me? I know it is hard to predict the way technology is going, the only reason HTML still is around is because it works, and was widely adopted, and nothing else gives any [real] benefits (for now).

Most developers who used to build "rich client" apps generally agree that web-based GUI's are a pain in the ass and lacking decent cross-browser widgets such as editable data grids, collapsable outline/trees, combo boxes, and others. Most companies like web apps because they are much easier to deploy. It is the deployability, not the user interface or developer productivity companies like.

--
Table-ized A.I.

Centralized database logic doesn't scale? by Free_Trial_Thinking · 2005-07-27 01:03 · Score: 1

I didn't understand why he said that? I've always heard it was good to put all your logic into the DB.

Would anyone care to explain that a little? And please dumb it down a lot, I'm not that smart in databases.

Re:Centralized database logic doesn't scale? by Anonymous Coward · 2005-07-27 01:45 · Score: 0

I didnt quite follow that either. So im going to give an answer i think he was getting at. Least you can tell me if were on the same page? Or if im wrong surely i will get corrected.
Was the scale issue based on the fact that a database resides on one machine? That you couldnt scale out DB logic over multiple machines? i get the idea that you could have like say a server farm but not a DB farm?
Re:Centralized database logic doesn't scale? by mwg_stpaul · 2005-07-27 03:25 · Score: 1

It's more a case of maintainability.
If "doesn't scale" simply means "I need more proc/mem/disk" you can always throw more horsepower at the problem, but that shifts the solution to a question of how much money you have to spend on toys. That's not what I'm guessing he's referring to, though.
Without listening to or reading the presentation, I assume he's talking about the standard n-tier development/deployment model. Keep your presentation layer, business logic layer and data layer separate so that you can maintain/enhance/throw away any one of them without destroying the others.
You can get away with business logic in the db layer on smallish projects, or ones where you can guarantee that the coupling between the logic and it's representation in the data model won't change much over time. Other than that, it's a shortcut that will bite you eventually.
Re:Centralized database logic doesn't scale? by rho · 2005-07-27 03:35 · Score: 1

As I understand him, putting all your eggs in the database basket limits you to the speed of your database server.
This is, of course, pure bunk because Google does exactly this and Google scales well. Difference is the money available to you, Web programmer, and Google, Web moneybags. It's bunkum, but very wise bunkum nonetheless, unless you have a billionaire uncle who signs documents without reading them.
The workaround for the limits of your database is, Adam claims, to share your data in RSS/Atom feeds where it can be cached by other servers, RSS readers, etc., freeing your server to serve your expected and accounted-for load.

--
Potato chips are a by-yourself food.
Re:Centralized database logic doesn't scale? by sheldon · 2005-07-27 03:37 · Score: 1

I suspect it depends on what you mean by "business logic".

Some argue that you shouldn't even put foreign key restraints in your database... the app can handle that for you and it'll make it faster.

Others argue that it is key to maintain the integrity of your data. If this means putting lot's of logic in the database in the form of procs, views, triggers, etc... that's what you need to do. Better to normalize and have accurate data than to denormalize and have speed.

It all depends on what your needs are.
Re:Centralized database logic doesn't scale? by TuringTest · 2005-07-27 05:12 · Score: 1

I think he meant that a centralized approach doesn't support the logic of *all users*. You can put *your* logic in your Flicker-web-images DB, but would you also merge in it all the latest scripts developed by your users on top of your web app?

No, you publish an API and other people use that API to access your contents. The logic of *their* web applications is in their sites, not in your DB.

--
Singularity: a belief in the "God" idea with the "demiurge" relation inverted.
Re:Centralized database logic doesn't scale? by anomalous+cohort · 2005-07-29 02:57 · Score: 1

If "doesn't scale" simply means "I need more proc/mem/disk" you can always throw more horsepower at the problem, but that shifts the solution to a question of how much money you have to spend on toys.
Let's give a concrete example to this. Consider a hypothetical 2 tier system with a thin client talking to a centralized database where all the business logic is being handled. This system gets deployed on a top-of-the-line Sun Fire server running Oracle. The system is successful and its usage grows rapidly. One day, that server can't keep up with the demand. The next stop is moving to DB2 on a mainframe.

Do you have any idea what the cost of a mainframe is? Your company has to be the size of a government in order to afford one. It would be cheaper to rewrite the application than to upgrade the hardware. When its cheaper to rewrite than to upgrade, then you don't have a scalable system.

If the business logic is handled on a separate app server and if the system is designed in order to be cluster aware, then all you have to do is buy another server of equal strength to the first server. Thus, the centralized database running all business logic is not scalable.

Spam killed the simple pagerank by ear1grey · 2005-07-27 01:23 · Score: 1

The important pages, the ones most worth seeing, are likely to be the most linked-to...

In the days before comment and referral spam, that might have been true. It remains to be seen if rel="nofollow" (a semantic annotation of sorts) proves successful in re-invigorating the importance of the simple link.

--
boakes.org

Oh boy... now you've done it by wbren · 2005-07-27 02:05 · Score: 1

The average Slashdot story links to a 2-5 minute article, and most people don't even bother to read that before they post a comment. Since this story links to a 40-minute MP3 that no one will bother listening to, the comments page should be an interesting read...

--
-William Brendel

Very skeptical by Exaton · 2005-07-27 02:12 · Score: 2, Insightful

Sorry, I trust Sir Tim Berners-Lee more than I trust "Adam Bosworth".

That guy can start by learning how to add some <br />'s in what he writes (go check out his blog -- horrendous !) before pretending to talk about Web fundamentals.

Re:Very skeptical by styrotech · 2005-07-27 13:37 · Score: 1

I'm sure if you asked Tim Berners-Lee about Web fundamentals he would tell you not to use /> as it is not semantic markup and to use paragraphs instead - like the blog in question does.

What does understanding Web fundamentals have to do with someone using excessive paragraph lengths? That's bad writing rather than bad markup.
Re:Very skeptical by Exaton · 2005-07-28 05:00 · Score: 1

OK, score one for you :-) I never use line breaks apart from what PHP's nl2br() will generate, so I wasn't careful with my remark.

Mea culpa, and all that ^_^;

Does it matter (Re:Not really)? by philipmather · 2005-07-27 02:27 · Score: 1

Nevertheless, for the masses in their office cubicles RSS feeds are the next "big thing".

Try telling the masses that the next big thing is a new data model for the web, based on semantics, and 99% of them will ask you what "semantic" means, never mind the intangible data model that is the real underlying improvement.

Show them a little program that sits on their desktop and feeds them the latest from CNN, the BBC etc and they understand that.

Web development and IT in general is running a real risk of falling into a litany of problems that similar industries have, without trying to mitigate them or indeed learn anything from them.

As MoonFog points out, we run the risk of doing what AI researchers did. Promising the Silver Bullet was just around the corner... "If the average Joe can't understand a computer then we can make the computer understand them!" ...and then being able to deliver that promise in the time frame we promised.

How about mobile telecoms companies such as Nokia or 3? They accidentally invented the bane of good social graces in the 20th century, text messaging, and have forever since been trying to come up with the next great money spinner. Have MMS, 3G, video phones, PTT, mobile gaming etc... ever really produce as much money or been as much of a success as texting?

Finally, the manufacturing industry and mechanical Engineers (esp. in the UK). It may have been Maggie who nailed the coffin shut (and to this day the manufacturing industry in Britain is a bit of a mess, just look at Rover) but it was foreshadowed by a calamitous shifting of project management responsibilities beforehand. Large mechanical engineering projects went from being managed by those who did it but for the love (Isambard Kingdom Brunel right through to Sir Frank Whittle) to "professional" managers who couldn't maintain the level of innovation, back finally to the Engineers themselves who promptly allowed everything to go over budget and over time because nobody had taught them about project management.

Where am I going with all this ()? Is RSS our predictive text, a nice addition but not truly the next big thing? So what is the "next big thing"? Web 2, The Semantic Web? Possibly, but we need to be careful, our management structures are currently in the stage of evolution between leaving those who did it for the love (Hello Linus ;^) and Professional Management (and I think we can all link back to "those" discussions) who may not be able to understand the potential and maintain our innovation, leading us down the path of the AI'ers.

If the masses can't tell the difference between the apples and the oranges then the comparison won't bother them, just give them the apple now and work on making the orange even better.

--
Regards, Phil

Re:Does it matter (Re:Not really)? by ClosedSource · 2005-07-27 17:02 · Score: 1

"Try telling the masses that the next big thing is a new data model for the web, based on semantics, and 99% of them will ask you what "semantic" means, never mind the intangible data model that is the real underlying improvement"

Actually, it's a quite logical question to ask. Research projects without any discernable end or application are often indistinguishable from Bullshit.
Re:Does it matter (Re:Not really)? by philipmather · 2005-07-27 20:03 · Score: 1

Especially for non-specialists in a technical area , I'm not saying esoteric research rarely has value, in our field it's usually the most valuable.
Now the IT field is "mature"(-ish), it's getting a lot more exposure. As developers we need to be presenting simple (not trivial) but elegant little demos to people built on top of whatever the latest great "model" is and then asking for money and contributions.
A lot of otherwise promising projects are doing it the other way round...
"We/I've got this great framework/design, if I could have £X I could make it do Y" ...rather than saying...
"Look at Y, it's only small but isn't it neat? It's built on top of this framework/design, but I need £X to make some larger neat things and perfect the framework." ..., obvious? Maybe, but people still over look the deeper advantages. If you've got funding for a project that lacks a phyical manifestation the great temptation is to "tweak" the framework a bit more before building that neat thing on top. Whereas with a real product on top it constrains you a bit but does give you something to abate the commercial pressure with. Specifically you're manager/sugar daddy/whatever will be able to distinguish it from bull shit because he's got something to play with.
It's so easy, we're developers after all, we've reached a concluesion (if not the end) when a "product" is finished but if people carry on as is we'll risk developing (sorry) a reputation for hype and not a lot else.

--
Regards, Phil

SQL Is Not Relational ( & incorrect citation) by Anonymous Coward · 2005-07-27 03:27 · Score: 2, Informative

SQL was/is a industry attempt at a language for relational databases. Codd (father of relational database theory) criticized weaknesses in the SQL implementation. Do not make the mistake of thinking "SQL = Relational".

The language Tutorial-D in the article you refer to is yet another language for relational databases! Darwen and Date are critics of SQL implementations; they are NOT critics of the relational database as you imply. They are instead the strongest relational database proponents.

Indeed the relational model is the only model with logically provable underpinnings. In ON DOCUMENT- VS. DATA-BASES Chris Date explains:

Types are things we can talk about. Relations are sets of statements that we can utter about those things. What can you do without the latter? The problem of OO is that they have just types, no relations.

And about "document databases" (this would include HTML & XML):

A document mixes data with layout (presentation). Databases deal with the former, intentionally leaving the latter to applications. Furthermore, the structure of the document is not such that it lends itself to the kind of inferences that are made from databases. What is the atomicity, selectivity, and correctness for a document base?

RSS Same As NNTP Newsgroups... by Anonymous Coward · 2005-07-27 03:46 · Score: 2, Insightful

but with

no standardized means of replying/interacting,
no means of maintaining topicality,
no means of adding attachments,
no bonafide archive,
poor performance,
egos that want to control responses to their posts.

NNTP is an irreplaceable source of technical information. In contrast the world wouldn't skip a beat if all RSS feeds stopped tomorrow.

sin(*sigh*)^2 by Anonymous Coward · 2005-07-27 05:18 · Score: 0

"He's got one thing right: XQuery (return to the hierarchic databases of yesterday) and RDF (return to the network model, but with a fixed 3-value schema) are nothing to waste your time on."

Well damn! There goes my Firefox extension.

Re:sin(*sigh*)^2 by dorkygeek · 2005-07-28 00:59 · Score: 0

RDF (return to the network model, but with a fixed 3-value schema)
I don't see a problem with a 3-tuple scheme. It is defined as (subject, predicate, object), e.g. "roses are red" (pleeeease, no 'roses are blue' jokes now), which makes it possible to build a graph connecting everything you like. By introducing two statements about the same subject, you can add more properties, therefore no need for bigger-than-3-tuples.

--
Windows is like decaf - it tastes like the real thing, but it won't get you through the day.

Web services moving to RSS by glinden · 2005-07-27 06:44 · Score: 1

In the speech, Adam Bosworth predicted that "RSS 2.0 and Atom will be the lingua franca that will be used to consume all data from everywhere" because they "are simple formats that are sloppily extensible."

It's true that many seem to be moving in this direction. For example, A9's OpenSearch is a simple extension to RSS. The Findory API offers simple, RSS-based access to news and blog search results. Yahoo offers a few services through more the more complex Yahoo APIs, but offers many more through Yahoo RSS, including news and web search results.

It seems that most web services may end up standardizing on simple REST protocols using RSS and Atom.

NEW Data Model by MrCopilot · 2005-07-28 02:42 · Score: 1

Is he also played by Brent Spiner? I hope he doesn't have that puss chip.

Is there a Dweeb mod point?

--
OSGGFG - Open Source Gamers Guide to Free Games

Re:SQL Is Not Relational ( & incorrect citatio by malachid69 · 2005-07-28 03:52 · Score: 1

The language Tutorial-D in the article you refer to is yet another language for relational databases! Darwen and Date are critics of SQL implementations; they are NOT critics of the relational database as you imply. They are instead the strongest relational database proponents.

Yes, I understand that Tutorial-D is an attempt to make a more-correct implementation. I was mostly referring to these two comments from said article:

Relational databases as we know them today are, however, far from optimal - at more than one level.

and

It should come as no surprise, then, that work has been going on for some years to devise a more correct alternative to the relational database as we know it today.

Perhaps I overstated it -- but the point that CURRENT implementations (which is what most would suggest to use instead) are not that good still remains.

XML:A document mixes data with layout (presentation).

XML, used correctly, should include NO presentation in the XML data file itself. That is to be left up to the XSL. Even when using XHTML, it should be left up to the CSS. When I refer to an XML database, there is no presentation whatsoever in the XML file.

Furthermore, the structure of the document is not such that it lends itself to the kind of inferences that are made from databases. What is the atomicity, selectivity, and correctness for a document base?

That is the purpose of the XSD (or DTD or RelaxNG, or whatever you prefer). Using XSD, you have a very large amount of control over the structure of the document.

--
http://www.google.com/profiles/malachid

54 comments