Researchers Create Database-Hadoop Hybrid

← Back to Stories (view on slashdot.org)

Researchers Create Database-Hadoop Hybrid

Posted by kdawson on Tuesday July 21, 2009 @07:02AM from the try-this-at-home dept.

ericatcw writes "'NoSQL' alternatives such as Hadoop and MapReduce may be uber-cheap and scalable, but they remain slower and clumsier to use than relational databases, say some. Now, researchers at Yale University have created a database-Hadoop hybrid that they say offers the best of both worlds: fast performance and the ability to scale out near-indefinitely. HadoopDB was built using PostGreSQL, though MySQL has also successfully been swapped in, according to Yale computer science professor Daniel Abadi, whose students built this prototype."

122 comments

Min score:

Reason:

Sort:

Please stop by Anonymous Coward · 2009-07-21 07:06 · Score: 3, Interesting

Uber-cheap is not a word, and it doesn't even make sense because you're saying it's "above cheap". Stop making up stupid shit.
1. Re:Please stop by CorporateSuit · 2009-07-21 07:14 · Score: 1
  
  German prepositions do not have direct english equivalents. I suppose being an "Ubermensch" would be talking about the HATS that people wear, since that's what's Over the Mensch (person). Stop getting your panties in a twist over things you're wrong about.
  
  --
  I am the richest astronaut ever to win the superbowl.
2. Re:Please stop by Anonymous Coward · 2009-07-21 07:20 · Score: 0
  
  Can you commemt on this system's uber scalability?
3. Re:Please stop by Anonymous Coward · 2009-07-21 07:25 · Score: 0
  
  Oh, hey, nice straw man. "Superior to average cheap" doesn't really make any more sense.
4. Re:Please stop by spatley · 2009-07-21 07:28 · Score: 1
  
  how about Ultra-Cheap
  hmmmm...
5. Re:Please stop by Anonymous Coward · 2009-07-21 07:29 · Score: 0
  
  I'm not sure what is worse. All the fatasses running (hobbling, waddling) around or their attitude of blaming their fatass-ness on everyone and everything but themselves.
6. Re:Please stop by CorporateSuit · 2009-07-21 07:35 · Score: 4, Insightful
  
  Considering that "Ubermensch" was translatable to "Superman" then "Ubercheap" would be "Supercheap"
  
  It's called a prefix. We use them in the English language. This one has recently been adopted into our language. Pick up the pace or shut up about things you don't know.
  
  --
  I am the richest astronaut ever to win the superbowl.
7. Re:Please stop by Anonymous Coward · 2009-07-21 07:38 · Score: 1, Insightful
  
  Uber-cheap is not a word, and it doesn't even make sense because you're saying it's "above cheap".
  You remind me of my English teachers. Every year they kept saying that "ain't" isn't a word because it's not in the dictionary. Then one day I looked in the dictionary and it was there. The lesson I learned was that humans create words and "rules" of language aren't really rules at all. They are merely traditions. I suppose you think the French are just speaking bad Latin? No, languages change. From Old English to Middle English to Modern English it changed. I bet all along the way there was some know-it-all jackass pedant saying "thither isn't a word!"
8. Re:Please stop by jd · 2009-07-21 07:39 · Score: 1
  
  Since a high price is above a low price, "above cheap" means "expensive".
  
  --
  It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
9. Re:Please stop by gandhi_2 · 2009-07-21 07:44 · Score: 2, Funny
  
  I am a metaphor for war.
  You are for metaphor war.
  
  --
  THL phish sticks
10. Re:Please stop by Anonymous Coward · 2009-07-21 07:50 · Score: 0
  
  So throwing any mal-adapted prefix on any word is cool by you? Okie dokie. How about: "Double-plus-cheap"? Needlessly tacking on dated modifiers onto words looks mighty lame, doesn't reflect well upon the speaker, and tends to color the listener's perception of the message.
  But feel free to keep being a jerk.
11. Re:Please stop by sifur · 2009-07-21 07:50 · Score: 0
  
  Please stop.
12. Re:Please stop by Freetardo+Jones · 2009-07-21 07:52 · Score: 2, Informative
  
  Considering that "Ubermensch" was translatable to "Superman" then "Ubercheap" would be "Supercheap"
  
  No, it wouldn't. It would be word soup that any German would find to be awkward. To say something is "super cheap" they would say something like "superpreiswertes" which would literally translate as "super inexpensive". They wouldn't use Ã¼ber in such a situation.
13. Re:Please stop by Anonymous Coward · 2009-07-21 07:53 · Score: 0
  
  That's only true if you order by price desc.
14. Re:Please stop by Anonymous Coward · 2009-07-21 08:12 · Score: 0
  
  Uber-cheap is not a word, and it doesn't even make sense because you're saying it's "above cheap".
  You remind me of my English teachers. Every year they kept saying that "ain't" isn't a word because it's not in the dictionary. Then one day I looked in the dictionary and it was there. The lesson I learned was that humans create words and "rules" of language aren't really rules at all. They are merely traditions. I suppose you think the French are just speaking bad Latin? No, languages change. From Old English to Middle English to Modern English it changed. I bet all along the way there was some know-it-all jackass pedant saying "thither isn't a word!"
  So in other words, a teacher's use of faulty reasoning influenced your views towards language. It sounds like that teacher stuck with literature and grammar because logic was not her strong suit.
  
  What she should have told you is that "ain't" is slang and has a wide reputation for being used by some rather backwards, unsophisticated, uneducated, functionally illiterate people. It is reminiscent of the classic "redneck" stereotype and often goes with it. She should have explained that whether it is right or wrong, people will judge you negatively when you use forms of language that have such associations. That's because when you adopt the speech and/or mannerisms of a group, you are identifying yourself as a member of that group whether or not this was your intention. This might not matter when you're ordering a cheeseburger but could seriously hurt your chances at something like a job interview. That would have been a meaningful explanation that actually communicates the problem she had with your use of this word. It would have been much better than the trivially-falsified faulty reasoning that she gave you.
  
  So-called "ebonics" has similar problems. If the changes to English displayed in this "dialect" did anything to decrease the ambiguity or to increase the expressive power of the language, then I'd have no problem with it. Unfortunatley ebonics does none of these things. It's just a bastardization of an existing language by a group of people who desperately want to stand out and be different, and so is nothing like the "French are not just speaking bad Latin" example that you gave. It's just poor English that is the product of an uneducated socio-economic underclass and there's nothing wrong with saying so, except that saying so doesn't make you want to celebrate it. This is compounded by "political correctness" which means that anyone who points this out, or rejects this and all other forms of insanity and/or stupidity for the sake of preserving a cultural identity that refuses to assimilate, is branded a "racist" no matter how well-founded his objection may be. This institutionalized madness goes as far as "teaching ebonics" as a "second language" in public schools.
  
  Right now mainstream "black America" regards itself as living in a "white country" and resents it. It therefore wants to have its own counter-culture that flies in the face of the prevailing culture. It then gets surprised and angry when this results in opposition, and demands that its counterculture be accepted unopposed and that only a racist would not wish to do so. The easiest way to help black-white race relations would be for the majority of black people to realize that America was supposed to be a melting-pot, and that they can assimilate into America the same way that the Scottish, Irish, French, British, Chinese and others have done (nothing racial about it). You know what all of those other groups have in common? They can all come to this country "fresh off the boat" and within one generation their children can speak perfect English and otherwise cannot be distinguished from other Americans except for their physical appearance. That's because they understand that it's silly to live in America when you don't want to be an American.
  
  What we're doing right now is making us b
15. Re:Please stop by camperdave · 2009-07-21 08:13 · Score: 1
  
  Yes, I guess it should be Infra-Cheap.
  
  --
  When our name is on the back of your car, we're behind you all the way!
16. Re:Please stop by nog_lorp · 2009-07-21 08:17 · Score: 1
  
  Word soup? Have you *HEARD* German lately? Most of their speech is made up of huge conglomerations of words and prefixes and suffixes.
  For example, the word for CPR in german? Herzkreislaufwiederbelebung (heart-circle-run-again-enlivenment).
17. Re:Please stop by Freetardo+Jones · 2009-07-21 08:24 · Score: 1
  
  Of sure, German is an extremely over verbose language at times, but the fact of the matter is that CorporateSuit, despite all his blusterings, is about as clueless in German as he tries to claim others are.
18. Re:Please stop by Anonymous+Psychopath · 2009-07-21 08:24 · Score: 2, Insightful
  
  We've been co-opting other language's words into English for a long, long time now. To a growing number US citizens prefixing anything with "uber" is the same as saying "ultra" or "super". You know the saying "it's all over except for the shouting"? Yeah, that's pretty much where this is.
  Feel free to mod this entire thread, including the parent, uber off-topic.
  
  --
  Eagles may soar, but weasels don't get sucked into jet engines.
19. Re:Please stop by msimm · 2009-07-21 08:26 · Score: 1
  
  Pushing a translation into colloquial English does not make it a model for translation. When I'd first come across ubermensch reading Nietzsche it was described to mean 'overman'.
  
  --
  Quack, quack.
20. Re:Please stop by Anonymous Coward · 2009-07-21 08:29 · Score: 0
  
  The or Sehr Billig!
21. Re:Please stop by Anonymous Coward · 2009-07-21 08:32 · Score: 0
  
  Well, it's more like heart-bloodcircuit-reanimation. You're purposely splitting things into pieces that are too fine there.
22. Re:Please stop by misexistentialist · 2009-07-21 08:36 · Score: 1
  
  "Overcheap" would work, although it would have to be read somewhat ironically since "over" usually has a negative connotation. Anyway the Germans call cell phones "Handys", and really shouldn't complain about what we borrow from their language.
23. Re:Please stop by Anonymous Coward · 2009-07-21 08:38 · Score: 0
  
  And an asshole, to boot.
24. Re:Please stop by CorporateSuit · 2009-07-21 09:07 · Score: 1
  
  the fact of the matter is that CorporateSuit, despite all his blusterings, is about as clueless in German as he tries to claim others are.
  I suppose that all comes from living in Germany for several years, speaking nothing but German with around 1,000 people a week, face to face. I suppose anyone who had gone through such rigors would end up being "clueless" in German as well. All sarcasm aside, perhaps you are more right than you think. Some Germans don't consider Koelsch or Hessisch (the dialects I ended up speaking) to be real German at all (Although they are more understandable than Bayerisch or Frankfurterisch - which is like Hessisch on crack). "Ich bin am lesen", while correct in some dialects, is unacceptible grammar in others.
  
  And adding to the other ways to say "cheap" would be "Ueberaus Billig", "T.I.P." or "sehr guenstig"
  
  --
  I am the richest astronaut ever to win the superbowl.
25. Re:Please stop by Hoi+Polloi · 2009-07-21 09:10 · Score: 4, Funny
  
  This thread is uber-dumb.
  Cartman would say it was "hella-stupid".
  
  --
  It is by the juice of the coffee bean that thoughts acquire speed, the teeth acquire stains. The stains become a warning
26. Re:Please stop by GameMaster · 2009-07-21 09:14 · Score: 1
  
  The way I see it, the real question should be "does it increase the ambiguity of the language or decrease it's expressive power?". As long as someone understands what is being said (with slang like "ain't" that has been in use long enough so it is widely known) then I don't see a problem with it. We may become, somewhat Balkanized in the short-term, but, hopefully, this will serve to get those conservatives used to living in a pluralistic society and will wear down some of their xenophobia. I see the real problem as being the mindset of fearing anyone that talks differently from you. Sure, a little of it is natural but we have been way past that point for a long time and it's something that the US, as a culture, should train itself away from.
  
  --
  
  Rules of Conduct:
  #1 - The DM is always right.
  #2 - If the DM is wrong, see rule #1
27. Re:Please stop by MikeBabcock · 2009-07-21 11:49 · Score: 1
  
  Uber doesn't mean 'above' in popular parlance, its an absolute measure of greatness. Therefore 'uber' cheap would refer to more cheapness.
  
  --
  - Michael T. Babcock (Yes, I blog)
28. Re:Please stop by msimm · 2009-07-21 12:13 · Score: 1
  
  If I could tag comments I'd tag 'lol'. We should be able to tag comments, and those comments could be called /twits, in memory of some other service.
  
  --
  Quack, quack.
29. Re:Please stop by Anonymous Coward · 2009-07-21 13:32 · Score: 0
  
  Leopold, is that you?
30. Re:Please stop by Anonymous Coward · 2009-07-21 15:09 · Score: 0
  
  The way I see it, the real question should be "does it increase the ambiguity of the language or decrease it's expressive power?". As long as someone understands what is being said (with slang like "ain't" that has been in use long enough so it is widely known) then I don't see a problem with it. We may become, somewhat Balkanized in the short-term, but, hopefully, this will serve to get those conservatives used to living in a pluralistic society and will wear down some of their xenophobia. I see the real problem as being the mindset of fearing anyone that talks differently from you. Sure, a little of it is natural but we have been way past that point for a long time and it's something that the US, as a culture, should train itself away from.
  There's a difference though between a culture that really does have its own independent traditions and its own unique ways, and a culture that is merely an uneducated bastardization of an existing one born of a desperate need to maintain a separate identity based on people who look like you do. Right now, too many American blacks have bought into that second category. Meanwhile, everybody thinks this is about race and they don't seem to understand it's about culture.
31. Re:Please stop by moonbender · 2009-07-21 23:29 · Score: 2, Insightful
  
  Compounds parse easier with correct parantheses: (herz)(kreislauf)(wiederbelebung) or (heart)(circle-run)(re-activation), where each of the bracketed words is itself a common compound. FWIW, Cardiopulmonary resuscitation has more characters than the German term. German and English aren't very different, in fact, in terms of compounds; English also has a huge number of compound words, even though they are often not spelled as a single word: circuit breaker, for instance. As English compounds get increasingly entrenched, the compounds tend to get hyphenated, and eventually they are written as a single word.
  
  --
  Switch back to Slashdot's D1 system.
32. Re:Please stop by moonbender · 2009-07-21 23:35 · Score: 1
  
  You're right that ueber would not conventionally be used as a prefix in this situation, but we weren't talking about the German prefix ueber, but about the English prefix uber, which was adopted from German. The fact that you wouldn't say ueberbillig in German doesn't mean that it's improper to use ubercheap. It makes you sound a bit like an ass, but I would argue that it's in line with other conventional uses of the "uber" in English. To put this in a different perspective, English probably uses the Latin prefix super in situations in which it would not have been used in Latin; or at least if it did, nobody would care.
  
  --
  Switch back to Slashdot's D1 system.
33. Re:Please stop by beerbear · 2009-07-22 03:01 · Score: 1
  
  Oh yeah cardiopulmonary resuscitation is a lot better... that's why everyone uses the acronym.
  
  --
  Hold my beer and watch this!
PostGreSQL by tcopeland · 2009-07-21 07:08 · Score: 4, Informative

It's PostgreSQL... but I sympathize with the mixed case confusion and refer you to this Postgres vs PostgreSQL permathread.

--
The Army reading list
1. Re:PostGreSQL by Anonymous Coward · 2009-07-21 10:58 · Score: 0
  
  Raspberries to the PostgreSQL project for settling on a name with a pronunciation that's counterintuitive and difficult. For the life of me, I want to say post-gray-sequel every time I see it, not post-gress-cue-ell.
2. Re:PostGreSQL by WuphonsReach · 2009-07-24 00:38 · Score: 1
  
  And... a lot of times in written communication, I simply say "pgsql".
  
  I do try to capitalize PostgreSQL in more formal communication / documentation, however.
  
  --
  Wolde you bothe eate your cake, and have your cake?
If it works as described it will be VERY important by Etylowy · 2009-07-21 07:09 · Score: 2, Interesting

If both the performance and scalability is as good as described I can safely say that this is the most important thing of the decade and not only for DBMS.
Handling large portions of data would get cheaper by an order of magnitude at least and scaling out would be way cheaper than now as well. I do hope it's true.
What about Essbase? by r_jensen11 · 2009-07-21 07:11 · Score: 1

I thought Essbase was supposed to be one of the best databases for managing too much information. Is this supposed to be an alternative, or act as something in-between using Essbase and a mysql server?
1. Re:What about Essbase? by hemp · 2009-07-21 09:18 · Score: 1
  
  Transaction speed has never been high point of Essbase, nor storing anything but numerical data.
  Changes to the data are not reflected immediately except in the lowest level members until it is re-calculated. It is not unusual to find calc scripts that run for 8+ hours.
  
  --
  Skip ------ See the latest from http://www.anArchyFortWorth.com
Re:If it works as described it will be VERY import by Anonymous Coward · 2009-07-21 07:14 · Score: 1, Insightful

It won't deliver. In the mean time for those of us living and working in the real world, hard-drives will be bigger and faster, file systems will get better, and SSDs will start to shit all over spinning platters.
Typical by MarkPNeyer · 2009-07-21 07:17 · Score: 1

The grad students do all the work, and the professor takes all the credit. Anyone can come up with ideas, the real work is in actually getting things done. This is the reason I stopped grad school with my MS even though I LOVE computer science, more than anyone i've ever met.

--

My blog
1. Re:Typical by Archangel+Michael · 2009-07-21 07:50 · Score: 1
  
  I hate to disagree with you ... but ....
  Anyone can come up with ideas is true, HOWEVER not all ideas are GOOD ones. The problem with coming up with GOOD ideas is often people don't have a basic understanding of the problem or the implications of various ways of implementing an idea.
  Getting people to do the work is often not quite as easy as it seems. First you have to have qualified people. They have to be motivated to actually complete the work given.
  As for degree programs at schools and such, a MS is nothing more than a piece of paper ascribing practical and theoretical knowledge of a certain level. MS (MA) degree shows an extra two years of mastery that a BS does not. Employers like MS degrees over BS degrees because they also take extra initiative (goal setting) that a regular BS degree doesn't.
  I'm not sure what good a MS degree is after say 5 or 10 years in a career, other than being a feather in a cap.
  I take a look at what I know, verses what I knew graduating college, and I know substantially more, and more practical knowledge, things that no MS piece of paper can show. However, I cannot quantify or qualify my knowledge other than my work experience.
  I recently had a week of training that was completely useless for me. I already either knew, or didn't need to know to do my job, and the material I didn't know I could have learned on my own, should the need arise.
  That being said, I am going to go get that MS paper now (25 years after college), so that if and when I need to look for job, I have that feather in my cap. It seems fairly important to those people doing hiring.
  
  --
  Agent K: A *person* is smart. People are dumb, stupid, panicky animals, and you know it.
2. Re:Typical by Anonymous Coward · 2009-07-21 08:47 · Score: 0
  
  They're fucking Yale students. Getting them to produce quality work is like getting the sun to rise in the morning. You just sit back and watch it happen.
3. Re:Typical by Anonymous Coward · 2009-07-21 09:48 · Score: 0
  
  They're fucking Yale students. Getting them to produce quality work is like getting the sun to rise in the morning. You just sit back and watch it happen.
  Damn right, just like George W. Bush!
  Mission accomplished.
4. Re:Typical by grepya · 2009-07-21 10:11 · Score: 1
  
  I take a look at what I know, verses what I knew graduating college, and I know substantially more, and more practical knowledge, things that no MS piece of paper can show.
  Does your extensive post-collegiage learning include constructing a multi-clause sentence in the English language (..and I wouldn't even mention the spelling error) ? Ordinarily I wouldn't be an asshole about this except you screwed up the exact sentence where you're bragging about your amazing skills, acquired over a long professional career. And whatever that career might be, writing a readable sentence in your main language is a basic skill (and I know you're an American from your other posts).
5. Re:Typical by Archangel+Michael · 2009-07-21 15:31 · Score: 1
  
  Can't spell worth shit, doesn't negate my intelligence. I know idiots who spell perfectly. I know very intelligent people who spell well, and idiots who can't spell like me. Spelling is NOT a sign of intelligence nor education level.
  And I didn't know I was going to be "graded" on a spur of the moment post to a web log. Had I known you were a lurking grammar nazi, I would have proofread my post more carefully. Perhaps even hiring someone to draft(write) it for me to post as my own.
  Knowing my weaknesses (spelling) is a strength. That, and I don't become a grammar nazi, who misses the point while nitpicking.
  I seem to have made it through college without much of a hassle with either issue you seem to point out. I wonder what that says about the college system or even the "American" Education system?
  
  --
  Agent K: A *person* is smart. People are dumb, stupid, panicky animals, and you know it.
6. Re:Typical by grepya · 2009-07-26 09:28 · Score: 1
  
  It's not just your writing that sucks. Going by your response, your reading comprehension is not so hot either. I like your rant (which, your probably did get proofread ironically) but it was about *THE WRONG THING*. I'd leave it at that. Maybe you should spend some time on reflection rather than defending the indefensible.
The SQL language is also an issue by chrysalis · 2009-07-21 07:19 · Score: 4, Insightful

Scalability is one thing, but what we appreciate in SQL-free databases is also that they don't require SQL.
When what we want is just to retrieve a record, calling get(id) is way easier and more secure than building an SQL statement, and way cheaper than using an ORM.
The Tokyo Cabinet API is absolutely excellent in this regard. And there's no need to learn yet another domain-specific language like SQL, just use the language you use for the rest of the app.
Now, SQL-zealots would troll "but how would you do with ?".
And yes, for complex requests as in data mining, SQL and XPath make sense. For people who aren't developpers, SQL makes sense as well. For interoperability with 3rd-party apps, SQL is also useful, just as FAT is still useful today in order to share filesystems between operating systems.
But for the rest of us, SQL is cumbersome. Databases like MongoDB make you achieve similar results in a more natural way instead of forcing you to learn SQL and to rethink everything in a tabular way.

--
{{.sig}}
1. Re:The SQL language is also an issue by maxume · 2009-07-21 07:30 · Score: 1
  
  Is there something terribly dangerous about parameterized queries?
  
  --
  Nerd rage is the funniest rage.
2. Re:The SQL language is also an issue by Anonymous Coward · 2009-07-21 07:36 · Score: 0
  
  That they normally take two RTs?
3. Re:The SQL language is also an issue by Abcd1234 · 2009-07-21 07:36 · Score: 0
  
  When what we want is just to retrieve a record, calling get(id) is way easier and more secure than building an SQL statement, and way cheaper than using an ORM.
  "Cheaper" in what sense? You can't mean cost, as I'm not aware of ORMs introducing any cost overhead. So presumably you mean the cost of marshaling the data between SQL and your object model.
  So, how, pray tell, do these new magical DBs convert the stored data to/from objects that your software can consume? Do they not marshal at some level? And if so, how is it that they're any cheaper than an ORM layer built over a relational database?
4. Re:The SQL language is also an issue by jd · 2009-07-21 07:38 · Score: 1
  
  I would argue that all solutions that currently exist for databases are ideal for some specific set of problems AND some specific set of users for each problem within that initial set.
  There is no "perfect" solution that will work for all types of data, be it a flatfile structure, a hierarchical structure, a relational structure, object-oriented or some combination of those. (The star-structure of OLAP databases is a hybrid, for example.)
  What would be good is if there was a suitable metalanguage in which you could define an abstract idea of the search and have a source-to-source compiler turn that into a suitable specific solution for the type of database in use, or even the specific database engine itself.
  (There are some nice tools for producing specific database SQL queries out of an abstract definition, but I know of nothing that can cross database design methodologies.)
  
  --
  It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
5. Re:The SQL language is also an issue by Underfoot · 2009-07-21 07:55 · Score: 1
  
  I would just like to state for the record, that IMHO SQL is a beautiful thing. Its ease of interoperability (both between languages and backends) has saved my butt on numerous occasions (not to mention the ease with which you can go from very simple to very complex depending on the need of your application) ...
  
  ...and you can get rid of it and replace it with OOP when you pry it from my cold dead hands.
  
  --
  I mentioned tinker-toys once in a post - now I'm modded down for life.
6. Re:The SQL language is also an issue by BitZtream · 2009-07-21 08:08 · Score: 2, Insightful
  
  When what we want is just to retrieve a record, calling get(id) is way easier and more secure than building an SQL statement, and way cheaper than using an ORM.
  Yes, I'm an SQL troll, but ... if using SQL to get a row by a unique ID is too hard for you and too insecure, there is no amount of code which is going to fix your problem, which is that you are a shitty developer who is far too lazy to make a function or macro to wrap around the simple sql request.
  There are PLENTY of reasons to not like SQL, but your example is so simplistic that it just makes me think the problem you have is you, not what the database backend is or how its used.
  As far as being forced to think of your data in a tabulare way due to using SQL, you think this is a bad thing. It isn't. Its retarded that you're going to use an SQL backend anyway, and then try to think of the data in some other form. Adding some silly layer on top to make it easier for you to understand just indicates you're too lazy to understand your own data and how to use it efficiently.
  I hate when people make stupid posts like yours. I end up sounding like an SQL fanboy and I don't even fucking like it.
  
  --
  Persistent Volume manager for Kubernetes - https://github.com/dwimsey/openshift-pvmanager
7. Re:The SQL language is also an issue by Tablizer · 2009-07-21 08:20 · Score: 1
  
  When what we want is just to retrieve a record, calling get(id) is way easier and more secure than building an SQL statement
  What's so hard about building a wrapper function for the times you want simple:
  
  function getRecord(int id) { return(query("select * from foo where id=%", id)); }
  
  And you'd still have SQL for the times you want fancier stuff.
  
  --
  Table-ized A.I.
8. Re:The SQL language is also an issue by ChaosDiscord · 2009-07-21 08:38 · Score: 4, Insightful
  
  "a record, calling get(id)"
  So you're relating "id" to "a record." I assume that the record in question is a blob of potentially binary data that your program parses however it wants. So you want to relate unique identifiers to blobs. You can do that quite easily with SQL. Looking up a given unique identifier quickly is something your average relational database is very good at. And writing the wrapper function to implement your hypothetical get() function is trivial in most languages. I'm completely at a loss for what your SQL-free database is offering me in this case. It's saving you from the horror of writing 10 lines of code, once, to implement get(in)? 60 minutes with a good SQL tutorial will teach you everything you need to know. Sure, there is a lot more you can learn, but for the simple case you're describing you can understand SQL at only the most simple level.
  Or are you handwaving the "a record" is actually automatically squeezed into one or more variables or objects in your code? You say get("ChaosDiscord") and out pops the UserObject populated with the relevant information. Of course, at this point you need to start teaching you database, or at least your database wrapper, how your objects are structured, and how to serialize them. This is admittedly a bit of a nuisance, but an SQL-free database doesn't magically make the problem go away. Sure, an SQL-free database can provide a layer to simplify or automate it, but so can a layer on an SQL database (Ruby on Rails is perhaps the best known). Sure, you'll need to tell it that username is a string, userid is an integer, and so on, but you only have to say it once in SQL instead of in your program. The total work hasn't gone up.
  Ultimately, you appear to be complaining that SQL is too powerful (and thus complex) for your needs. But you can easily learn and use a subset of SQL that corresponds to what you claim you're looking for in an SQL-free database! You might as well complain that Java is too powerful it has thousands of classes you don't need. The time to learn the relatively minor amount of SQL you need is insignificant compared to the time to develop any non-trivial application. If even that hour is too much, you can outsource the work to a geeky college student for some pizza and soda.
  There are some compelling reasons to look at SQL-free databases, but "SQL is too powerful" isn't one.
  
  --
  Search 2010 Gen Con events
9. Re:The SQL language is also an issue by Anonymous Coward · 2009-07-21 09:01 · Score: 0
  
  You are opposed to SQL because you aren't used to it. It isn't cumbersome for those of us who are used to it. One person's burden is easy for another.
10. Re:The SQL language is also an issue by Anonymous Coward · 2009-07-21 10:37 · Score: 0
  
  When what we want is just to retrieve a record, calling get(id) is way easier and more secure than building an SQL statement, and way cheaper than using an ORM.
  So you write your app, as programmers do, and you treated the DB like a filing cabinet, as programmers do. Then when the product has been in production use for a while, the boss rings you and says: "We need to know the daily distribution of payments since the new system came on-line. When can you get that for us?" And *you* say, "No problem! I'll just write a Java/Ruby/Whatever program to trawl through every record in our database, do the calculations procedurally and print the result. It'll only take a couple of weeks to do that. Oh, but we'll have to run it over the weekend because it'll kill the server."
  Then your *boss* will say, "You're fired." And you'll deserve it, because you failed to realise that in almost all serious applications, the data you work with is the reason the application exists - not the other way around
11. Re:The SQL language is also an issue by MikeBabcock · 2009-07-21 11:51 · Score: 2, Insightful
  
  I understand putting another API above SQL to make it simpler to use, but avoiding using SQL because its powerful makes no sense.
  
  --
  - Michael T. Babcock (Yes, I blog)
12. Re:The SQL language is also an issue by chez69 · 2009-07-21 12:41 · Score: 1
  
  Not to be a troll, but this sure sounds a lot like IMS. Write a program to analyze the data.
  some mainframers would be laughing their asses off.
  
  --
  PHP is the solution of choice for relaying mysql errors to web users.
13. Re:The SQL language is also an issue by Alpha830RulZ · 2009-07-21 16:10 · Score: 1
  
  And there's no need to learn yet another domain-specific language like SQL,
  SQL, "domain specific"? Wow. I am taken aback. Over 30 years of coding, I think SQL is singlehandedly the most productive addition to the development environment I can think of since the compiler. There are a lot of reasons that using a SQL database might not make sense (small platform, single user, low cost, small required footprint, etc) but domain specificity isn't on my list. I can't think of a less domain specific development technology out there.
  If you are working with piles of data, SQL is much easier, and generally much faster in both development time and execution time than what you'd cobble up in code. It hides the details of storage from your app, which is usually a Pretty Good Thing. Apps that use SQL can be readily written to scale from small to quite large environments, without significant code changes. I don't know why you'd want to give these benefits up.
  
  --
  I was taught to respect my elders. The trouble is, it's getting harder and harder to find some.
14. Re:The SQL language is also an issue by Anonymous Coward · 2009-07-21 17:50 · Score: 0
  
  I can not agree more. I started working with RDBMS on dBaseII in the early 80's. While everything else has changed the principals of relational database management have remained consistently relevant and true. ref: Edgar F. Codd
  
  "For people who aren't developpers, SQL makes sense as well."
  WTF! I think that ranks as one of the stupidest statements I have ever read on slashdot!
  Howewver, it probably explains all the D3v31opr#s who write incredible underperforming and insecure PHP websites because they don't bother to learn some basic SQL.
15. Re:The SQL language is also an issue by ArsonSmith · 2009-07-21 18:16 · Score: 1
  
  Some people are able to grasp new concepts and others cant???
  I'll get off your lawn now.
  
  --
  Paying taxes to buy civilization is like paying a hooker to buy love.
16. Re:The SQL language is also an issue by chrysalis · 2009-07-21 18:19 · Score: 1
  
  Talked about "records" and "id" because people familiar with SQL might not be familiar with other kind of databases, but you took it the wrong way.
  Now, ask Google. How many critical vulnerabilities were due to SQL injections? How many similar vulnerabilities were found in SQL-free databases?
  I agree with you that workarounds do exist, and that developpers are to blame instead of SQL, but in the real world, SQL is how a lot of services are compromised by kiddies.
  Why do we need to invent wrappers?
  Why do tools like GreenSQL exist?
  Why do we need ORMs?
  Why does a stock PHP have 5 different APIs just to issue basic MySQL queries?
  Why are there hundreds of third-party PHP abstraction layers and ORMs?
  How come every PHP project reinvents yet another SQL abstraction layer?
  How come, even though both are using the same SQL database server, it's a real hell to merge code of a PHP application with another PHP application, because they invented their own abstraction layers?
  Why do people want to hide SQL at the first place (even Apple in newer iPhone SDKs)?
  Maybe just because there's something wrong with the SQL language itself.
  
  --
  {{.sig}}
17. Re:The SQL language is also an issue by Anonymous Coward · 2009-07-21 18:26 · Score: 0
  
  Yes, SQL is a beautiful thing. It's Microsoft Excel without any UI.
18. Re:The SQL language is also an issue by Anonymous Coward · 2009-07-21 18:33 · Score: 0
  
  Looks like you never dealt with denormalized and sharded data.
19. Re:The SQL language is also an issue by chrysalis · 2009-07-21 18:55 · Score: 1
  
  Actually it's quite the opposite. For any complex task, writing a script for MongoDB, CouchDB or TC/TT is way easier and faster than an unbearable 100-lines SQL statement, that even you are unable to understand the day after. Plus it's able to get things that just can't be written as an SQL query.
  And your "we'll have to run it over the weekend because it'll kill the server" is also why when you need to extract stuff out of a large dataset, you write a script to process data in chunks, not a single SQL statement. If SQL is so wonderful and the answer to everything, why do stored procedures exist?
  
  --
  {{.sig}}
20. Re:The SQL language is also an issue by chrysalis · 2009-07-21 18:57 · Score: 1
  
  Not exactly since Excel macros can perform loops at least.
  
  --
  {{.sig}}
21. Re:The SQL language is also an issue by chrysalis · 2009-07-21 19:01 · Score: 1
  
  Marschaling is still required, but you don't need to think about being restricted to a schema, columns, types, to define identifiers for everything, to do explicit joins, etc. Just store your objects as they are in memory.
  Look at MongoDB: http://www.mongodb.org/
  
  --
  {{.sig}}
22. Re:The SQL language is also an issue by chrysalis · 2009-07-21 19:04 · Score: 1
  
  "id" is MP3 data. How do I find the title of the song? Will your 3 liners do the trick?
  In a document-oriented database id wouldn't be an issue. No need for any wrapper.
  
  --
  {{.sig}}
23. Re:The SQL language is also an issue by chrysalis · 2009-07-21 19:11 · Score: 1
  
  > WTF! I think that ranks as one of the stupidest statements I have ever read on slashdot!
  Tons of people aren't exactly writing PHP websites, but are still able to install vbulletin, phpbb, phpnuke, joomla, wordpress on mutualised hosting. And then they fire phpmyadmin in order to remove bogus users, to count the number of posts or visits, etc. SQL perfectly makes sense for this.
  
  --
  {{.sig}}
24. Re:The SQL language is also an issue by growse · 2009-07-21 20:26 · Score: 1
  
  SQL injection vulnerabilities don't exist because of the database, they exist because of the crappy programmer who doesn't know how to use the database being let loose writing production code. It's a bit like saying "Lets blame the internet for Cross-site scripting vulnerabilities!".
  
  And there's nothing wrong with SQL. There's a lot wrong with people who think SQL will solve every single problem under the sun. Unfortunately, those people seem to be employed writing 3rd-party abstraction layers and ORMs.
  
  --
  There is nothing interesting going on at my blog
25. Re:The SQL language is also an issue by mabinogi · 2009-07-21 20:57 · Score: 2, Insightful
  
  The answers to those questions will say a whole lot about why PHP sucks, but very little about SQL.
  in particular:
  
  Why does a stock PHP have 5 different APIs just to issue basic MySQL queries?
  Because the PHP developers have re-invented the wheel five times and still haven't figured out it's not supposed to have sharp corners. Nothing to do with SQL. Perl's DBI is a good example of a database abstraction layer done right.
  
  --
  Advanced users are users too!
26. Re:The SQL language is also an issue by elnyka · 2009-07-22 01:11 · Score: 1
  
  Some people are able to grasp new concepts and others cant???
  I'll get off your lawn now.
  And the later types are the ones that write shitty code on a regular basis? /sarcasm-gone-amock :-/
27. Re:The SQL language is also an issue by elnyka · 2009-07-22 01:18 · Score: 1
  
  Looks like you never dealt with denormalized and sharded data.
  Denormalized data (except in certain cases) is usually a sign of bad design, not an intrinsic RDBMs attribute.
  As for sharded data (assuming that it's properly normalized, otherwise see previous paragraph), and assuming that it's properly sharded among functionally-sound partitions, what's the trouble in implementing the hypothetical request? Badly partitioned data is just as denormalized data; signs of someone who didn't know what he/she was doing.
  One could also imagine a hypothetical scenario where data has been entered in a non-sql, key-value store in such a way that makes it near impossible to make any sense out of it. But just as in your hypothetical counter-example, this counter-example would be an attribute of bad design and not necessarily a sign on the limitations of non-sql key-value stores.
28. Re:The SQL language is also an issue by elnyka · 2009-07-22 01:25 · Score: 1
  
  Actually it's quite the opposite. For any complex task, writing a script for MongoDB, CouchDB or TC/TT is way easier and faster than an unbearable 100-lines SQL statement, that even you are unable to understand the day after. Plus it's able to get things that just can't be written as an SQL query.
  Can you provide concrete proof of this? "For any complex task" pretty much uses universal quantification over all problems dealing with data representation. Considering that some (not all) data (with its associated meaning and function) is best represented using relations, while others are better represented using network models (think cyclical graphs), and others are best represented as simple key-value mappings, I would find it hard to believe that truly and verily any complex data manipulation task is best represented writing a MongoDB/CouchDB/TC/TT script.
  Now, if you decide to reply by saying "well, not all, but most", please provide proof, or at least some logical demonstration that this is indeed true. For, if it is, holy crap, you need to write a dissertation on this.
  
  And your "we'll have to run it over the weekend because it'll kill the server" is also why when you need to extract stuff out of a large dataset, you write a script to process data in chunks, not a single SQL statement. If SQL is so wonderful and the answer to everything, why do stored procedures exist?
29. Re:The SQL language is also an issue by elnyka · 2009-07-22 01:43 · Score: 1
  
  Shoot, I f* up the quotes, so let me try again.
  
  Actually it's quite the opposite. For any complex task, writing a script for MongoDB, CouchDB or TC/TT is way easier and faster than an unbearable 100-lines SQL statement, that even you are unable to understand the day after. Plus it's able to get things that just can't be written as an SQL query.
  Can you provide concrete proof of this? "For any complex task" pretty much uses universal quantification over all problems dealing with data representation. Considering that some (not all) data (with its associated meaning and function) is best represented using relations, while others are better represented using network models (think cyclical graphs), and others are best represented as simple key-value mappings, I would find it hard to believe that truly and verily any complex data manipulation task is best represented writing a MongoDB/CouchDB/TC/TT script.
  Now, if you decide to reply by saying "well, not all, but most", please provide proof, or at least some logical demonstration that this is indeed true. For, if it is, holy crap, you need to write a dissertation on this.
  
  And your "we'll have to run it over the weekend because it'll kill the server" is also why when you need to extract stuff out of a large dataset, you write a script to process data in chunks, not a single SQL statement.
  Who told you that for that type of task you would use a single SQL statement?
  
  If SQL is so wonderful and the answer to everything, why do stored procedures exist?
  Encapsulation. Reuse. You know, all the nice things associated to modules and procedures. Stored procedures are the same. In any non-trivial system, data might change its structure over time. Say, I have a SQL statement A that goes over one or more inter-related but heterogeneous database structures X, Y, Z, producing a set of values with structure or interface B. And imagine that SQL statement A is used at different locations in my code.
  Due to business changes, to obtain set of structure B, now I have to operate with structures X, Y, Z". So the newly adapted SQL is A'. Say, I'm intelligent. So I put my SQL in a stored procedure P which is called throughout my code. Changes from A to A' are encapsulated inside the store procedure. I can unit test that store procedure and have a level of assurance (but not a certainty) that it will work throughout my code.
  Or maybe I'm not intelligent; I'm an idiot that doesn't know SQL and much less stored procedures (what are stored procedures for anyways?). Then I have to go through the code replacing every instance of A into A', and unit test every changed piece of code. Man, I can understand someone not knowing SQL and stored procedures, but for f* sake, these are just CompSci 101 concepts man. Please get your basic principles right before trying to lecture on what's good or not in software.
  Another good reason for using stored procedures if for fine-grained execution control. You don't want just any schmuck that is clueless about SQL to execute random SQL producing cartesian joints or a long-ass query that doesn't use an index. You test and fine tune not only your SQL and server-calling SQL code, but also the underlying data structures, partitions and indexes in your integration/UAT environment prior to deployment on production. This review and Q&A is assured by forcing every SQL statement into its own procedure.
  Alas then, you simply enforce the same principles of modularity and encapsulation on SQL that you also find universally in any other modern programming language. That you cannot even come to that conclusion on your own (even if you don't know SQL) is completely unacceptable for anyone that does coding for a living.
  You don't need to use a SQL database when you don't functionally need that. I'd be the first to say a lot of
30. Re:The SQL language is also an issue by chrysalis · 2009-07-22 02:17 · Score: 1
  
  It's a bit like saying "Lets blame the internet for Cross-site scripting vulnerabilities!".
  No, it's about having less issues by using modern tools, rather than trying to find who's to blame.
  If HTML/JS/CSS/HTTP could be redesigned today, do you think that the way a browser manage cookies, XHR requests and sandboxing in general would be the same as it is today? Do you think that the SMTP protocol that was good enough 30 years ago is not a big pile of crap nowadays, even, just like ORMs, their content is now shown in webmails? SQL is just like SMTP. Or the FAT filesystem. An old thing. There are worthy proposals and even working products that could superscede them, but because of legacy applications and people who want to stick with the same technologies till the end of the universe, these old things remain. They just get bloated with new extensions instead, in order to keep up with mandatory requirements.
  they exist because of the crappy programmer who doesn't know how to use the database
  "Our GUI _is_ user-friendly! Customers just don't understand how to use it because _they_ are clueless!"
  "hey, and there's a workaround to make them use it properly. Just tell them to top an GUIORM (pick one out of hundreds), an extra layer that hides our wonderful GUI and shows another GUI that's closer to reality, and transparently translates events both way, while introducing some new limitations (thinking about that other ORM that has no way to issue OR-like statements) and some grain of extra sluginess!"
  
  --
  {{.sig}}
31. Re:The SQL language is also an issue by mbourgon · 2009-07-22 02:24 · Score: 1
  
  Didn't realize Joe Celko was a Slashdotter.
  
  --
  "Sometimes a woman is a kind of religion, she can save your soul & set you free from all your sins" - Bad Examples
32. Re:The SQL language is also an issue by growse · 2009-07-22 02:49 · Score: 1
  
  It's a bit like saying "Lets blame the internet for Cross-site scripting vulnerabilities!".
  No, it's about having less issues by using modern tools, rather than trying to find who's to blame.
  If HTML/JS/CSS/HTTP could be redesigned today, do you think that the way a browser manage cookies, XHR requests and sandboxing in general would be the same as it is today? Do you think that the SMTP protocol that was good enough 30 years ago is not a big pile of crap nowadays, even, just like ORMs, their content is now shown in webmails? SQL is just like SMTP. Or the FAT filesystem. An old thing. There are worthy proposals and even working products that could superscede them, but because of legacy applications and people who want to stick with the same technologies till the end of the universe, these old things remain. They just get bloated with new extensions instead, in order to keep up with mandatory requirements.
  Of course, if you were designing something today to do the job of 'relational database', you'd probably get something different from SQL. That doesn't change the fact that today, the SQL / RDBMS combo is the best tool for solving a lot of problems. That doesn't mean that people won't try and use it improperly, but those people are idiots. People don't stick with SQL because it's old, they stick with SQL because it gets the job done, and in the hands of someone who knows what they're doing, it gets the job done well as long as the job is a sensible one for which a relational database makes sense. Like storing millions of customer records.
  
  they exist because of the crappy programmer who doesn't know how to use the database
  "Our GUI _is_ user-friendly! Customers just don't understand how to use it because _they_ are clueless!"
  "hey, and there's a workaround to make them use it properly. Just tell them to top an GUIORM (pick one out of hundreds), an extra layer that hides our wonderful GUI and shows another GUI that's closer to reality, and transparently translates events both way, while introducing some new limitations (thinking about that other ORM that has no way to issue OR-like statements) and some grain of extra sluginess!"
  Typically, you don't have any control over how smart your users are, so you make GUIs nice and simple to use. However, the analogy doesn't hold for developer tools and technologies. You do have control over how smart your developers are, and if they're stupid to use SQL properly, they're idiots. It's not the fault of the technology that muppets use it improperly.
  
  --
  There is nothing interesting going on at my blog
33. Re:The SQL language is also an issue by pamar · 2009-07-22 03:07 · Score: 1
  
  I can not agree more. I started working with RDBMS on dBaseII in the early 80's. ...
  Hate to nitpick, but if you are talking of the IBM product, I think you better say "DB2".
  dBaseII was an Ashton-Tate product, originally conceived for CP/M. The dBase line introduced RELATIONAL functionalities only with DBase IV (and they allegedly never really worked).
34. Re:The SQL language is also an issue by Tablizer · 2009-07-22 03:19 · Score: 1
  
  My example is based on the original example given. But this illustrates part of my point: when you need features you didn't originally anticipate (or ask for), you have the power of SQL to help out.
  
  --
  Table-ized A.I.
35. Re:The SQL language is also an issue by TheLink · 2009-07-22 03:59 · Score: 1
  
  > you don't need to think about being restricted to a schema, columns, types, to define identifiers for everything, to do explicit joins, etc. Just store your objects as they are in memory.
  
  That's not a disadvantage in many cases, especially for the long term. There's a benefit of forcing people to use SQL to talk to the DB. It becomes a layer of abstraction, somewhat like a standard protocol or interface.
  
  When you use SQL, your database can be used by 100 different people and programs, and when you add columns/fields, if the coders have been doing things properly (not assuming column order and number of columns), everything still works. When lots of things can use the same DB, that database often becomes more valuable and useful.
  
  In contrast what does "Just store your objects as they are in memory" mean, when you have 100 different programs? What happens if a program needs a new data field attached to the returned object? Can the other 99 programs still work without changes? Or will some require a recompile since the DB is now returning a bigger/different object?
  
  If you need recompile or modifications for stuff like that, that means only a few programs can share the same database (otherwise it becomes a maintenance nightmare), and that could limit the extent of how useful that database could be. You might then have to add layers of abstraction so that it becomes more accessible (and stays accessible) to many other programs. And one day one of those layers might end up looking like SQL ;).
  
  To quote myself: One man's impedance mismatch is another man's layer of abstraction.
  --
  
  Too many replies beneath your current threshold
36. Re:The SQL language is also an issue by ahabswhale · 2009-07-22 06:26 · Score: 1
  
  It's because SQL isn't just SQL. It's all the cruft that goes with it. Accessing a DB from an OO language is simply a major fucking pain in the ass and much harder than it should be even when using the ORM du jour. A lot of this complexity comes from the fact that OO and RDBS just don't play well together no matter how you slice it. Instead of focusing on the business domain you end up spending far too much time dicking with the data layer. A lot of this would go away by using an OO database but then you lose the scalability of these other technologies.
  
  --
  Are agnostics skeptical of unicorns too?
37. Re:The SQL language is also an issue by Abcd1234 · 2009-07-22 07:03 · Score: 1
  
  Marschaling is still required, but you don't need to think about being restricted to a schema, columns, types, to define identifiers for everything, to do explicit joins, etc. Just store your objects as they are in memory.
  Well, that's all very interesting, but it doesn't sound any different than your average OODBMS, something which has been around for a *long* time (I worked with Versant nearly a decade ago doing exactly what you describe). Heck, the Smalltalk world has had intelligent object databases for years (Gemstone comes to mind, among others). So, again, what's the big deal?
  As an aside, I'm also not convinced that being forced to a define a proper schema for one's data, that's separate from the overlaying object model, is necessarily as bad a thing as you seem to think. Nor is it that difficult, given how intelligent ORMs are these days. But, when it comes down to it, that's as much a philosophical argument as anything else.
38. Re:The SQL language is also an issue by ChaosDiscord · 2009-07-22 07:19 · Score: 1
  
  I believe the great-grandparent poster was talking about simple key-value stores, similar to the Tokyo Cabinet system he mentioned. When people talk about Anti-SQL or SQL-Free, that seems to be what they're always talking about, although usually on the larger end with things like BigTable and HBase. My criticism was directed in that direction. Compared a key-value store to a subset of SQL, or even a key-value store implemented in SQL, the complexity difference is negligible for any but the most simplistic of projects. As such, the great-grandparent poster's objection to SQL and relational databases compared to key-value stores was wrong. But I must yield that object storage is an area where object oriented DB have a clear advantage over relational DBs.
  
  --
  Search 2010 Gen Con events
39. Re:The SQL language is also an issue by cratermoon · 2009-07-22 08:09 · Score: 1
  
  > That's not a disadvantage in many cases, especially for the long term. You Aren't Going to Need It.
40. Re:The SQL language is also an issue by LionMage · 2009-07-22 13:36 · Score: 1
  Judging from your response to ahabswhale, it seems that you've pretty much made up your mind that everyone else is wrong and you're right, but I'll take a stab anyway at why I think you're missing the point:
  
  Looking up a given unique identifier quickly is something your average relational database is very good at. And writing the wrapper function to implement your hypothetical get() function is trivial in most languages. I'm completely at a loss for what your SQL-free database is offering me in this case. It's saving you from the horror of writing 10 lines of code, once, to implement get(in)? 60 minutes with a good SQL tutorial will teach you everything you need to know. Sure, there is a lot more you can learn, but for the simple case you're describing you can understand SQL at only the most simple level.
  Hoo boy... so much here, where to begin? First off, the condescension is uncalled-for. It's not that people who hate SQL are somehow incapable of writing SQL queries, or learning how to. I think the objections are twofold:
  
  Everything in SQL assumes that data is formatted in a tabular fashion. For data that's truly tabular, it makes sense. But trying to represent certain other data in a SQL database is just painful. For instance, it's overkill for an associative array such as a large hashtable. I've had DBAs tell me tree structures are "trivial" to represent in SQL, but my general experience has been that the standard ways to implement this (and the hacks that some commercial DBMSes employ to optimize this) are tortured and tortuous.
  The extra layers of indirection and ceremony you have to go through to marshal and unmarshal data to/from a relational database. If you have many disparate fields and you want to do complex selection based on multiple fields or multiple conditions, a relational database makes sense, but again if what you have is essentially a giant hashtable, the extra layers of indirection for what is essentially a bunch of key-value pairs is way overkill.
  To use the example you latched upon in the GP post, you talk about how trivial it is to write a wrapper function to give us a get() method equivalent. Sure, it might only take 10 lines of user-generated code, but let's consider what is going on behind the scenes:
  
  That wrapper function will need to take that SQL and turn it into some form that can be processed more efficiently by the DB. If you take advantage of prepared statements, this might be a one-time operation. However, consider that some databases such as MySQL don't have a good or even a working implementation of prepared statements -- in the case of MySQL, I have it on good authority from one of the devs that prepared statements never worked right, so the code for that was yanked when they did the Drizzle fork; in fact, the JDBC drivers for MySQL kind of "roll their own" in their implementation of PreparedStatement, instead of letting the database do the work.
  The statement to execute then gets shipped through the network connection to the database for execution. In all but the most trivial applications, the database won't reside on the same machine, so there's network overhead to be concerned with -- that'll be true whether you're talking about a SQL database or a no-SQL alternative. It does seem to me that in the case of a giant hashtable in the sky, the request is going to be a lot smaller over the wire than even invoking a prepared statement with a single parameter.
  Once the request hits the DB engine, there's going to be some non-zero execution time. On the SQL side, even with a prepared statement, the parameter has to be plugged in and the compiled expression executed. On the no-SQL, giant hashtable side, we have little in the way of processing to do other than doing the actual lookup, which is hard-coded logic in the engine.
  Retrieval should be a trivial matter, right? Well, in a traditional relational database, it's true that the column used as the primar
41. Re:The SQL language is also an issue by badkarmadayaccount · 2009-07-23 02:01 · Score: 1
  
  PostgreSQL allow turning any programming language into a query language, AFAIK. BTW, this may be off-topic, but I'm pretty sure I'll get the most DBMS geek eyes on this article, so here it goes - would it be possible/feasible to integrate the compiler system and cache with the VCS within the database? My idea is about getting the flexibility of Portage/pkgsrc systems without the hassle of compiling the whole thing, start to finish. I'm pretty sure most compile time options can be recalculated quickly, and reflected in the database as binary diffs, or maybe just a link time option.
  
  --
  I know tobacco is bad for you, so I smoke weed with crack.
42. Re:The SQL language is also an issue by ChaosDiscord · 2009-07-23 04:39 · Score: 1
  
  The grandparent ultimately asserted that "for the rest of us, SQL is cumbersome," calling out that an "SQL-free" database is "easier", "more secure," and "cheaper."
  If we're talking about essentially key-value stores, SQL can do it well.
  It's harder, but we're talking about an hour or so of work, an hour's worth of value you can reuse for future projects. For all of the complaints about needing to worry about tables with lots of fields and serialization, it's moot if you just want a key store. All that worrying about getting your data into tabular form is irrelevant. (I'm not sure what your point about trees is; I'm not aware of an especially more elegant way to store trees in a simple key-value store that I couldn't implement with similar ease in a relational database.)
  Writing secure SQL is not hard; every modern language supports placeholders that will handle your escaping.
  Now a simple key-value store is "cheaper," there is less going on under the hood, less built-in functionality ("power") available and thus less complexity. But, are you really writing the next Google? Sure, you want to believe your application will have millions of users, but... really? "The rest of us" are writing much, much smaller scale applications. Little things like the phone company's billing system, Wikipedia, and EVE Online. And we're not feeling terribly encumbered by our databases.
  I find your comparison to LISP/Java and C relevant, but for the opposite reason. Key-value stores and C are the simple, obvious solution. They're really fast because they don't do much, and you can accurately predict what's going on under the hood. LISP, Java, and a relational database are powerful, resultingly complex, and it's hard to predict exact run-time performance. But many real world programs frequently don't need to be all that fast, the benefits of all that extra functionality speed development, if you add the functionality yourself you end up paying the cost or more anyway, and years of research have shown that you can make them quite fast. C programmers for years wailed that all those extra layers of indirection was way overkill, that Java "is often more complex than necessary and has too much overhead" and "that much of the overhead is hidden from the average developer, which for today's lazy programmers makes for some very inefficient code." At least today, C programmer standing by this argument are wrong for most real-world problems.
  And, of course, part of the price of a key-store system is that many seemingly simple tasks become hard. Sure, you can map from product ID to a product's information, but how do you query on price, name, or manufacturer? As best I can tell the answers are to throw more computers at the problem, or start maintaining your own indexes and hoping you don't screw them up. Related but separate pieces of information (products, customers, orders) don't have an elegant representation; you end up maintaining the relationships in your code; the key-store system can't catch errors. You can add layers to help you with all of these things, but as you go, you'll find you're reimplementing a relational database. On the up side, you only pay for the features you need. But the down sides are many: an existing relational database has dedicated many years of work into optimizations and research, time you can't afford; your system is non-standard, increasing the time it takes a new team member to get up to speed; if your needs are relatively modest, the cost of development will exceed the cost of using off-the-shelf software.
  Systems like Hadoop or Google's MapReduce have a lot of advantages, as you note. They do scale, especially across multiple machines, in a way that relational databases suck hard at. They can provide high reliability in the face of unreliable machines. (And on a large enough system, the mean time to failure is now.) Maybe you're doing something where those or other advantages are important. Great! But "the rest of us" aren't especial
  
  --
  Search 2010 Gen Con events
Re:If it works as described it will be VERY import by Etylowy · 2009-07-21 07:22 · Score: 1

It it will deliver it will change much. Not for your average blogger with a $10 hosting, wordpress and all his 100 readers but for all the folks that have sites successful enough to go beyond that a single DB server can deliver. Now you have to work really, really hard to make it all work with replication as pretty much no free CMS offers data sharding. Now you won't have to. Just get a DB cluster (as a service) that works out of the box with none/very little modification to the software you are using. The wall that they currently hit at the point they have to invest loads of money to continue growth will be gone.
MySQL? by trisweb · 2009-07-21 07:38 · Score: 3, Funny

No offense to the creators (well, maybe some offense) but why the heck would you want to put MySQL in where PostgreSQL already was? That's like taking out your star quarterback and putting in, well, me!

--
"!"
1. Re:MySQL? by Anonymous Coward · 2009-07-21 07:49 · Score: 0
  
  brett farve retired because he grew beyond his bones capacity.
  you are fresh and squishy. here is your helmet.
2. Re:MySQL? by Anonymous Coward · 2009-07-21 08:15 · Score: 0
  
  Actually, I was thinking the exact opposite: what a shame that the project was based on a failed platform with no market share and even less market relevance. I was excited about using 10 years' worth of mysql code in a cloud environment.
3. Re:MySQL? by scorp1us · 2009-07-21 08:31 · Score: 3, Informative
  
  MySQL has its fan boys from circa 1994-2001. During this period, the MySQL license was much more permissive, and gained a certain momentum from PHP that carries it through to this day. At the same time, PostgreSQL was still using Cygwin on Windows, the INSTALL had a table of contents, and was lacking performance enhancements (particularly on Windows). Eventually Cygwin was dropped and the threading was happy on windows, and the performance enhancements were good. Along with this came a much shorter INSTALL file and all reason to use MySQL had disappeared. But once you know something, people like to keep on using it. Then MySQL got things like triggers, foreign key constraints and full ACID compliance. So in the end it ended being a wash. However, and not to start a flame war, it seems that PostgreSQL, having been feature-complete (ACID, foreign keys, etc) maintained a performance edge. But also to this day MySQL has a very fast table implementation, provided you don't need things like ACID compliance. For a variety of applications, this is "good enough" and the trade-offs of feature completeness vs performance are worth it. Disclaimer: I have used both extensively in the past. I prefer PostgreSQL, but now use neither. Now I only do SQLite (embedded tables) or Oracle (for hot replication).
  
  --
  Slashdot's rate-of-post filter: Preventing you from posting too many great ideas at once.
4. Re:MySQL? by DrEldarion · 2009-07-21 09:19 · Score: 1
  
  Well, the name is certainly a lot catchier...
5. Re:MySQL? by Anonymous Coward · 2009-07-21 13:12 · Score: 0
  
  MySQL has its fan boys from circa 1994-2001. During this period, the MySQL license was much more permissive, and gained a certain momentum from PHP that carries it through to this day. At the same time, PostgreSQL was still using Cygwin on Windows, the INSTALL had a table of contents, and was lacking performance enhancements (particularly on Windows). Eventually Cygwin was dropped and the threading was happy on windows, and the performance enhancements were good. Along with this came a much shorter INSTALL file and all reason to use MySQL had disappeared. But once you know something, people like to keep on using it. Then MySQL got things like triggers, foreign key constraints and full ACID compliance. So in the end it ended being a wash.
  However, and not to start a flame war, it seems that PostgreSQL, having been feature-complete (ACID, foreign keys, etc) maintained a performance edge. But also to this day MySQL has a very fast table implementation, provided you don't need things like ACID compliance. For a variety of applications, this is "good enough" and the trade-offs of feature completeness vs performance are worth it.
  Disclaimer: I have used both extensively in the past. I prefer PostgreSQL, but now use neither. Now I only do SQLite (embedded tables) or Oracle (for hot replication).
  Oh shit... I will never use a DB that makes me stop everything to do vacuum cleaning. NEVER!
  When you do millions of writes everyday it's simply unuseable.
  It's everyone blind? Or for some strange reason it's only me that the vacuum cleaning lasted like 2 hours everyday? My Hardisk have too many corners?
  I tend to use SQL only when it's super necessary, ACID is an scholastic myth and foreign keys have only sense inside SQL (Or as a notation).
  SO
  If for having that I have to stand a DB is blocking my database 2hours each day, and going slower for anything else that uber coplex join select querys that only the people eligible to use haddoop will use, then sorry.
  I choose MySQL
6. Re:MySQL? by Anonymous Coward · 2009-07-22 01:50 · Score: 0
  
  Oh shit... I will never use a DB that makes me stop everything to do vacuum cleaning. NEVER!
  When you do millions of writes everyday it's simply unuseable.
  It's everyone blind? Or for some strange reason it's only me that the vacuum cleaning lasted like 2 hours everyday? My Hardisk have too many corners?
  I tend to use SQL only when it's super necessary, ACID is an scholastic myth and foreign keys have only sense inside SQL (Or as a notation).
  SO
  If for having that I have to stand a DB is blocking my database 2hours each day, and going slower for anything else that uber coplex join select querys that only the people eligible to use haddoop will use, then sorry.
  I choose MySQL
  You are several years outdated in your information about PostgreSQL and vacuum.
  You are not only outdated, but also completely wrong.
  You really should take a new look at PostgreSQL.
  You ought to find it enlightening.
  Or maybe you are just a troll?
Re:If it works as described it will be VERY import by gandhi_2 · 2009-07-21 07:47 · Score: 3, Funny

I can't say I'm looking forward to bigger, faster, shit-covered platters...but hey. Who am I to stand in the way of progress?

--
THL phish sticks
Comic books won't substitute for reality by Anonymous Coward · 2009-07-21 08:17 · Score: 0

Considering that "Ubermensch" was translatable to "Superman" then "Ubercheap" would be "Supercheap"
Considering that you like to talk out your ass, your name would be "shit for brains".
Uber and Super both mean "above", knucklehead. Same proto-indo-european root, in fact.
1. Re:Comic books won't substitute for reality by CorporateSuit · 2009-07-21 09:35 · Score: 1
  
  Uber and Super both mean "above", knucklehead. Same proto-indo-european root, in fact.
  Today may just be the day that you learn that a word may have more than one definition. In fact, the word you use "root" refers not just to a word's origin, but it can also refer to a very important part of plants. Do not squander this opportunity. It will open an entire new world of linguistics. I have nothing but hope for the grand future that awaits you and your once-tunneled view of the English language.
  
  --
  I am the richest astronaut ever to win the superbowl.
2. Re:Comic books won't substitute for reality by Anonymous Coward · 2009-07-21 20:46 · Score: 0
  
  hang on - how can you use the "words can have more than one meaning" argument to refute his argument without destroying your own in the process?
This technology already exists by Anonymous Coward · 2009-07-21 08:48 · Score: 0

Intersystems Cache
"World's fastest Object Database"
1. Re:This technology already exists by Bill,+Shooter+of+Bul · 2009-07-21 10:27 · Score: 1
  
  And how much more than "Free" does it cost?
  
  --
  Well.. maybe. Or Maybe not. But Definitely not sort of.
2. Re:This technology already exists by Anonymous Coward · 2009-07-21 13:49 · Score: 0
  
  Where does the word "free" appear in the above article?
  Consider the value of the data and the value of software development effort when you throw around the word "free"
3. Re:This technology already exists by Bill,+Shooter+of+Bul · 2009-07-21 14:40 · Score: 1
  
  Not sure what you were talking about, but hadoop and postgres are open source. Unless they're stupid, they wouldn't make the resulting product closed source.
  I'm not going to make the whole free software pitch here, but lets just say I believe in the superiority of the development process and the end product through my experiences developing and using software.
  I have no confidence in Intersystems Cache's long term survival.
  
  --
  Well.. maybe. Or Maybe not. But Definitely not sort of.
Yaaaaay by caywen · 2009-07-21 08:54 · Score: 1

(In my best Special Ed impersonation)
Yaaaaaay, now we can scale out Hadoop! Yaaaaay! Yaaaay Hadoop! Yaaaaay!
SQL is legacy by Anonymous Coward · 2009-07-21 08:55 · Score: 0

Take a look at Intersystems Cache
Fastest object database in existence
Fastest SQL database in existence
Learn how it works and you will see how SQL is nothing more than a mighty kludge
Bits of software are tools.. by msimm · 2009-07-21 09:07 · Score: 1

We might create the software intending it to do and be used in one way, but how it will actually be used is determined by the users. Postgre and MySQL don't carry any intrinsic values, only the values which their users discover and, well, use. Without users they have no good or bad features.

So why is it that people feel the need to rally around or defend them? After all, only the developers who have done the work are capable of understanding the snips and criticism leveled against them, and these are the people who have given their work away, to you and me.

MySQL excels at some things. Postgre also excels at some things. If users feel there is too much overlap then they can work to reproduce these features in a single tool, such as Postgre should they feel it has more utility. But to discount a tool many people find useful shows a core misunderstanding of what it is that determines the software's value.

Postgre can not be better then MySQL, it can only provide varying degrees of value. And that value is determined by the user.

--
Quack, quack.
1. Re:Bits of software are tools.. by Anonymous Coward · 2009-07-21 10:17 · Score: 0
  
  fuck you and your balanced argumentation
2. Re:Bits of software are tools.. by photon317 · 2009-07-21 10:41 · Score: 1
  
  I've used both pretty extensively in a wide variety of environments, and I don't take such a balanced view at all. IMHO, the best answer to most database-related problems is to use PostgreSQL or SQLite. MySQL sits somewhere between them in terms of reliability, scalability, ACIDity, etc, and kinda fails at being good at anything in particular. For that matter, even if you *like* where MySQL lies on those tradeoffs, compared to either of the other two mentioned products (especially Pg), the quality of the code and the development process both flat-out suck, and are very amateurish in places by comparison. The only reason I use it anymore is for applications that require it and haven't been ported to PostgreSQL or SQLite (as appropriate) yet.
  And that was before all the fiasco in recent years with Oracle buying Innobase, then Oracle buying Sun which had bought MySQL, and the several emerging MySQL forks, etc... which doesn't leave me feeling great about the future of MySQL if I were starting a new project today.
  
  --
  11*43+456^2
Give me their hands!!! by eronysis · 2009-07-21 09:18 · Score: 0, Offtopic

I firmly believe that any and all patents involving any naturally occurring human process or material should be abolished. Furthermore, some of those attorneys should be seeding our first body bank (nod to Niven and Pohl).
Other approaches to scalable SQL by owenomalley · 2009-07-21 09:32 · Score: 1

There are also two Hadoop subprojects that either support SQL or will shortly. They both translate SQL queries into map/reduce programs. They are:
http://hadoop.apache.org/pig/
http://hadoop.apache.org/hive/
try cloudbase by Anonymous Coward · 2009-07-21 16:16 · Score: 0

Taken from cloudbase's user doc (http://cloudbase.sourceforge.net/index.html#userDoc): CloudBase is a high-performance data warehouse system that scales horizontally on commodity hardware or a cloud computing network. It is developed by Business.com and released to open source community under GNU GPL licence 2.0 Built on top of a Hadoop's map-reduce architecture, CloudBase enables business analysts using ANSI SQL to directly query large-scale log files arising in web site, telecommunications or IT operations. But, unlike other map-reduce approaches, it does not create or require the use of a programming language on top of map-reduce.
Don't you read Information Week? by daschlag · 2009-07-21 16:44 · Score: 1

"Cheap 2.0".
ummm no by Anonymous Coward · 2009-07-21 18:41 · Score: 0

for many reasons in many hadoop use cases schemas are not usable. saying but we need schemas for performance is fail. also this is not a fair comparison. vertica uses compression. others do not. this is biased beyond imagination.
Unobligatory Monty Python Reference by cowboy76Spain · 2009-07-21 20:09 · Score: 1

And yes, for complex requests as in data mining, SQL and XPath make sense. For people who aren't developpers, SQL makes sense as well. For interoperability with 3rd-party apps, SQL is also useful, just as FAT is still useful today in order to share filesystems between operating systems.
But for the rest of us....
Sorry, but could not help thinking but to this line from "Life of Brian":
But apart from the sanitation, the medicine, education, wine, public order, irrigation, roads, the fresh-water system, and public health, what have the Romans ever done for us?
More seriously, if the main example of the trouble with SQL is that you want to be able to find a record by id with less keystrokes, I do not see how this can be so much of a problem.

--
Why can't /. have a rich-text editor? Editing your own HTML is so XXth century.
The wrong computerWorld article is linked by Anonymous Coward · 2009-07-22 03:53 · Score: 0

I think the summary was supposed to link to this: http://www.computerworld.com/s/article/9135726/Yale_researchers_create_database_Hadoop_hybrid?source=CTWNLE_nlt_dailyam_2009-07-21