Web Analytics Databases Get Even Larger
CurtMonash writes "Web analytics databases are getting even larger. eBay now has a 6 1/2 petabyte warehouse running on Greenplum — user data — to go with its more established 2 1/2 petabyte Teradata system. Between the two databases, the metrics are enormous — 17 trillion rows, 150 billion new rows per day, millions of queries per day, and so on. Meanwhile, Facebook has 2 1/2 petabytes managed by Hadoop, not running on a conventional DBMS at all, Yahoo has over a petabyte (on a homegrown system), and Fox/MySpace has two different multi-hundred terabyte systems (Greenplum and Aster Data nCluster). eBay and Fox are the two Greenplum customers I wrote in about last August, when they both seemed to be headed to the petabyte range in a hurry. These are basically all web log/clickstream databases, except that network event data is even more voluminous than the pure clickstream stuff."
Actual, yes there was. It's a very subtly new rule on the properly use of adverbs and adjectives.
"A door is what a dog is perpetually on the wrong side of" - Ogden Nash
CmdrTaco is a plain white-bread murriken
It's a little known fact that he's actually multi-grain.
True confidence comes not from realising you are as good as your peers, but that your peers are as bad as you are.
Slow news day alert!
The topic in question is blindingly obvious to anyone who has heard of this newfangled "Internet" thing, and frankly is not worth an article in the first place. /. reporting at it's finest... For shame.
Furthermore, such a blatant error in the headline and summary is simply ridiculous. Do the submitters or editors not reread text prior to submission? This is sloppy
There is no psychiatrist in the world like a puppy licking your face - Ben Williams
is this ok?
2/12 can be expressed more simply as 1/6.
If you have ever touched one of their Web sites and caught their cookie, your tracks can be followed into unexpected places. This data is a gold mine for them, if they can figure out how to sell it without pissing off users with how much they know.