Facebook's Project Prism, Corona Could Ease Data Crunch
Nerval's Lobster writes "Facebook recently invited a handful of employers into its headquarters for a more in-depth look at how it handles its flood of data. Part of that involves the social network's upcoming 'Project Prism,' which will allow Facebook to maintain data in multiple data centers around the globe while allowing company engineers to maintain a holistic view of it, thanks to tools such as automatic replication. That added flexibility could help Facebook as it attempts to wrangle an ever-increasing amount of data. 'It allows us to physically separate this massive warehouse of data but still maintain a single logical view of all of it,' is how Wired quotes Jay Parikh, Facebook's vice president of engineering, as explaining the system to reports. 'We can move the warehouses around, depending on cost or performance or technology.' Facebook has another project, known as Corona, which makes its Apache Hadoop clusters less crash-prone while increasing the number of tasks that can be run on the infrastructure."
So glad to see FB coming up with an original idea like decentralized server infrastructure. Oh wait....
sudo make me a sandwich
So Facebook is finding ways to further mine user data? Given their poor stock performance, a shrinking legitimate user base, and a more savvy audience, I'm sure this is just a veiled attempt to sell your data to more people through "innovation".
I'm not sure I would want to trust my data with a system named for something prone to holes and large mass ejections.
Everything is better with chainsaws.
Doesn't the Facebook dept. of the NSA know that they already used the "operation" codeword Corona for the first spy satellites?
-- You are in a maze of little, twisty passages, all different... --
Fantastic, that almost certainly means another change to the API/SDK processes. I can't wait for the number of grandparents joining to hit the critical mass where it all falls over and people leave for another network, ideally one with decent privacy and a decent API for public data.
Please consider this account deleted, I just can't be bothered with the spam anymore.
I have to give Facebook this: They're better at this sort of thing than anyone.
Just compare timeline to Slashdot comment history. Beyond the most recent comments, slashdot only allows you to search your history if you subscribe. Even then it's with a plea to please not do it too much...
Reddit is a more modern site I dare say, but they too don't let you go through your full comment history. After the first 1000, it gets really hard to trace them down.
When I drink corona, I do not crunch data too well. Unless it involves a deck of cards!
I don't know why we don't abstract all this cloud data storage to a truly vendor neutral, competitive scheme.
Imagine if we as users could provide our own server (at home, or designate Amazon, Google, etc) for our cloud storage. Facebook becomes just one application and anything we post is encoded in JSON formatted files. Then we actually own what we post. Of course, Facebook can cache the data (since first post generally is the most needed) and only go back to your cloud for older data. But you'd have every wall post, every photo, all of your data shared. Then Facebook can give you 1gig free, but you can also buy 5gig from google and have them use that instead. It gives them infinite storage, they only need to manage their cache.
Slashdot's rate-of-post filter: Preventing you from posting too many great ideas at once.
""Facebook recently invited a handful of employers into its headquarters "
Which employers? Why? Is facebook going to start providing companies with the data of job applicants?
"If any question why we died, Tell them because our fathers lied."