Facebook Details the Software Engineering Behind Graph Search
Nerval's Lobster writes "Facebook's Graph Search, its new and powerful way of searching the social network for all manner of information, has drawn a lot of attention since its January unveiling. Some have praised its innovation; others have wondered openly whether its search abilities will end up threatening Google and LinkedIn. Still more have questioned what it all means for users' privacy—always a touchy subject in conjunction with Facebook. The social network previously revealed how it's adjusting its hardware infrastructure to deal with the spike in traffic that will come from interactions with Graph Search (short answer: the Disaggregated Rack, which will break up hardware resources and scale them independently of one another). Now, in a new blog posting, it's offering a bit more with regard to the software side of things, and how the company repurposed an existing system to solve Graph Search's enormous engineering challenge. Bottom line: Facebook's engineers and executives finally decided on Unicorn, an inverted-index system they'd had in development for quite some time."
"Facebook's Stalker Search, its new and powerful way of searching the social network for all manner of information about you, has drawn a lot of negative attention since its January unveiling. Few have praised its innovation; fewer have wondered openly whether its search abilities will end up threatening Google and LinkedIn. Most have questioned what it all means for users' privacyâ"always a touchy subject in conjunction with Facebook. The social network previously revealed how it's adjusting its hardware infrastructure to deal with the spike in traffic that will come from interactions with Stalker Search (short answer: the Disorganized Rack, which will break up hardware resources and scale them independently of one another). Now, in a new blog posting, it's offering a bit more with regard to the software side of things, and how the company repurposed an existing system to solve Stalker Search's enormous engineering challenge. Bottom line: Facebook's engineers and executives finally decided on Unicorn, a mythical flying horned horse they'd had in the basement for quite some time."
I love knowing how exactly Zuckerberg will strip-mine our connections for $profit.
Mermaids wouldn't have worked.
Solving Unix problems since 1989...
I think it's really funny how they call a drop in traffic a "spike in traffic". Facebook was a fad and people have moved on.
Seven puppies were harmed during the making of this post.
You get as much privacy from Facebook/Gmail/Hotmail/etc as you pay for. Sometimes, you get less.
If you're unhappy with those terms, you probably shouldn't use the service.
Koans and fables for the software engineer
Tech sites love to post over and over again about these new Facebook features. But by the time they actually roll out to users everyone's forgotten about them.
Would really like to see Unicorn become open source.
Where I work we use datamarts spread across several data warehouses, which is quite similar to the FB way.
Since we use a bottom-up design model, creating so called solutions using this indexer would be very straightforward.
KERNEL PANIC -SIGFAULT AT ADDRESS #51A54D07
It's differentiation based on values, interuser traffic going down is less consequential than metauser traffic going up. The value system in place is obviously corporate. Has there ever been any doubt in your mind? If so, then consider yourself slow or idealistic. At least you can decide that for yourself.
...and still others replied to those comments.
Sorry about the mess.
"Hard part of startup is make money from wheel after reinvent it." -Devops_Borat
And does anyone on staff at Slashdot know the difference?
XML is a known as a key material required to create SMD: Software of Mass Destruction
“What are the differences between Mark Zuckerberg and me? I give private information on corporations to you for free, and I’m a villain. Zuckerberg gives your private information to corporations for money and he’s Man of the Year.” – Julian Assange I can't confirm if, where, and when he said this, but regardless the idea rings true for me.
Nobody actually cares about graph search, it's a total non-event. Unless you work at facebook or read /. you probably didn't particularly notice. Google are not sweating. See also Bing.
SELECT 1 AS is_deleted, `id`, `from`, `time`, `text` FROM `wall_deleted` WHERE `to` = %d AND `media_type` = %d AND `media_id` = %d UNION SELECT 0 AS is_deleted, `id`, `from`, `time`, `text` FROM `wall` WHERE `to` = %d AND `media_type` = %d AND `media_id` = %d ORDER BY `time`'
Right now it's just a toy. They need to expand on it greatly so that we can do legitimate social network analysis research with it and even use it to make decisions.
We need the ability to run any algorithm we want, we need an API to build apps to take advantage of it, we need the full range and capabilities of social network analysis, we need to be able to use intelligent agents to regularly collect opinion and other analytic information about our friends to help us make better decisions.
How many of my friends like X is important, but that's not going deep enough in my opinion. How many of my friends use certain phases, now we are getting somewhere useful. How popular are certain phrases in my social network? Useful. What are the common attitudes and trends of my social network? Useful. What about location data? How many hours do my friends spend on the road or are they mostly at home? What about even deeper? Specific topics like say how many of my friends like violence, and then it can show that certain friends like violent movies, video games, books, are members of the NRA, etc.
There is more, such as associations the ability to find correlations, do regression analysis, and even make predictions on what my friends might like or what topics they might dislike.
Such as the political positions of our friends, how useful is that?
What products our friends might want in the future, how useful is that?
How our friends feel about certain things and who in the social network have feelings in common, how useful is that?
The job prospects and career prospects of our friends, how useful is that?
When all the data points are connected and social network analysis properly conducted you can learn a lot of the mysteries about people that wouldn't ordinarily be easily known. This is incredibly important and can make Facebook actually useful to me.
We need to allow app makers to do the things and offer the services we can't, the really intrusive stuff that we need plausible deniability over, and by monetizing our data via licensed app services which perform tasks which we find morally ambiguous we can keep our new and desperate shareholders happy in both ways.
Please consider this account deleted, I just can't be bothered with the spam anymore.
We need to allow app makers to do the things and offer the services we can't, the really intrusive stuff that we need plausible deniability over, and by monetizing our data via licensed app services which perform tasks which we find morally ambiguous we can keep our new and desperate shareholders happy in both ways.
But it's not really intrusive. People can change their privacy settings. Also big corporations are allowed to do it, so why not let everyone else in on it?
>> Some have praised its innovation
Er...what? 28 comments in 8 hours tells me no one cares about Graph Search - not even on SlashDot.
Even Zuckerberg's sister didn't know how to properly use the privacy settings, and she used to work there.
I would like to see their platform work on the combined datastores of linked open data (http://linkeddata.org/). Get fast answers because much of the processing is pre-indexed.
Surely this isn't a leak?
wow, that's a toughie.
Next week they'll knock our socks off by introducing "GraphSearch 2.0- Now with ORDERY BY!"
Re: "run it on a RAM sled with between 128 GB and 512 GB of memory" Google gave me absolutely nothing on RAM sleds. I've used RAM disks for years and even know of hard disk's that are flash-backed RAM for performance. 128GB-512GB of RAM? If I needed that in a server, SGI (rip) and others have it. I doubt that's what they mean, though, as it's expensive custom stuff. So, what is a RAM sled? And where are they bought or how are they set up? Thanks ahead of time for any answers.