Slashdot Mirror


Inside Facebook Data Mining Research Group

holy_calamity writes "Technology Review has an in depth profile of the team at Facebook tasked with figuring out what can be learned from all our data. The Data Science Team mine that information trove both in the name of scientific research into the patterns of human behavior and to advance Facebook's understanding of its users. Facebook's ad business gets the most public attention, but the company's data mining technology may have a greater effect on its destiny — and users lives."

30 comments

  1. This can't be right... by Anonymous Coward · · Score: 1

    According to common knowledge they just burn it to DVDs and sell it to unnamed businesses nobody knows.

    I'm going with uninformed nerd rage on this.

  2. Re:first ! by enickel · · Score: 1

    Glad to see that the quality of our first posts is still worldclass.

  3. A worst case scenario by Anonymous Coward · · Score: 0

    As a result of an avalanche of bits, being trapped underground in a Facebook data mine.

    1. Re:A worst case scenario by linatux · · Score: 1

      I'd be more worried about a methane explosion!

  4. TL;DR by slasho81 · · Score: 5, Informative

    I went against my intuition and read TFA. The whole 4,200 words of it.

    It's a complete fluff piece and doesn't contain any interesting new knowledge regarding human behavior or social networks, which you would expect from an "in depth" article about Facebook's data mining.

    There are some tidbits regarding old stuff (4 degrees of freedoms between "friends"), obvious stuff (93% of friends met in real life), and a bunch of other vaguely presented stuff with questionable validity.

    1. Re:TL;DR by linatux · · Score: 5, Funny

      I followed my intuition & barely skimmed the summary. Mining FB would be like making a BBC documentary about reality TV.

    2. Re:TL;DR by Anonymous Coward · · Score: 1

      [i]At Facebook, our engineers collaborate to create an open environment where ideas win and are executed quickly.[/i]

      At Facebook, we don't understand grammar, and we use images of birds holding pens that looks like bird with strange penises.

    3. Re:TL;DR by Raenex · · Score: 1

      It's a complete fluff piece and doesn't contain any interesting new knowledge regarding human behavior or social networks, which you would expect from an "in depth" article about Facebook's data mining.

      Really? I found a lot in the article interesting:

      • "Since last fall, Facebook has also been able to collect data on users' online lives beyond its borders automatically: in certain apps or websites, when users listen to a song or read a news article, the information is passed along to Facebook, even if no one clicks "Like." Within the feature's first five months, Facebook catalogued more than five billion instances of people listening to songs online."
      • "For the first time," Marlow says, "we have a microscope that not only lets us examine social behavior at a very fine level that we've never been able to see before but allows us to run experiments that millions of users are exposed to." [emphasis mine]
      • "So he messed with how Facebook operated for a quarter of a billion users. Over a seven-week period, the 76 million links that those users shared with each other were logged. Then, on 219 million randomly chosen occasions, Facebook prevented someone from seeing a link shared by a friend. Hiding links this way created a control group so that Bakshy could assess how often people end up promoting the same links because they have similar information sources and interests. " [emphasis mine]
      • "He found that our close friends strongly sway which information we share, but overall their impact is dwarfed by the collective influence of numerous more distant contacts--what sociologists call "weak ties." It is our diverse collection of weak ties that most powerfully determines what information we're exposed to."
      • "One of Marlow's researchers has developed a way to calculate a country's "gross national happiness" from its Facebook activity by logging the occurrence of words and phrases that signal positive or negative emotion. Gross national happiness fluctuates in a way that suggests the measure is accurate: it jumps during holidays and dips when popular public figures die. After a major earthquake in Chile in February 2010, the country's score plummeted and took many months to return to normal. That event seemed to make the country as a whole more sympathetic when Japan suffered its own big earthquake and subsequent tsunami in March 2011; while Chile's gross national happiness dipped, the figure didn't waver in any other countries tracked (Japan wasn't among them). "
      • "But some of his team's work and the attitudes of Facebook's leaders show that the company is not above using its platform to tweak users' behavior. [..] In April, influenced in part by conversations over dinner with his med-student girlfriend (now his wife), Zuckerberg decided that he should use social influence within Facebook to increase organ donor registrations. Users were given an opportunity to click a box on their Timeline pages to signal that they were registered donors, which triggered a notification to their friends. The new feature started a cascade of social pressure, and organ donor enrollment increased by a factor of 23 across 44 states. "
      • "Marlow says that in the 2010 election his group matched voter registration logs with the data to see which of the Facebook users who got nudges actually went to the polls. (He stresses that the researchers worked with cryptographically "anonymized" data and could not match specific users with their voting records.) "
      • "In a kind of passing of the technological baton, Facebook built its data storage system by expanding the power of open-source software called Hadoop, which was inspired by work at Google and built at Yahoo. Hadoop can tame seemingly impossible computational tasks--like working on all the data Facebook's users have entrusted to it--by spreading them across many machines inside a data center. But Hadoop wasn't built with data science in mind, and using it for that purpose requires speci
  5. And if you want to join their data science team... by adamkennedy · · Score: 1

    ... Facebook is running an open call data science competition to win an interview/job on their data science team.

    (Disclosure: My work is running the competition for them)

  6. Re:FB, stop telling everybody what I comment or li by Anonymous Coward · · Score: 2, Informative

    Go whine on one of the hundred Facebook groups for this you nitwit, nobody here cares, Facebook doesn't care. If you don't like it, stop using it. End of story.

  7. Threat to mankind by Anonymous Coward · · Score: 0

    http://archive.org/details/EbenMoglen-WhyFreedomOfThoughtRequiresFreeMediaAndWhyFreeMedia

    Facebook cannot be allowed to datamine the human condition or we're all damaged.

  8. In the interest of transparency by Anonymous Coward · · Score: 1

    Somebody post the identities and interests of the team to the open Internet, for the whole world to see.

    Let us track and analyze them like the livestock they believe us to be.

  9. Try google+ by Anonymous Coward · · Score: 1

    Yes, I left facebook several years ago and recently joined google+. What is on here?
    Well...... an endless stream of crap. If you find friends on it, you can put them in your "circles". FANTASTIC!!!
    I guess I'm just too old for this stuff, going back to share my warez on the bbs.

  10. Re:And if you want to join their data science team by girlintraining · · Score: 4, Interesting

    ... Facebook is running an open call data science competition to win an interview/job on their data science team.

    Anyone with half a brain will run away screaming from that offer, but not for the obvious reasons. A company that's recently post-IPO has mostly multimillionaires for employees -- and they can and will treat anyone who isn't like dirt. In a few years, if Facebook manages to turn around it's epic failure of an IPO (Well, from a business standpoint... Zuckerberg and his crew are still flush with cash) and grows their employee base by a significant amount, it may be worth considering.

    But right now, it's a job for the kids fresh out of college; they won't know that the mistreatment isn't normal and might actually stick around for a couple of years before burning out.

    --
    #fuckbeta #iamslashdot #dicemustdie
  11. Re:first ! by thereitis · · Score: 0

    Sure, but can it _clean_ my PC?

  12. Re:And if you want to join their data science team by Anonymous Coward · · Score: 0

    A company that's recently post-IPO has mostly multimillionaires for employees?

    Like, >50%? Guess again! The money doesn't distribute so evenly!

    epic failure of an IPO?

    The company's goal is to sell a share of itself for the highest price it can. How did Facebook fail?

  13. Re:And if you want to join their data science team by girlintraining · · Score: 2
    --
    #fuckbeta #iamslashdot #dicemustdie
  14. Re:And if you want to join their data science team by Anonymous Coward · · Score: 2, Informative

    ... Facebook is running an open call data science competition to win an interview/job on their data science team.

    Anyone with half a brain will run away screaming from that offer, but not for the obvious reasons. A company that's recently post-IPO has mostly multimillionaires for employees -- and they can and will treat anyone who isn't like dirt. In a few years, if Facebook manages to turn around it's epic failure of an IPO (Well, from a business standpoint... Zuckerberg and his crew are still flush with cash) and grows their employee base by a significant amount, it may be worth considering.

    But right now, it's a job for the kids fresh out of college; they won't know that the mistreatment isn't normal and might actually stick around for a couple of years before burning out.

    I work at Facebook, and I can tell you that:
    1) Most employees are not multimillionaires
    2) I have been treated respectfully by everyone, from Zuck down, regardless of whether they are multi-millionaires, or hired last week
    3) I work with data at Facebook. It's one of a handful of places on the planet with this rich of a data set
    4) I'm not a kid fresh out of college and I've worked for a lot of companies in my career. The people I work with are the most talented I have met. I consider it a privilege to work with them.

  15. I just looked at the faces of them, smirks by Anonymous Coward · · Score: 0

    You read the article, I just looked at the faces, creeps, everyone of them.

    You can see the smirks on their faces, as if they know they're doing wrong and do it anyway.

  16. Re:And if you want to join their data science team by lightknight · · Score: 1

    Ah, Kaggle. That website / company is of personal interest to myself these days. If only their rewards were higher (with the exception of the Heritage Health Prize, which seems to be setting an example to the others; it could be higher, of course, but the fact that it's over a million has the obvious result of drawing in hideous numbers of teams; as such, they will probably get what they're actually looking for, which will save their company billions).

    It's the other competitions that are...kind of weak with the prize money. And what they want is arguably more difficult to create than the Heritage Health Prize.

    Now, if we could only get more companies to participate, and find a way to mitigate what I think might be slightly excessive taxation on those prizes...which is one of my darker fears: finding a solution, and having 40% of the money walk away. A million is a lot of money, true, but accounting for inflation and purchasing power, after taxes...it works out to two year's worth of salary for some of the better paid programmers out there. As I said, it's interesting, and tempting, but I'd look to sweeten the deal even further.

    What would be really funny is if the NSA started posting challenges, with large rewards, for various algorithms.

    --
    I am John Hurt.
  17. They do data mining? by Anonymous Coward · · Score: 0

    As far as I can tell, their data mining is pretty simple:

    if($user.gender == 'male' && $user.relationship_status == 'single') {
    &display_ad(type => 'dating');
    } else {
    &display_ad(type => 'housewares');
    }

  18. Re:And if you want to join their data science team by Johann+Lau · · Score: 0

    Yet you post anonymously.

    Also, talented at what? At having a spine? At paying attention? What kind of monkey talent, exactly, are we talking about?

  19. Re:And if you want to join their data science team by Raenex · · Score: 1

    which is one of my darker fears: finding a solution, and having 40% of the money walk away. A million is a lot of money, true, but accounting for inflation and purchasing power, after taxes...it works out to two year's worth of salary for some of the better paid programmers out there.

    I'd like to know just what percentage of programmers are getting paid 300k per year after taxes in salary. I'm guessing not a whole heck of a lot.

  20. Re:And if you want to join their data science team by lightknight · · Score: 1

    Hmm. I believe the median salary for programmers in San Francisco was $150k last I checked...I will have to look into it.

    You have to keep in mind, however, that I did specify 'the better paid' programmers, and that the sheer number of millionaire / billionaire programmers out there will skew the average. It might actually be easier to find someone being paid a few million than $300K...

    --
    I am John Hurt.