Slashdot Mirror


Concerns Over Microsoft's Internet User Profiling

jcatcw writes "Microsoft research on Internet user profiling could lead to tools that help repressive regimes identify anonymous dissidents, the Reporters Without Borders advocacy group warned last Friday. Microsoft's new algorithms correctly guessed the gender of a Web surfer 80% of the time, and his or her age 60% of the time. "In China, it is conceivable that this type of technology would be used to spot Internet users who regularly access such 'subversive' content as news and information websites critical of the regime," the group said."

144 comments

  1. If I were head of an oppresive regime... by Anonymous Coward · · Score: 4, Funny

    I'd be uneasy about partnering with a bunch of totalitarian control freaks like Microsoft.

    1. Re:If I were head of an oppresive regime... by Anonymous Coward · · Score: 5, Funny

      That said, I reckon I could guess the gender of a web surfer 80% of the time.

      if (site.equals("slashdot.org")) male = true;
      else if (site.equals("cutepuppydogs.com")) male = false;

    2. Re:If I were head of an oppresive regime... by nschubach · · Score: 1, Funny

      Great.. I can see it now:

      Due to a recent trend in programming. Feminist groups have decided they don't like being referred to as !male. They demand rule defined paring of boolean values "male" and "female" where setting male to false would trigger female equality to true. More on this at 11.

      --
      Every time I start to have faith in humanity, I ruin it by driving to work between 7 and 8 am.
    3. Re:If I were head of an oppresive regime... by Old+Benjamin · · Score: 0

      Especially since the software I already have does the job with 90% accuracy.

      --
      "The quickest way to end a war is to lose it" -Orwell
    4. Re:If I were head of an oppresive regime... by mrmeval · · Score: 1

      z0MG PONIES

      --
      I'd go on a Vegan diet but the delivery time from Vega is too long. --brownkitty
    5. Re:If I were head of an oppresive regime... by willabr · · Score: 1

      Jezz... what next a Microsoft Van taking pictures of me?

    6. Re:If I were head of an oppresive regime... by Patrik_AKA_RedX · · Score: 1

      Unnecessary. You know those little lights on your electronic equipment? They told you those are just light emitting diodes. What they didn't told you is that those are actually little cameras. And Big Bill is watching you... And I really mean you Willabr. So put on you pants and comb your hair. We, members of the oppresive regime secretly installed in your country, have feelings too, you know.

  2. oh noes! by Anonymous Coward · · Score: 4, Funny

    Unless you happen to be the only 20-25 year old male in China I think you're safe.

    1. Re:oh noes! by Short+Circuit · · Score: 4, Insightful

      Microsoft's new algorithms correctly guessed the gender of a Web surfer 80% of the time, and his or her age 60% of the time. "In China, it is conceivable that this type of technology would be used to spot Internet users who regularly access such 'subversive' content as news and information websites critical of the regime," With that kind if inaccuracy, it could hardly be considered a reliable indicator of identity. It seems much more tuned to generating demographic data.

      Not that China couldn't use that in targeted propaganda...
    2. Re:oh noes! by rapidweather · · Score: 1
      This entire post has just gotten censored (self censorship), I am still hoping for a free copy of Vista from Microsoft for posting "favorable" comments on Slashdot.
      Not their idea, mine. Leaving no stone unturned, I say.
      If I did actually get a free copy of Vista, I'd put it on the living room table right next to the picture of Thomas Jefferson (1743-1826). His picture is on the Two Dollar Bill laying there.

      As you can see, I am horribly bored this evening, but I am doing a test:

      I'm running the linux OS you see in the Screenshots link below on a SanDisk 2 GB Cruzer Micro, partitioned like this:
      SDA1 700 MB for Knoppix (the CD itself)
      SDA5 700 MB for Persistent Home Directory
      SDA6 400 MB for GIMP and K3B swap, additional storage also.
      SDA7 150 MB linux swap.

      Here is the output from TOP:
      Tasks: 42 total, 1 running, 41 sleeping, 0 stopped, 0 zombie
      Cpu(s): 5.5% user, 3.9% system, 0.0% nice, 90.6% idle
      Mem: 385800k total, 218864k used, 166936k free, 6456k buffers
      Swap: 155192k total, 0k used, 155192k free, 130392k cached
      (Uptime not shown, is 2:05)

      This PC is a HP Pavilion 8250, Celeron 267 MHZ.
      To get the USB drive booted up, I use a small HDD with MSDOS 6.21, and these files.
      The small "boot" drive only runs for about 20 seconds, the Menu comes up, you make a choice, then the Linux OS runs entirely from the USB drive. The necessary "loadlin" command line is:
      loadlin vmlinuz initrd=miniroot.gz BOOT_IMAGE=knoppix xmodule=s3virge fromhd=/dev/sda1 home=/dev/sda5

      Browser is Firefox 2.0.0.4

      Thanks for listening...

      Rapidweather

    3. Re:Oh noes! by Anonymous Coward · · Score: 0

      Not to start a holy war or anything, but guns are designed for the purpose of killing. Chainsaws are designed for the purpose of cutting wood. The user may modify either purpose, of course, but it's not the same. Maybe you could cut a tree with a gun if you had a truckload of ammo, or you can shoot at somebody in a non-vital part and not kill them (straight away, may still die as a result of the wound) or you can shoot close to somebody to convince them to act as they are told (but what convinces them is the threat of you using the gun for its intended purpose, which is to kill). You could kill somebody with a chainsaw, just like you could kill somebody with just about anything because we are fragile things and easily broken but comparing guns and anything-not-designed-for-the-purpose-of-killing is logically flawed. Yes, "guns don't kill people" they are the tools that people build and design for that purpose.

  3. Title should be: by Anonymous Coward · · Score: 0

    Group rips Microsoft over Internet user profiling research
    group said to be mostly male ranging in age from 25-40

  4. Poor accuracy by mws1066 · · Score: 4, Informative

    80% and 60% are both actually very poor accuracies. I wouldn't be worried; this won't be taken seriously as any type of reliable profiling.

    --
    Nothing is more dangerous than a programmer with a screwdriver.
    1. Re:Poor accuracy by Lockejaw · · Score: 4, Insightful

      I am worried about such low rates. People seem inclined to believe anything a sufficiently large computer says.

      --
      (IANAL)
    2. Re:Poor accuracy by ajanp · · Score: 5, Insightful

      That same day, Erik Bratt, a Microsoft marketing communications manager, fired a preemptive salvo about the company's age- and gender-guessing research. Bratt first downplayed the research results, saying: "[The researchers] actually found that they could not, with a high degree of accuracy, predict age from Web browsing activity." He also swore the Redmond, Wash. developer off using the resulting algorithms. "Microsoft currently has no plans to use the capabilities found through this research in our products and services," Bratt said. So... they tell people that they have the ability to kind of, maybe, sort of, predict age and gender, but that their current algorithms are basically just a bunch of BS. Then they decide to also mention that despite the fact that the foundation for this technology doesn't really work accurately and the have no plans to use it in any products, they're going to continue working on it anyways.

      Microsoft's researchers said they would expand their work to other demographic attributes, such as occupation, educational degree and geographic location I mean, ofcourse that's the logical thing to do so that they could hopefully get it to work more accurately, but why mention that you're even working on the technology if you're going to immediately dismiss it by saying that it doesn't really work.

      Then again, now I guess it solidifies the opinion about why Microsoft is really so sore about the Google-Doubleclick deal http://slashdot.org/article.pl?sid=07/04/16/021720 3. Google's got a leg up on what they have apparently already been working on and now they're at a disadvantage.

      --
      File Deletion is Murder.
    3. Re:Poor accuracy by Turn-X+Alphonse · · Score: 1

      Isn't that worse?

      "We know a guy of 30-35years was looking at anti-freedom/love/justice/whatever material in this net cafe on Thursday at 3pm, would that be you sir? No? Well they all say no at first..."

      While that 20 year old woman is giggling from her window as she beat "the system" yet again.

      At least if it's accurate you're okay if you behave (no I am not supporting it, just saying), where as if it's inaccurate.. well fuck, better odds on a coin flip.

      --
      I like muppets.
    4. Re:Poor accuracy by ShiNoKaze · · Score: 1

      How accurate it is doesn't matter. What matters is if the person who would make money deploying it can pitch it correctly to the people in charge. If so, they'll still try it. China would be the perfect place to start as they don't seem as though they'd be worried about false positives.

    5. Re:Poor accuracy by reddburn · · Score: 3, Interesting

      80% and 60% are both actually very poor accuracies. I wouldn't be worried; this won't be taken seriously as any type of reliable profiling. It depends on the context. If you were talking baseball, it'd be a pretty damn good batting average. It's also good enough to get a warrant, and good enough for a grand jury to indict, and those latter two are what should worry you.
      --
      "Those who believe in telekinetics, raise my hand" - Kurt Vonnegut, Jr.
    6. Re:Poor accuracy by moderatorrater · · Score: 2, Funny

      We need to start actively throwing their algorithm off. If every one of us guys were to start visiting hellokitty.com and mypinkfluffybarbieplayingwithponies.com and all the women were to start reading slashdot, they wouldn't be able to tell us apart!

    7. Re:Poor accuracy by Anonymous Coward · · Score: 0

      I wouldn't be worried

      Because you'll be in the 40% that the computer messes up and labels as a subversive, right?

      Countries like China don't care if they're right or wrong, as long as they have a few bodies to cow the masses into submission, at the end of the day they don't care how the bodies get there. Works almost as well as hooking a person up to a photocopier and telling them it's a lie detector.

    8. Re:Poor accuracy by catbutt · · Score: 1

      Regarding the age, how can they say "60%", and how can you say is is not accurate, when the tolerance isn't specified? Do they mean to the year? (I kinda doubt it)

    9. Re:Poor accuracy by acvh · · Score: 1

      this is their first stab. they'll get better at it.

    10. Re:Poor accuracy by corifornia · · Score: 0

      Hell I have better accuracy than that. . . Goatse (male), Facial Site (male), MSN(douche), Slashdot(nerd), LifeStyle(female),

      --
      crap.
    11. Re:Poor accuracy by MyOtherUIDis3digits · · Score: 3, Funny

      I'm already doing my part. While my surfing habits are definitely male-oriented, I'm sure they would determine me to be 14-16 tops.

      --
      Ignore anything I said above, I actually agree with everything you believe - mod accordingly.
    12. Re:Poor accuracy by denissmith · · Score: 1

      But it could correctly indicate you were 60% likely to be a dissident, which is good enough in most jurisdictions.

      --
      I have nothing to hide. So, why are you spying on me?
    13. Re:Poor accuracy by nschubach · · Score: 1

      ...or a sufficient number of computers called the Internet.

      --
      Every time I start to have faith in humanity, I ruin it by driving to work between 7 and 8 am.
    14. Re:Poor accuracy by Anonymous Coward · · Score: 0

      Or a computer that churns out random beeps and buzzes like from the eSurance commercials.

    15. Re:Poor accuracy by Anonymous Coward · · Score: 0

      80% on gender?

      How can you get less than 100% ?

      Gender = Male.

      There are no girls on the internets.

    16. Re:Poor accuracy by Anonymous Coward · · Score: 0
      No girls in the Internets? Not even in the lesbian chat rooms?!?!?!?!

      Oh no..........

    17. Re:Poor accuracy by Anonymous Coward · · Score: 0

      Obviously, you don't understand China (or M$). This will be used, regardless of its 'accuracy' to persecute someone who is not toeing the line. 'Oh look, the software says it's YOU!', so they can drag off anyone they like to the local dungeon. The Chinese authorities will use this as 'evidence' to extract confessions, etc. regardless of the accuracy, etc. Heck, they'll think it's great, whether it works or not!

      M$ will undoubtably makes millions in a deal with the Chinese government, corrupt officials, overzealous control freak/gestappo/authority/police types, etc. the perfect software for the perfect totalitarian regime from the perfect totalitarian monopoly...sigh..

    18. Re:Poor accuracy by DeadChobi · · Score: 1

      There are girls on the internet? Amazing!

      --
      SRSLY.
    19. Re:Poor accuracy by Anonymous Coward · · Score: 0

      We need to start actively throwing their algorithm off. If every one of us guys were to start visiting hellokitty.com and mypinkfluffybarbieplayingwithponies.com and all the women were to start reading slashdot, they wouldn't be able to tell us apart!

      But I already visit a Hello Kitty website
    20. Re:Poor accuracy by Anonymous Coward · · Score: 0

      There are girls on the internet? Amazing!

      Only a gay guy could have missed that...

    21. Re:Poor accuracy by Stefanwulf · · Score: 1
      Baseball's being a defensive sport aside, I'd argue that 80% accuracy should certainly not be good enough for a grand jury. Let's examine a hypothetical case where someone is brought before a grand jury for cime X. The situation:
      • A profiling procedure has been developed which can tell with 80% accuracy if this person is the type of person who would commit crime X.
      • Out of the population at large, let's say X is a very common crime, and 10% of the population has committed it. (In other words P(Crime) = 0.1, and P(!Crime) = 0.9)
      • If a person commits crime X, then the test returns true 80% of the time. (P(Positive|Crime) = 0.8)
      • If a person has not committed crime X, then the test returns true 20% of the time. (P(Positive|!Crime) = 0.2)
      • By bayes theorem, P(Crime|Positive) = (P(Positive|Crime) * P(Crime)) / (P(Positive|Crime) * P(Crime) + P(Positive|!Crime)*P(!Crime))
      • Unless I've made a mistake, this works out to (0.8 * 0.1) / ((0.8 * 0.1) + (0.2 * 0.9)), or roughly 0.308
      • That means that for any given person indicated by the profiling procedure as being a criminal, there is about a 70% chance that they did not do it, and only roughly a 30% chance that they did.
      I would certainly hope that presenting a grand jury with only a 30% chance of guilt would not be enough (on its own) to indict. And please keep in mind that we are assuming a very high proportion of the population is guilty here. In cases where less than 10% of the population has committed the crime, then the liklihood of a false positive for an individual case would be even higher.
    22. Re:Poor accuracy by armareum · · Score: 0

      You didn't make a mistake. You should be modded up as informative, although most people don't actually understand statistics, so I doubt you will be. :(

      And, if you'll allow me, now to summarise your calculation for the layman (the language won't be fully technically correct, but hopefully lay enough that it can be understood. Please post corrections (or questions):

      You took the chance that the person selected from the population was correctly identified as the criminal (0.8 * 0.1), or 80% of 10%, and divided it by the total chance of the a person being identified (Positive result when IS criminal + positive result when ISN'T criminal), or (80% * 20%) + (20% * 90%).

      Which give you your 0.308 chance that person indicated is the criminal.

      Thankfully they also apply the same statistical analysis to medical tests to see how effective they are.

      --
      Is this a rhetorical question?
    23. Re:Poor accuracy by rtb61 · · Score: 1

      When it comes to data poisoning the best way is an automated extension that just distorts your profile by mixing it with others, when done properly we all end up looking like one homogeneous mass. From junk data embedded in emails, to fake search routines and even random background browsing across a whole range of sites, no matter how much data their privacy invasive computers can store, hundreds of millions of users can create even more random worthless data and flood them under. What is cool is as their capability to handle more data increases so capability of generating more random 'suspect' crap also increases (when we are all guilty who will they suspect).

      --
      Chaos - everything, everywhere, everywhen
    24. Re:Poor accuracy by Stefanwulf · · Score: 1

      Thanks, that's a really good summary.

  5. If you were a totalitatian regime by supersnail · · Score: 4, Insightful

    ... wouldn't it be easier to look up the IP address and persuade the ISP to hand over the user details?

    The old ways are often the best.

    --
    Old COBOL programmers never die. They just code in C.
    1. Re:If you were a totalitatian regime by voislav98 · · Score: 1

      Or block access to the websites in question? It's going to be a cold day in Hell when a totalitarian regime starts relying on Microsoft to do the spying for them.

    2. Re:If you were a totalitatian regime by ozmanjusri · · Score: 2, Interesting
      It's going to be a cold day in Hell when a totalitarian regime starts relying on Microsoft to do the spying for them.

      You reckon? http://www.washingtonpost.com/wp-dyn/content/artic le/2007/01/08/AR2007010801352.html

      --
      "I've got more toys than Teruhisa Kitahara."
    3. Re:If you were a totalitatian regime by monk.e.boy · · Score: 2, Funny

      Hey! I live in the UK, where they can just look through my window with one of the ubiquitous CCTV cameras and just watch me browse the net...

      ...monitor my books, my mail...

      ...the rate at which I scratch my arse (there are by-laws that I won't go into, but let's just say if I do it more than 5 times in an hour and don't immediately go see the doctor, the para's turn up)...

      Microsoft are waaay behind.

      monk.e.boy

    4. Re:If you were a totalitatian regime by MrMr · · Score: 1

      But that is the complete opposite; In that report we have Microsoft relying on a totalitarian regime to do the spying for them...

  6. more useless statistics by pytheron · · Score: 1

    What percentage of net-users are male ? female ? If 80% are male, and the algorithm just guessed Male all the time, unless they bought 'ladies items' online, then they would have a pretty good accuracy. Age group usage would represent a bell graph, not difficult to again skew your results to favourably reflect on your algorithm.
    Unless of course these results were made under strict scientific obervance and imparitiality.. nah !

    --
    "I am not bound to please thee with my answers" [William Shakespeare]
    1. Re:more useless statistics by Conspiracy_Of_Doves · · Score: 1

      Actually, more women than men use the internet.

    2. Re:more useless statistics by PsEvo · · Score: 1

      What about all those men buying 'ladies items' :O

      --
      "ATI cards are like buses...They're huge, red and have bad drivers."
    3. Re:more useless statistics by pytheron · · Score: 1

      Only since they figured out they can get at their partners money via online banking

      --
      "I am not bound to please thee with my answers" [William Shakespeare]
    4. Re:more useless statistics by morgan_greywolf · · Score: 2, Informative

      Actually, no. More women under the age of 65 than their male counterparts use the Internet, according to Pew Internet. If you take into account all women and men of all age groups, then women still trail men by a few percentage points.

      If anything, you could say that men and women use the Internet about equally.

      Now, I'm almost positive that at least 80% of Slashdot readers are male, though there are an increasing number of females on this site. Many of them I think hide and don't reveal that they are female, so the statistics might be different than most of us suspect.

    5. Re:more useless statistics by Anonymous Coward · · Score: 0

      Best incredibly sexist comment ever. I'm in the library and had to stifle a laugh. If I could mod you +5 funny I would, since I can't I'll post AC to give you props.

    6. Re:more useless statistics by Conspiracy_Of_Doves · · Score: 1

      I stand corrected

  7. I'm unimpressed by Otter · · Score: 1
    Microsoft's new algorithms correctly guessed the gender of a Web surfer 80% of the time, and his or her age 60% of the time.

    Link to paper. I don't claim to be knowledgeable about this stuff but that success rate doesn't look too remarkable to me. China's sex ratio is hardly so skewed (yet, anyway) that this could remotely identify someone from a pool of a billion users, or even out of a single Internet cafe.

    I'd wonder more about the quality of research Microsoft is getting out of their Beijing site if they think this worth bragging about.

  8. Here's my algorithm: by erroneous · · Score: 5, Funny

    If you read this post you are probably male. (80% correct)
    You are probably in the 20-35 age group. (60% correct)

    (I know I'll only get negative responses to this post of the type "I'm reading this and I'm a 47-year-old woman!" That's Ok. You're in the other bracket.)

    My algorithm is as good as Microsoft's. Can I have a research grant please?

    --
    erroneous: look me up in a dictionary
    1. Re:Here's my algorithm: by spellraiser · · Score: 4, Funny

      You got me pegged. Curse you!

      Can I have my privacy violation lawsuit settlement, please?

      --
      I hear there's rumors on the Slashdots
    2. Re:Here's my algorithm: by dcollins · · Score: 1

      Also, the time is currently 11:45. (Correct 2 times per day.)

      --
      We know where leadership by an anti-intellectual "strongman" who scapegoats minorities and likes boisterous rallies goes
    3. Re:Here's my algorithm: by PhxBlue · · Score: 1

      "I'm reading this and I'm a 47-year-old woman!"

      As a 48-year-old grandmother, I'm offended that you called me 47!

      Actually ... I don't think that Slashdot-ism works here. Go figure. :)

      --
      !#@%*)anks for hanging up the phone, dear.
    4. Re:Here's my algorithm: by everphilski · · Score: 1

      sure, it works for slashdot, but does it work for a site dedicated to something more neutral, like say migraine headaches (predominantly female, 75/25 maybe) or cnn.com? Things that are somewhat gender neutral?

      I know, you were trying to be funny, but if this thing works across the board, 4 out of 5 positive ID's ain't bad.

    5. Re:Here's my algorithm: by Anonymous Coward · · Score: 1, Funny

      Are you trying to say that Slashdot is made up of mostly males!? And all this time I've been thinking this was a great place to meet chicks.

  9. Slashdot already has this feature by giafly · · Score: 2, Funny

    Microsoft's new algorithms correctly guessed the gender of a Web surfer 80% of the time, and his or her age 60% of the time.
    Just include {MERGE GENDER} and {MERGE AGE} in the comment area. For example...

    Your gender is ... MALE
    Your age is ... 18-30
    --
    Reduce, reuse, cycle
    1. Re:Slashdot already has this feature by messner_007 · · Score: 1

      A agree that most of slashdoters are male, but I don't agree that most of them are younger than 30.

      male > 98 %

      age 18-30 > 53 %

    2. Re:Slashdot already has this feature by Anonymous Coward · · Score: 0

      holy crap your right!!! I'm very scared

    3. Re:Slashdot already has this feature by MyOtherUIDis3digits · · Score: 1

      A agree that most of slashdoters are male, but I don't agree that most of them are younger than 30.
      male > 98 %
      age 18-30 > 53 %


      Looks like we have our next Slashdot poll...

      --
      Ignore anything I said above, I actually agree with everything you believe - mod accordingly.
    4. Re:Slashdot already has this feature by messner_007 · · Score: 1

      Pools are not accurate enough and slashdoters are known for being honest when they have to enter the information about themselves ...

      - 5-20
      - 20 - 40
      - CowboyNeal years
      - ....

    5. Re:Slashdot already has this feature by treeves · · Score: 1

      . . .I don't agree that most of them are younger than 30.

      age 18-30 > 53 %

      Well, which is it?

      --
      ...the future crusty old bastards are already drinking the Kool-Aid.
    6. Re:Slashdot already has this feature by lakeland · · Score: 1

      http://slashdot.org/pollBooth.pl?qid=421&aid=-1
      http://slashdot.org/pollBooth.pl?qid=406&aid=-1

      I don't know what the accuracy rating in /. polls is, or if it is a function of the poll, so I don't know how much salt you'll have to take those results with.

    7. Re:Slashdot already has this feature by 2names · · Score: 1

      Wouldn't "greater then 53%" constitute "most?"

      --
      "I'm just here to regulate funkiness."
    8. Re:Slashdot already has this feature by 2names · · Score: 1

      Sorry, I meant "greater than"

      --
      "I'm just here to regulate funkiness."
  10. What now? Are they trying to eliminate bash? by Anonymous Coward · · Score: 0

    What is this? Some attempt to embrace, extend, and extinguish bash? I mean, it's nice Microsoft is using Open Source tools, but damn, this is too much.

    I strongly believe the contents of my .profile file are private, and I don't want Microsoft reading it every damn time I surf the web.

  11. Only In China by Anonymous Coward · · Score: 0

    Yeah, sure.

    1. Re:Only In China by Anonymous Coward · · Score: 0

      Yeah, sure.

      Isn't Karl Rove a Chinese name?

  12. Google by Anonymous Coward · · Score: 0

    How is this any different then what Google is doing. Google wants to be able to tell you what to do during the day. Or what type of job to get etc. Hmmm I guess this article made front page news on /. because it fit a few requirements:

    1.) Good News for OSS or Linux
    2.) Privacy Concerns
    3.) Bashes Microsoft

    1. Re:Google by mattgreen · · Score: 1

      No, see, it's okay when Google does it, because they said they would "do no evil." And we believe them! They're a big company whose name isn't Microsoft, so we automatically love them without questioning!

  13. Then get rid of oppressive regimes by Anonymous Coward · · Score: 0

    The solution to oppressive regimes: undermine them and make them go away. Don't try to change them, just start something better until they cannot stop it.

    The internet has always been anarchistic at its core. But really it is not as much anti-order as anti-authority.

    So why not take away all the central authority?

    Here's how: http://www.metagovernment.org/

    We really can rule ourselves, dontchya think?

    1. Re:Then get rid of oppressive regimes by Anonymous Coward · · Score: 0

      The thing is, this sort of draconian behavior by Microsoft is exactly what stands in the way of the success of the metagovernment.

      How can you have an internet-based revolution if the internet is working for the dictators?

  14. Follow the Chinese! by snowraver1 · · Score: 1

    Why don't we just enter out SSN every time we browse the web, and we can avoid all this 60-80% nonsense All sites visited would be logged in a centeral database, and would be used to deliver targed advertising. Just think! Based on my browsing habits, every website will look like torrentspy.com.

    --
    Copyright 2010. All rights reserved. This comment may not be copied in any way including, but not limited to caching.
    1. Re:Follow the Chinese! by Anonymous Coward · · Score: 0

      OK, here's my social security number: 123-45-6789

  15. Not too bad for now... by taupin · · Score: 5, Insightful

    Printable Version

    Right now this doesn't worry me too much - after all, how much "identification of anonymous dissidents" could someone do based only on one's gender and a rough estimate of age? On the other hand, if Microsoft do expand to geographical location, occupation, and educational degree as mentioned, then it's rather worrying.

  16. Oh noes! by Anonymous Coward · · Score: 0

    And guns might be used to kill people! So might chainsaws! Jet engines might fall from the sky! OH NOESSSSSSSS!

    Tons of stuff *might* happen, is that a good reason to go around crying like Chicken Little?

  17. Collecting and storing the data is what matters by Ravnen · · Score: 1

    This is another example of why it's important to ensure that corporations aren't allowed to collect and store huge amounts of data about individuals. The fact that they can analyse it in some way or another is irrelevant if privacy is respected in the first place.

  18. You pays your money ... by Anonymous Coward · · Score: 0

    "this type of technology would be used to spot Internet users who regularly access such 'subversive' content as news and information websites critical of the regime"


    Fair enough, people are obviously content to live under such regimes so this shouldn't concern them too much and if it helps our companies open up the profitable markets that often exist in such countries then they should look to where the profits lie and not concern themselves with the rights of people who, after all, choose not to do anything about the oppressive regimes themselves.

    Multinational corporations generate a lot of wealth in countries where they operate so eventually this will trickle down to create an affluent middle class who can afford to buy arms and education to end their oppression. Really, when you get down to it, as we have seen in Iraq, Zimbabwe and elsewhere it's pointless intervening on the behalf of the oppressed citizens ourselves with forced regime change or sanctions.

    The best thing we can do is exploit the market open to us under whatever restrictions are in place and then later look for openings for a bit of arms dealing to disaffected factions.

  19. Microsoft = Carnie? by Anonymous Coward · · Score: 0

    Microsoft's new algorithms correctly guessed the gender of a Web surfer 80% of the time, and his or her age 60% of the time. ...maybe it's not about profiling. Could be the first step towards opening their Microsoft Land theme park.

    Hurry, hurry. Step right up. Guess Your Age and Gender!
  20. O Rly? by vivaoporto · · Score: 5, Insightful

    First things first: why China? (The same question applies to Venezuela, Russia, Brazil or whatever is the target of the Slashdot "fifteen minutes of hate" of the day). Of course people should be concerned about what these countries do wrt losses of privacy and basic rights, but what about U.S. and E.U.? As we talk, they are working on a new agreement to share data from passengers on trans-Atlantic flights, a much more effective way to profile people, because it contains name, address, gender, destination, credit card number, everything, without needing to make any kind of assumption, everything is plain and clear. This is why I think that not only "in China", as the summary states, but in most countries in the world, this information can and will be used to tag people indiscriminetaly, subversive or not, terrorist or not, law abiding or not. So, take care of your own backyard before to point the poison ivy in your neighbor one.

    Second, it is not like if Microsoft was the only one researching and developing on this field and, more than that, it is not like if Microsoft was not researching on this field, any government interested on this kind of technology would not research itself, or fund research on its public universities. So, throwing Microsoft name on the mix only reinforces my point, this submission is nothing but a flamebait, being the flame targets the usual suspects, proprietary software and communism.

    1. Re:O Rly? by Anonymous Coward · · Score: 0

      Why China? Are you retarded?

      They, unlike the nations you have listed, have a long and sordid history of human rights violations. In particular, they meet dissent with execution. China is not in the same league as any nation in the west. China is a brutal, oppressive regime.

    2. Re:O Rly? by bahstid · · Score: 3, Funny

      think of the child^H^H nese!

    3. Re:O Rly? by moderatorrater · · Score: 1

      Agreed. In addition, China's already shown a propensity for messing with the internet, so they could just institute national policies that directly link your activities with your name. I know it wouldn't be perfect and that you could get around it, but it would be a whole lot better than profiling by surfing habits.

      The real places this could be used are in countries that have enough freedom to not be able to meddle at a base level, but that have or desire a high degree of surveillance on their citizens (Great Britain works with CCTV, the NSA probably already does it). Whether you should worry about it or not is another issue entirely...

    4. Re:O Rly? by Anonymous Coward · · Score: 0

      Sounds like the US to me. They probably have the worst record now, after Iraq, and are actively building new machines to torture their citizens.

      Plus they kidnap other countries citizens and lock them up with no evidence. Even China does not that!

    5. Re:O Rly? by Prune · · Score: 1

      Funny that communism and proprietary software would be the 'usual suspects' as you say, given that the opposite of proprietary software, open source, is an ideology not that unfitting in a communist framework.

      --
      "Politicians and diapers must be changed often, and for the same reason."
    6. Re:O Rly? by westlake · · Score: 1
      As we talk, they are working on a new agreement to share data from passengers on trans-Atlantic flights, a much more effective way to profile people, because it contains name, address, gender, destination, credit card number, everything, without needing to make any kind of assumption, everything is plain and clear.

      In short, everything that an international traveler has had to disclose to authorities since the beginning of the modern era.

      Identity. Citizenship. Financial responsibility. No legal barriers to travel. No medical barriers to travel. Etc.

    7. Re:O Rly? by Hatta · · Score: 2, Funny

      China is not in the same league as any nation in the west. China is a brutal, oppressive regime.

      You're right, the US is a kinder, gentler, oppressive regime.

      --
      Give me Classic Slashdot or give me death!
    8. Re:O Rly? by Hatta · · Score: 2, Interesting

      First things first: why China?

      Exactly, there's just as much reason to be afraid of these things in oppressive first world regimes. The US government is already getting some websites to turn over their user lists. How long until they talk Microsoft or Google into giving them access to their data mining facilities? If they can get 80% accuracy, well that's probable cause, and that will get them a warrant for a search. Eventually we'll all have to be very careful what we search for, lest we end up on a list somewhere.

      --
      Give me Classic Slashdot or give me death!
    9. Re:O Rly? by JasonTik · · Score: 1

      80% accuracy of a given demographic is not probable cause to search anything. 80% accuracy for the selection of an individual is ok, but for a demographic is not enough to search every member of that demographic (or any selected members) (yet)

  21. BFG by m0nkyman · · Score: 1

    What? You think that the government of China doesn't have the resources or smarts to do this research and development themselves? Come on, shutting down research because the Chinese government might use it badly is very, very silly.

    --
    ~ a low user id is no indication I have a clue what I'm talking about.
    1. Re:BFG by gEvil+(beta) · · Score: 1

      And what does this have to do with guns (or giants, depending on how you interpret the acronym)? I only ask because you brought it up...

      --
      This guy's the limit!
  22. Non-News by Anonymous Coward · · Score: 0

    Profiling is done today in advertizing industries. How is this any different? Many applications for that matter can be used differently than it is intended for. We use CD's to store documents or ripped songs. This is no news. Advocating that the research should never take place is BS.

  23. Tradeoff by Belacgod · · Score: 1

    Any technology that reduces anonymization could have this effect. It's a tradeoff--ease of mobiliy for dissidents vs eliminating obnoxious assholes. So every time you have to suffer through a troll, you're protecting freedom-loving Chinese!

  24. Waste of my time, your time, their time.... by styryx · · Score: 1

    So now in addition to Tor et al and the things that help privacy (sending Google random data as search queries) all we will have to do is have something in the background opening up male/female sites over all popular age ranges. Way to have to cream everyone's bandwidth. Sheesh, is there anything you CAN get right, MicroJerk?

    Profiling is akin to racism in my book. It's against democracy any way you look at it.

  25. Priorities for Reporters without Borders by Nymz · · Score: 1

    I realize the concept that this software, if it became more accurate, could be used by repressive regimes against their citizens. But as far a priorities go, I think they would do better to concentrate on bringing attention to human rights violations, and educatiting people about the rule-of-law.

  26. The Usual Suspects by blueZhift · · Score: 3, Interesting

    This reminds me of the scenes in Casablanca where the police are told to round up the usual suspects. Ultimately the accuracy doesn't matter to the government anyway. Worldwide, I think we are moving in a direction of less freedom rather than more, spearheaded by wrong-headed anti-terrorism hysteria in the US. So why should they care about accuracy, they'll just round up whoever fits the profile and sort it out later, or not.

    1. Re:The Usual Suspects by Anonymous Coward · · Score: 0

      Worldwide, I think we are moving in a direction of less freedom rather than more, spearheaded by wrong-headed anti-terrorism hysteria in the US.

      Uh, yeah. The US is the fault of all of this. Let's forget the countries that openly censor web content and access.

      Next time you want to bash the US maybe you need to look towards other shores first.

      In comparison the US is a small potato when you put them up against the likes of China.

      Oh, but this is slashdot. The same fucktards who weep for the "wrongful imprisonment" of terror suspects somehow don't bat an eye when it comes to the genocide committed in other countries.

  27. Let me tell you... by PsEvo · · Score: 1

    "Our new algorithm you ask? Well, we took a look at the MSN search log and counted the ratio of keywords such as 'boobs' and 'anal'." Considering the various interests and beliefs of men and women, no algorithm could accurately guess a gender, even age.

    --
    "ATI cards are like buses...They're huge, red and have bad drivers."
  28. Extension time methinks by ajs318 · · Score: 2, Insightful

    What about somebody writing a browser extension that performs bogus searches in the background, for no better reason than to frustrate "profiling" attempts? Is this feasible?

    --
    Je fume. Tu fumes. Nous fûmes!
    1. Re:Extension time methinks by alexhs · · Score: 3, Informative

      What about somebody writing a browser extension that performs bogus searches in the background, for no better reason than to frustrate "profiling" attempts? Is this feasible? Already done
      --
      I have discovered a truly marvelous proof of killer sig, which this margin is too narrow to contain.
  29. Article would be more useful if.... by CodeShark · · Score: 1

    They detailed how much of the information they gather is from using MS related software such as Internet Explorer, etc.

    Which is to say, how much info can be gathered using a non Ms browser such as Firefox with a Non MS operating system such as Apple or Linux, and avoiding non MS- dominated web sites?

    The more important questions are a)what extent is this important to free societies and breaking the grip of totalitarian regimes on their societies?, and b) to what extent do we as memebers in free societies need to revolt against "corporate society" and their accelerating tendency to glom together data about private individuals??

    --
    ...Open Source isn't the only answer -- but it's almost always a better value than the alternatives...
    1. Re:Article would be more useful if.... by westlake · · Score: 1
      how much info can be gathered using a non Ms browser such as Firefox with a Non MS operating system such as Apple or Linux, and avoiding non MS- dominated web sites?

      Pretty much everything anyone could want.

      When you click on a prostate cancer site it isn't hard to guess your likely age and gender.

    2. Re:Article would be more useful if.... by CodeShark · · Score: 1

      True. I guess the better question would have been "how much of the statistics are derived from M$ server software and/or M$ client software -- because a server can always collect information at an extreme level.

      --
      ...Open Source isn't the only answer -- but it's almost always a better value than the alternatives...
  30. Fuzz about nothing by Rob+Kaper · · Score: 1

    I don't get it. No matter what age-group/gender combination you think of, even combined with geodata and occupation/education levels, this doesn't even come close to identifying individuals. Unless you actually believe "a male farmer in his 30s in the Shanghai region" or "a female grandmother in the suburbs of Houston" is significantly detailed.

    It makes more sense to worry about accidently getting linked to personal details left in instant messaging, e-mail, community profiles and/or conversations.

  31. Ok boys - Time to visit that Tampon manufacturer.. by sjwest · · Score: 1

    Well you you why...

    ps: Humour

  32. I know who it is! by Phat_Tony · · Score: 4, Funny

    Right, so they're about 50% sure that it's someone who's both male and aged 24-30, living in China. It should be easy to pick out the individual from there.

    --
    Can anyone tell me how to set my sig on Slashdot?
    1. Re:I know who it is! by Oswald · · Score: 1

      Or it would be, if only they knew his hair color.

  33. USE /. (slashdot's) methods, they work, afaik! by Anonymous Coward · · Score: 0

    "Microsoft research on Internet user profiling could lead to tools that help repressive regimes identify anonymous dissidents" - from the article/item here @ slashdot /.!

    &

    "... wouldn't it be easier to look up the IP address and persuade the ISP to hand over the user details? The old ways are often the best." - by supersnail (106701) on Monday June 04, @10:52AM (#19382003)

    They should instead, use a page from the /. (slashdot) playbook, because, afaik (& have tried)? You can't GET here "via anonmyous proxies" (@ least the GOOD, highly anonymous rated ones that is)...

    THAT? That is a GOOD design here by ALL means, imo & yes, attempts @ it & thus, + experience, if you don't want "anonymous untrackable nobodies" posting things on your websites etc.!

    Add the servers noted @ spots like here:

    http://www.publicproxyservers.com/page1.html

    To a DB engine/filter here, & filter out the "anonymous lusers/cowards", everywhere. This would save time in doing ping/traceroute detections, which can fail, on most ANY OS out here now (modern ones @ least) that are set to NOT reply to them (this IS doable).

    The data for this is out there (see the url above), which should function to save time in designing ornate/complex ping-trace routines too into anyone's website, imo @ least.

    (See - Unless anybody can tell me how to use a totally, highly anonymous proxy server here? Afaik, it can't be done @ /. (here), & again - that's "kudos" I must give to this site's designers, imo (provided I am correct, but I have tried it, & never was successful @ it though)).

    APK

    (Signing my name here, for once - though I usually just post as "A/C" (no desire for mod points/karma, etc. et al) on my part, & this is all, for the most part, & the desire NOT to be 'trackable' easily by some folks I have had hassles with over time (arstechnica) as well on my part))

    I have reasons for this, see here:

    http://www.windowsitpro.com/articles/index.cfm?art icleid=41095&cpage=195#feedbackAnchor

    P.S.=> Yes, new highly anonymous servers pop up (but, then again, so do virus/trojans/malware etc. et al), but that is what UPDATES/PATCHES are for... not to a binary in this case, but a referring table instead... apk

  34. US and Chineese Methods by twitter · · Score: 1

    wouldn't it be easier to look up the IP address and persuade the ISP to hand over the user details?

    Requiring the ISP to keep records with "wiretapping" laws and then getting the details is the US method. Farming out the collation of records to a company like Choice Point goes beyond the laws and is both cheap and efficient.

    In China, the regime is the ISP and they have the best equipment and methods that US companies could provide.

    --

    Friends don't help friends install M$ junk.

  35. Must be Monday... by stubear · · Score: 0, Flamebait

    ...because the slashmonkeys are arguing that technology that can be abused is now considered to be bad. Tomorrow there will be ab article about how the RIAA/MPAA are trying to shut down P2P networks and he sheep will graze on the other side of the field once more.

  36. M$ Apologists by twitter · · Score: 2, Informative

    No, it's not OK when Google does things that might be used to harm people who have done nothing wrong.

    This thread, however, is about the much nastier things that M$ does gleefully. I'd enjoy it if you compared the details.

    --

    Friends don't help friends install M$ junk.

  37. can't detect age - only maturity ... by petes_PoV · · Score: 2, Insightful

    ... which is a completely different and only slightly correlated attribute.

    --
    politicians are like babies' nappies: they should both be changed regularly and for the same reasons
  38. Strange age guessing by saibot834 · · Score: 1

    Microsoft's new algorithms correctly guessed the gender of a Web surfer 80% of the time, and his or her age 60% of the time.

    A bit strange in my opinion. "Guessing the gender correctly" has already 50% even if you don't have any data about the user. So there is not much improvement here.
    But the age... if you really guess the age thats more difficult. If we say we have everyone in the Internet up to the age of 100, you have a 1% change of guessing the age - much less than 50%.
    So even if you improve only up to 60%; getting from 1% to 60% is much more than from 50% to 80%.
    Perhaps they mean "guessing the age with a tolerance of 10 years"

  39. Nahhh by Anonymous Coward · · Score: 0

    but you can't spell google with a dollah sign, so that takes the fun out of it... oh godz my head is teh hurt...

    1. Re:Nahhh by jae471 · · Score: 1
      But you can spell it with a pound and euro sign. see: Goog£€. Proof that Google wants to conquer to World!!!!

      Not as good as Micro$oft, though

    2. Re:Nahhh by Anonymous Coward · · Score: 0

      ¥ahoo?

  40. It's now or never for stopping totalitarianism by Anonymous Coward · · Score: 0

    They don't have complete control yet. But both Microsoft and the Bush administration are working as hard as they can to implement complete totalitarianism.

    I say that we need to change governments NOW, before these people succeed in rolling us back to, well, 1984.

    P.S. Posted anonymously because, really who knows how much intrusion Bush has already accomplished. All we know about is what he got caught on. I can only HOPE that it is not too late for the revolution.

    1. Re:It's now or never for stopping totalitarianism by Anonymous Coward · · Score: 0

      Get some help. Now. Mental disorders are nothing to fuck around with.

    2. Re:It's now or never for stopping totalitarianism by Anonymous Coward · · Score: 0

      Right. Because all that spying on citizens, torturing citizens, subverting the Constitution, etc. that Bush is doing... that's all just made up. Right?

    3. Re:It's now or never for stopping totalitarianism by Anonymous Coward · · Score: 0

      Maybe. Maybe not. How would I know? If I did know, would I tell you the truth? If it were the truth, would you know it? You don't want answers for me. Ask the birds. They know.

    4. Re:It's now or never for stopping totalitarianism by DeadChobi · · Score: 1

      He who controls the past controls the future, and he who controls the present controls the past. In a world where all information can be changed at the flick of a switch the only thing you can ever be sure of is that which you witness with your own eyes. Everything outside of yourself can and frequently is rewritten to the whims of whatever regime happens to be in power at the time.

      --
      SRSLY.
  41. I wonder ... by PPH · · Score: 1

    ...what sort of profile they'd get running that on /. posts?

    --
    Have gnu, will travel.
    1. Re:I wonder ... by Bugs42 · · Score: 2, Funny

      99% male

      75% 18-30 years old

      100% single and living in Mom's basement

      Hey, look, I got even more data than MS! I'll take a check now, please.

      --
      Programmer: an ingenious device that converts caffeine into code.
  42. Not a problem by Jessta · · Score: 1

    This doesn't sounds like a problem at all.
    To use it the regime would have to know about other sites the user had visited to input that information in to the algo.
    If they can already uniquely identify a users across multiple sites then they would already know who they were.
    All this is useful for is processing information for marketing after using your phishing detecting(IE7, google toolbar) software to spy on your users.

    --
    ...and that is all I have to say about that.
    http://jessta.id.au
  43. Try visiting planet earth sometime by Freed · · Score: 1

    >So, throwing Microsoft name on the mix only reinforces my point, this submission is nothing but a flamebait

    Nonsense. Microsoft's power and influence alone justifies concern about whatever they do. The same could be said of China. Your point is thus totally lost on me.

  44. Not at all by Anonymous Coward · · Score: 0

    MS has been all to happy to share info with the gov. In fact, MS's and yahoo's sharing of info with the chinese have led to the death of one person already and most likely several others into prison.

    The only thing that MS has not been willing to concede to the gov. has been control of how they do sales.

  45. Using Google while logged in? by sherriw · · Score: 1

    Are there a lot of people who, for example stay logged in to Google while checking email, searching the web, looking up maps etc? I log out every time. No personalized search for me thank you very much. Speaking of... lemme clear my cookies. hehehe

  46. bad URL by ftide · · Score: 1
  47. I' ve got the algorithm by Anonymous Coward · · Score: 0

    gender = 'male'
    age ='16-25'

    I bet it is even better than microsofts :)

  48. Marketing is the bane of our society. by SupremoMan · · Score: 1

    Unfortunately it is TOO effective. As a result, Americans spend way more money than they should. People stoped making informed purchases, they just buy the product of whoever has the most effective ad. This translates 100% into politics. The public no longer care about budget deficits, nor do they care about the candidate's stance on all issues. They are more likely to get behind a candidate with the best ads.

  49. Ron Paul by Anonymous Coward · · Score: 0

    Or... vote for Ron Paul. :)

  50. Repressive regimes are perfect for Microsoft by Anonymous Coward · · Score: 0

    Repressive, one-party political regimes, closed societies are ideal partners for Microsoft. It's the perfect political environment for a monopoly.

  51. In Soviet America, Microsoft has citizen comrades by WillAffleckUW · · Score: 1

    willingly give up all rights of privacy so they can be good citizen comrades in the willing partnership of the Corporate State and the citizen comrades!

    All power to the Rights Fuhrer Bill Gates and Citizen Comrade Chiefs who free us from worry about nasty anonymity!

    --
    -- Tigger warning: This post may contain tiggers! --
  52. Screw Communism, Democracy is more of a Threat! by lordSaurontheGreat · · Score: 1

    Screw communism (no offense, read what I have to say). What do you think Democracies are going to do with this? I can imagine congressmen buying $60,000.00 copies of this product to determine exactly who's in their voting region, match up the IPs on the internet, and then use that to further harass us every time an election comes around! It's madness! At least in Communist states they put you out of your misery and do you in! Here is America they keep you alive so they can keep exploiting you... "It's the worst system of government - except for all others tried."

    --
    Consider yourself spoken to.
  53. Leaked code by Anonymous Coward · · Score: 0

    if ( typeof(url) == porn ) {
        sex = male;
        age = 18;
    }
    if ( url == slashdot )
        assert( sex == male );

  54. "Informative?" by mattgreen · · Score: 1

    Incredible. Thank you for telling me that it isn't OK when Google does things to harm people.

    Can you show me where Microsoft does these things gleefully? Proof would consist of evil cackles recorded in MP3 files. Good luck.

    (I'm no apologist, I only make fun of the groupthink around here.)

    1. Re:"Informative?" by Anonymous Coward · · Score: 0

      "(I'm no apologist, I only make fun of the groupthink around here.)"

      Screw the groupthink, that TWITTER you're responding to. Read his posting history, he succeeds very well at making fun of himself.

  55. On the Internet, Somebody Knows You're a Dog by z-j-y · · Score: 1

    Don't bitch about it.

  56. We should let Google rule the world's information by BSDetector · · Score: 0

    If only Google could save us!!!!

    http://blogs.zdnet.com/micro-markets/?p=701