Slashdot Mirror


Concerns Over Microsoft's Internet User Profiling

jcatcw writes "Microsoft research on Internet user profiling could lead to tools that help repressive regimes identify anonymous dissidents, the Reporters Without Borders advocacy group warned last Friday. Microsoft's new algorithms correctly guessed the gender of a Web surfer 80% of the time, and his or her age 60% of the time. "In China, it is conceivable that this type of technology would be used to spot Internet users who regularly access such 'subversive' content as news and information websites critical of the regime," the group said."

96 of 144 comments (clear)

  1. If I were head of an oppresive regime... by Anonymous Coward · · Score: 4, Funny

    I'd be uneasy about partnering with a bunch of totalitarian control freaks like Microsoft.

    1. Re:If I were head of an oppresive regime... by Anonymous Coward · · Score: 5, Funny

      That said, I reckon I could guess the gender of a web surfer 80% of the time.

      if (site.equals("slashdot.org")) male = true;
      else if (site.equals("cutepuppydogs.com")) male = false;

    2. Re:If I were head of an oppresive regime... by nschubach · · Score: 1, Funny

      Great.. I can see it now:

      Due to a recent trend in programming. Feminist groups have decided they don't like being referred to as !male. They demand rule defined paring of boolean values "male" and "female" where setting male to false would trigger female equality to true. More on this at 11.

      --
      Every time I start to have faith in humanity, I ruin it by driving to work between 7 and 8 am.
    3. Re:If I were head of an oppresive regime... by mrmeval · · Score: 1

      z0MG PONIES

      --
      I'd go on a Vegan diet but the delivery time from Vega is too long. --brownkitty
    4. Re:If I were head of an oppresive regime... by willabr · · Score: 1

      Jezz... what next a Microsoft Van taking pictures of me?

    5. Re:If I were head of an oppresive regime... by Patrik_AKA_RedX · · Score: 1

      Unnecessary. You know those little lights on your electronic equipment? They told you those are just light emitting diodes. What they didn't told you is that those are actually little cameras. And Big Bill is watching you... And I really mean you Willabr. So put on you pants and comb your hair. We, members of the oppresive regime secretly installed in your country, have feelings too, you know.

  2. oh noes! by Anonymous Coward · · Score: 4, Funny

    Unless you happen to be the only 20-25 year old male in China I think you're safe.

    1. Re:oh noes! by Short+Circuit · · Score: 4, Insightful

      Microsoft's new algorithms correctly guessed the gender of a Web surfer 80% of the time, and his or her age 60% of the time. "In China, it is conceivable that this type of technology would be used to spot Internet users who regularly access such 'subversive' content as news and information websites critical of the regime," With that kind if inaccuracy, it could hardly be considered a reliable indicator of identity. It seems much more tuned to generating demographic data.

      Not that China couldn't use that in targeted propaganda...
    2. Re:oh noes! by rapidweather · · Score: 1
      This entire post has just gotten censored (self censorship), I am still hoping for a free copy of Vista from Microsoft for posting "favorable" comments on Slashdot.
      Not their idea, mine. Leaving no stone unturned, I say.
      If I did actually get a free copy of Vista, I'd put it on the living room table right next to the picture of Thomas Jefferson (1743-1826). His picture is on the Two Dollar Bill laying there.

      As you can see, I am horribly bored this evening, but I am doing a test:

      I'm running the linux OS you see in the Screenshots link below on a SanDisk 2 GB Cruzer Micro, partitioned like this:
      SDA1 700 MB for Knoppix (the CD itself)
      SDA5 700 MB for Persistent Home Directory
      SDA6 400 MB for GIMP and K3B swap, additional storage also.
      SDA7 150 MB linux swap.

      Here is the output from TOP:
      Tasks: 42 total, 1 running, 41 sleeping, 0 stopped, 0 zombie
      Cpu(s): 5.5% user, 3.9% system, 0.0% nice, 90.6% idle
      Mem: 385800k total, 218864k used, 166936k free, 6456k buffers
      Swap: 155192k total, 0k used, 155192k free, 130392k cached
      (Uptime not shown, is 2:05)

      This PC is a HP Pavilion 8250, Celeron 267 MHZ.
      To get the USB drive booted up, I use a small HDD with MSDOS 6.21, and these files.
      The small "boot" drive only runs for about 20 seconds, the Menu comes up, you make a choice, then the Linux OS runs entirely from the USB drive. The necessary "loadlin" command line is:
      loadlin vmlinuz initrd=miniroot.gz BOOT_IMAGE=knoppix xmodule=s3virge fromhd=/dev/sda1 home=/dev/sda5

      Browser is Firefox 2.0.0.4

      Thanks for listening...

      Rapidweather

  3. Poor accuracy by mws1066 · · Score: 4, Informative

    80% and 60% are both actually very poor accuracies. I wouldn't be worried; this won't be taken seriously as any type of reliable profiling.

    --
    Nothing is more dangerous than a programmer with a screwdriver.
    1. Re:Poor accuracy by Lockejaw · · Score: 4, Insightful

      I am worried about such low rates. People seem inclined to believe anything a sufficiently large computer says.

      --
      (IANAL)
    2. Re:Poor accuracy by ajanp · · Score: 5, Insightful

      That same day, Erik Bratt, a Microsoft marketing communications manager, fired a preemptive salvo about the company's age- and gender-guessing research. Bratt first downplayed the research results, saying: "[The researchers] actually found that they could not, with a high degree of accuracy, predict age from Web browsing activity." He also swore the Redmond, Wash. developer off using the resulting algorithms. "Microsoft currently has no plans to use the capabilities found through this research in our products and services," Bratt said. So... they tell people that they have the ability to kind of, maybe, sort of, predict age and gender, but that their current algorithms are basically just a bunch of BS. Then they decide to also mention that despite the fact that the foundation for this technology doesn't really work accurately and the have no plans to use it in any products, they're going to continue working on it anyways.

      Microsoft's researchers said they would expand their work to other demographic attributes, such as occupation, educational degree and geographic location I mean, ofcourse that's the logical thing to do so that they could hopefully get it to work more accurately, but why mention that you're even working on the technology if you're going to immediately dismiss it by saying that it doesn't really work.

      Then again, now I guess it solidifies the opinion about why Microsoft is really so sore about the Google-Doubleclick deal http://slashdot.org/article.pl?sid=07/04/16/021720 3. Google's got a leg up on what they have apparently already been working on and now they're at a disadvantage.

      --
      File Deletion is Murder.
    3. Re:Poor accuracy by Turn-X+Alphonse · · Score: 1

      Isn't that worse?

      "We know a guy of 30-35years was looking at anti-freedom/love/justice/whatever material in this net cafe on Thursday at 3pm, would that be you sir? No? Well they all say no at first..."

      While that 20 year old woman is giggling from her window as she beat "the system" yet again.

      At least if it's accurate you're okay if you behave (no I am not supporting it, just saying), where as if it's inaccurate.. well fuck, better odds on a coin flip.

      --
      I like muppets.
    4. Re:Poor accuracy by ShiNoKaze · · Score: 1

      How accurate it is doesn't matter. What matters is if the person who would make money deploying it can pitch it correctly to the people in charge. If so, they'll still try it. China would be the perfect place to start as they don't seem as though they'd be worried about false positives.

    5. Re:Poor accuracy by reddburn · · Score: 3, Interesting

      80% and 60% are both actually very poor accuracies. I wouldn't be worried; this won't be taken seriously as any type of reliable profiling. It depends on the context. If you were talking baseball, it'd be a pretty damn good batting average. It's also good enough to get a warrant, and good enough for a grand jury to indict, and those latter two are what should worry you.
      --
      "Those who believe in telekinetics, raise my hand" - Kurt Vonnegut, Jr.
    6. Re:Poor accuracy by moderatorrater · · Score: 2, Funny

      We need to start actively throwing their algorithm off. If every one of us guys were to start visiting hellokitty.com and mypinkfluffybarbieplayingwithponies.com and all the women were to start reading slashdot, they wouldn't be able to tell us apart!

    7. Re:Poor accuracy by catbutt · · Score: 1

      Regarding the age, how can they say "60%", and how can you say is is not accurate, when the tolerance isn't specified? Do they mean to the year? (I kinda doubt it)

    8. Re:Poor accuracy by acvh · · Score: 1

      this is their first stab. they'll get better at it.

    9. Re:Poor accuracy by MyOtherUIDis3digits · · Score: 3, Funny

      I'm already doing my part. While my surfing habits are definitely male-oriented, I'm sure they would determine me to be 14-16 tops.

      --
      Ignore anything I said above, I actually agree with everything you believe - mod accordingly.
    10. Re:Poor accuracy by denissmith · · Score: 1

      But it could correctly indicate you were 60% likely to be a dissident, which is good enough in most jurisdictions.

      --
      I have nothing to hide. So, why are you spying on me?
    11. Re:Poor accuracy by nschubach · · Score: 1

      ...or a sufficient number of computers called the Internet.

      --
      Every time I start to have faith in humanity, I ruin it by driving to work between 7 and 8 am.
    12. Re:Poor accuracy by DeadChobi · · Score: 1

      There are girls on the internet? Amazing!

      --
      SRSLY.
    13. Re:Poor accuracy by Stefanwulf · · Score: 1
      Baseball's being a defensive sport aside, I'd argue that 80% accuracy should certainly not be good enough for a grand jury. Let's examine a hypothetical case where someone is brought before a grand jury for cime X. The situation:
      • A profiling procedure has been developed which can tell with 80% accuracy if this person is the type of person who would commit crime X.
      • Out of the population at large, let's say X is a very common crime, and 10% of the population has committed it. (In other words P(Crime) = 0.1, and P(!Crime) = 0.9)
      • If a person commits crime X, then the test returns true 80% of the time. (P(Positive|Crime) = 0.8)
      • If a person has not committed crime X, then the test returns true 20% of the time. (P(Positive|!Crime) = 0.2)
      • By bayes theorem, P(Crime|Positive) = (P(Positive|Crime) * P(Crime)) / (P(Positive|Crime) * P(Crime) + P(Positive|!Crime)*P(!Crime))
      • Unless I've made a mistake, this works out to (0.8 * 0.1) / ((0.8 * 0.1) + (0.2 * 0.9)), or roughly 0.308
      • That means that for any given person indicated by the profiling procedure as being a criminal, there is about a 70% chance that they did not do it, and only roughly a 30% chance that they did.
      I would certainly hope that presenting a grand jury with only a 30% chance of guilt would not be enough (on its own) to indict. And please keep in mind that we are assuming a very high proportion of the population is guilty here. In cases where less than 10% of the population has committed the crime, then the liklihood of a false positive for an individual case would be even higher.
    14. Re:Poor accuracy by rtb61 · · Score: 1

      When it comes to data poisoning the best way is an automated extension that just distorts your profile by mixing it with others, when done properly we all end up looking like one homogeneous mass. From junk data embedded in emails, to fake search routines and even random background browsing across a whole range of sites, no matter how much data their privacy invasive computers can store, hundreds of millions of users can create even more random worthless data and flood them under. What is cool is as their capability to handle more data increases so capability of generating more random 'suspect' crap also increases (when we are all guilty who will they suspect).

      --
      Chaos - everything, everywhere, everywhen
    15. Re:Poor accuracy by Stefanwulf · · Score: 1

      Thanks, that's a really good summary.

  4. If you were a totalitatian regime by supersnail · · Score: 4, Insightful

    ... wouldn't it be easier to look up the IP address and persuade the ISP to hand over the user details?

    The old ways are often the best.

    --
    Old COBOL programmers never die. They just code in C.
    1. Re:If you were a totalitatian regime by voislav98 · · Score: 1

      Or block access to the websites in question? It's going to be a cold day in Hell when a totalitarian regime starts relying on Microsoft to do the spying for them.

    2. Re:If you were a totalitatian regime by ozmanjusri · · Score: 2, Interesting
      It's going to be a cold day in Hell when a totalitarian regime starts relying on Microsoft to do the spying for them.

      You reckon? http://www.washingtonpost.com/wp-dyn/content/artic le/2007/01/08/AR2007010801352.html

      --
      "I've got more toys than Teruhisa Kitahara."
    3. Re:If you were a totalitatian regime by monk.e.boy · · Score: 2, Funny

      Hey! I live in the UK, where they can just look through my window with one of the ubiquitous CCTV cameras and just watch me browse the net...

      ...monitor my books, my mail...

      ...the rate at which I scratch my arse (there are by-laws that I won't go into, but let's just say if I do it more than 5 times in an hour and don't immediately go see the doctor, the para's turn up)...

      Microsoft are waaay behind.

      monk.e.boy

    4. Re:If you were a totalitatian regime by MrMr · · Score: 1

      But that is the complete opposite; In that report we have Microsoft relying on a totalitarian regime to do the spying for them...

  5. more useless statistics by pytheron · · Score: 1

    What percentage of net-users are male ? female ? If 80% are male, and the algorithm just guessed Male all the time, unless they bought 'ladies items' online, then they would have a pretty good accuracy. Age group usage would represent a bell graph, not difficult to again skew your results to favourably reflect on your algorithm.
    Unless of course these results were made under strict scientific obervance and imparitiality.. nah !

    --
    "I am not bound to please thee with my answers" [William Shakespeare]
    1. Re:more useless statistics by Conspiracy_Of_Doves · · Score: 1

      Actually, more women than men use the internet.

    2. Re:more useless statistics by PsEvo · · Score: 1

      What about all those men buying 'ladies items' :O

      --
      "ATI cards are like buses...They're huge, red and have bad drivers."
    3. Re:more useless statistics by pytheron · · Score: 1

      Only since they figured out they can get at their partners money via online banking

      --
      "I am not bound to please thee with my answers" [William Shakespeare]
    4. Re:more useless statistics by morgan_greywolf · · Score: 2, Informative

      Actually, no. More women under the age of 65 than their male counterparts use the Internet, according to Pew Internet. If you take into account all women and men of all age groups, then women still trail men by a few percentage points.

      If anything, you could say that men and women use the Internet about equally.

      Now, I'm almost positive that at least 80% of Slashdot readers are male, though there are an increasing number of females on this site. Many of them I think hide and don't reveal that they are female, so the statistics might be different than most of us suspect.

    5. Re:more useless statistics by Conspiracy_Of_Doves · · Score: 1

      I stand corrected

  6. I'm unimpressed by Otter · · Score: 1
    Microsoft's new algorithms correctly guessed the gender of a Web surfer 80% of the time, and his or her age 60% of the time.

    Link to paper. I don't claim to be knowledgeable about this stuff but that success rate doesn't look too remarkable to me. China's sex ratio is hardly so skewed (yet, anyway) that this could remotely identify someone from a pool of a billion users, or even out of a single Internet cafe.

    I'd wonder more about the quality of research Microsoft is getting out of their Beijing site if they think this worth bragging about.

  7. Here's my algorithm: by erroneous · · Score: 5, Funny

    If you read this post you are probably male. (80% correct)
    You are probably in the 20-35 age group. (60% correct)

    (I know I'll only get negative responses to this post of the type "I'm reading this and I'm a 47-year-old woman!" That's Ok. You're in the other bracket.)

    My algorithm is as good as Microsoft's. Can I have a research grant please?

    --
    erroneous: look me up in a dictionary
    1. Re:Here's my algorithm: by spellraiser · · Score: 4, Funny

      You got me pegged. Curse you!

      Can I have my privacy violation lawsuit settlement, please?

      --
      I hear there's rumors on the Slashdots
    2. Re:Here's my algorithm: by dcollins · · Score: 1

      Also, the time is currently 11:45. (Correct 2 times per day.)

      --
      We know where leadership by an anti-intellectual "strongman" who scapegoats minorities and likes boisterous rallies goes
    3. Re:Here's my algorithm: by PhxBlue · · Score: 1

      "I'm reading this and I'm a 47-year-old woman!"

      As a 48-year-old grandmother, I'm offended that you called me 47!

      Actually ... I don't think that Slashdot-ism works here. Go figure. :)

      --
      !#@%*)anks for hanging up the phone, dear.
    4. Re:Here's my algorithm: by everphilski · · Score: 1

      sure, it works for slashdot, but does it work for a site dedicated to something more neutral, like say migraine headaches (predominantly female, 75/25 maybe) or cnn.com? Things that are somewhat gender neutral?

      I know, you were trying to be funny, but if this thing works across the board, 4 out of 5 positive ID's ain't bad.

    5. Re:Here's my algorithm: by Anonymous Coward · · Score: 1, Funny

      Are you trying to say that Slashdot is made up of mostly males!? And all this time I've been thinking this was a great place to meet chicks.

  8. Slashdot already has this feature by giafly · · Score: 2, Funny

    Microsoft's new algorithms correctly guessed the gender of a Web surfer 80% of the time, and his or her age 60% of the time.
    Just include {MERGE GENDER} and {MERGE AGE} in the comment area. For example...

    Your gender is ... MALE
    Your age is ... 18-30
    --
    Reduce, reuse, cycle
    1. Re:Slashdot already has this feature by messner_007 · · Score: 1

      A agree that most of slashdoters are male, but I don't agree that most of them are younger than 30.

      male > 98 %

      age 18-30 > 53 %

    2. Re:Slashdot already has this feature by MyOtherUIDis3digits · · Score: 1

      A agree that most of slashdoters are male, but I don't agree that most of them are younger than 30.
      male > 98 %
      age 18-30 > 53 %


      Looks like we have our next Slashdot poll...

      --
      Ignore anything I said above, I actually agree with everything you believe - mod accordingly.
    3. Re:Slashdot already has this feature by messner_007 · · Score: 1

      Pools are not accurate enough and slashdoters are known for being honest when they have to enter the information about themselves ...

      - 5-20
      - 20 - 40
      - CowboyNeal years
      - ....

    4. Re:Slashdot already has this feature by treeves · · Score: 1

      . . .I don't agree that most of them are younger than 30.

      age 18-30 > 53 %

      Well, which is it?

      --
      ...the future crusty old bastards are already drinking the Kool-Aid.
    5. Re:Slashdot already has this feature by lakeland · · Score: 1

      http://slashdot.org/pollBooth.pl?qid=421&aid=-1
      http://slashdot.org/pollBooth.pl?qid=406&aid=-1

      I don't know what the accuracy rating in /. polls is, or if it is a function of the poll, so I don't know how much salt you'll have to take those results with.

    6. Re:Slashdot already has this feature by 2names · · Score: 1

      Wouldn't "greater then 53%" constitute "most?"

      --
      "I'm just here to regulate funkiness."
    7. Re:Slashdot already has this feature by 2names · · Score: 1

      Sorry, I meant "greater than"

      --
      "I'm just here to regulate funkiness."
  9. Follow the Chinese! by snowraver1 · · Score: 1

    Why don't we just enter out SSN every time we browse the web, and we can avoid all this 60-80% nonsense All sites visited would be logged in a centeral database, and would be used to deliver targed advertising. Just think! Based on my browsing habits, every website will look like torrentspy.com.

    --
    Copyright 2010. All rights reserved. This comment may not be copied in any way including, but not limited to caching.
  10. Not too bad for now... by taupin · · Score: 5, Insightful

    Printable Version

    Right now this doesn't worry me too much - after all, how much "identification of anonymous dissidents" could someone do based only on one's gender and a rough estimate of age? On the other hand, if Microsoft do expand to geographical location, occupation, and educational degree as mentioned, then it's rather worrying.

  11. Collecting and storing the data is what matters by Ravnen · · Score: 1

    This is another example of why it's important to ensure that corporations aren't allowed to collect and store huge amounts of data about individuals. The fact that they can analyse it in some way or another is irrelevant if privacy is respected in the first place.

  12. O Rly? by vivaoporto · · Score: 5, Insightful

    First things first: why China? (The same question applies to Venezuela, Russia, Brazil or whatever is the target of the Slashdot "fifteen minutes of hate" of the day). Of course people should be concerned about what these countries do wrt losses of privacy and basic rights, but what about U.S. and E.U.? As we talk, they are working on a new agreement to share data from passengers on trans-Atlantic flights, a much more effective way to profile people, because it contains name, address, gender, destination, credit card number, everything, without needing to make any kind of assumption, everything is plain and clear. This is why I think that not only "in China", as the summary states, but in most countries in the world, this information can and will be used to tag people indiscriminetaly, subversive or not, terrorist or not, law abiding or not. So, take care of your own backyard before to point the poison ivy in your neighbor one.

    Second, it is not like if Microsoft was the only one researching and developing on this field and, more than that, it is not like if Microsoft was not researching on this field, any government interested on this kind of technology would not research itself, or fund research on its public universities. So, throwing Microsoft name on the mix only reinforces my point, this submission is nothing but a flamebait, being the flame targets the usual suspects, proprietary software and communism.

    1. Re:O Rly? by bahstid · · Score: 3, Funny

      think of the child^H^H nese!

    2. Re:O Rly? by moderatorrater · · Score: 1

      Agreed. In addition, China's already shown a propensity for messing with the internet, so they could just institute national policies that directly link your activities with your name. I know it wouldn't be perfect and that you could get around it, but it would be a whole lot better than profiling by surfing habits.

      The real places this could be used are in countries that have enough freedom to not be able to meddle at a base level, but that have or desire a high degree of surveillance on their citizens (Great Britain works with CCTV, the NSA probably already does it). Whether you should worry about it or not is another issue entirely...

    3. Re:O Rly? by Prune · · Score: 1

      Funny that communism and proprietary software would be the 'usual suspects' as you say, given that the opposite of proprietary software, open source, is an ideology not that unfitting in a communist framework.

      --
      "Politicians and diapers must be changed often, and for the same reason."
    4. Re:O Rly? by westlake · · Score: 1
      As we talk, they are working on a new agreement to share data from passengers on trans-Atlantic flights, a much more effective way to profile people, because it contains name, address, gender, destination, credit card number, everything, without needing to make any kind of assumption, everything is plain and clear.

      In short, everything that an international traveler has had to disclose to authorities since the beginning of the modern era.

      Identity. Citizenship. Financial responsibility. No legal barriers to travel. No medical barriers to travel. Etc.

    5. Re:O Rly? by Hatta · · Score: 2, Funny

      China is not in the same league as any nation in the west. China is a brutal, oppressive regime.

      You're right, the US is a kinder, gentler, oppressive regime.

      --
      Give me Classic Slashdot or give me death!
    6. Re:O Rly? by Hatta · · Score: 2, Interesting

      First things first: why China?

      Exactly, there's just as much reason to be afraid of these things in oppressive first world regimes. The US government is already getting some websites to turn over their user lists. How long until they talk Microsoft or Google into giving them access to their data mining facilities? If they can get 80% accuracy, well that's probable cause, and that will get them a warrant for a search. Eventually we'll all have to be very careful what we search for, lest we end up on a list somewhere.

      --
      Give me Classic Slashdot or give me death!
    7. Re:O Rly? by JasonTik · · Score: 1

      80% accuracy of a given demographic is not probable cause to search anything. 80% accuracy for the selection of an individual is ok, but for a demographic is not enough to search every member of that demographic (or any selected members) (yet)

  13. BFG by m0nkyman · · Score: 1

    What? You think that the government of China doesn't have the resources or smarts to do this research and development themselves? Come on, shutting down research because the Chinese government might use it badly is very, very silly.

    --
    ~ a low user id is no indication I have a clue what I'm talking about.
    1. Re:BFG by gEvil+(beta) · · Score: 1

      And what does this have to do with guns (or giants, depending on how you interpret the acronym)? I only ask because you brought it up...

      --
      This guy's the limit!
  14. Tradeoff by Belacgod · · Score: 1

    Any technology that reduces anonymization could have this effect. It's a tradeoff--ease of mobiliy for dissidents vs eliminating obnoxious assholes. So every time you have to suffer through a troll, you're protecting freedom-loving Chinese!

  15. Waste of my time, your time, their time.... by styryx · · Score: 1

    So now in addition to Tor et al and the things that help privacy (sending Google random data as search queries) all we will have to do is have something in the background opening up male/female sites over all popular age ranges. Way to have to cream everyone's bandwidth. Sheesh, is there anything you CAN get right, MicroJerk?

    Profiling is akin to racism in my book. It's against democracy any way you look at it.

  16. Priorities for Reporters without Borders by Nymz · · Score: 1

    I realize the concept that this software, if it became more accurate, could be used by repressive regimes against their citizens. But as far a priorities go, I think they would do better to concentrate on bringing attention to human rights violations, and educatiting people about the rule-of-law.

  17. Re:Google by mattgreen · · Score: 1

    No, see, it's okay when Google does it, because they said they would "do no evil." And we believe them! They're a big company whose name isn't Microsoft, so we automatically love them without questioning!

  18. The Usual Suspects by blueZhift · · Score: 3, Interesting

    This reminds me of the scenes in Casablanca where the police are told to round up the usual suspects. Ultimately the accuracy doesn't matter to the government anyway. Worldwide, I think we are moving in a direction of less freedom rather than more, spearheaded by wrong-headed anti-terrorism hysteria in the US. So why should they care about accuracy, they'll just round up whoever fits the profile and sort it out later, or not.

  19. Let me tell you... by PsEvo · · Score: 1

    "Our new algorithm you ask? Well, we took a look at the MSN search log and counted the ratio of keywords such as 'boobs' and 'anal'." Considering the various interests and beliefs of men and women, no algorithm could accurately guess a gender, even age.

    --
    "ATI cards are like buses...They're huge, red and have bad drivers."
  20. Extension time methinks by ajs318 · · Score: 2, Insightful

    What about somebody writing a browser extension that performs bogus searches in the background, for no better reason than to frustrate "profiling" attempts? Is this feasible?

    --
    Je fume. Tu fumes. Nous fûmes!
    1. Re:Extension time methinks by alexhs · · Score: 3, Informative

      What about somebody writing a browser extension that performs bogus searches in the background, for no better reason than to frustrate "profiling" attempts? Is this feasible? Already done
      --
      I have discovered a truly marvelous proof of killer sig, which this margin is too narrow to contain.
  21. Article would be more useful if.... by CodeShark · · Score: 1

    They detailed how much of the information they gather is from using MS related software such as Internet Explorer, etc.

    Which is to say, how much info can be gathered using a non Ms browser such as Firefox with a Non MS operating system such as Apple or Linux, and avoiding non MS- dominated web sites?

    The more important questions are a)what extent is this important to free societies and breaking the grip of totalitarian regimes on their societies?, and b) to what extent do we as memebers in free societies need to revolt against "corporate society" and their accelerating tendency to glom together data about private individuals??

    --
    ...Open Source isn't the only answer -- but it's almost always a better value than the alternatives...
    1. Re:Article would be more useful if.... by westlake · · Score: 1
      how much info can be gathered using a non Ms browser such as Firefox with a Non MS operating system such as Apple or Linux, and avoiding non MS- dominated web sites?

      Pretty much everything anyone could want.

      When you click on a prostate cancer site it isn't hard to guess your likely age and gender.

    2. Re:Article would be more useful if.... by CodeShark · · Score: 1

      True. I guess the better question would have been "how much of the statistics are derived from M$ server software and/or M$ client software -- because a server can always collect information at an extreme level.

      --
      ...Open Source isn't the only answer -- but it's almost always a better value than the alternatives...
  22. Fuzz about nothing by Rob+Kaper · · Score: 1

    I don't get it. No matter what age-group/gender combination you think of, even combined with geodata and occupation/education levels, this doesn't even come close to identifying individuals. Unless you actually believe "a male farmer in his 30s in the Shanghai region" or "a female grandmother in the suburbs of Houston" is significantly detailed.

    It makes more sense to worry about accidently getting linked to personal details left in instant messaging, e-mail, community profiles and/or conversations.

  23. Ok boys - Time to visit that Tampon manufacturer.. by sjwest · · Score: 1

    Well you you why...

    ps: Humour

  24. I know who it is! by Phat_Tony · · Score: 4, Funny

    Right, so they're about 50% sure that it's someone who's both male and aged 24-30, living in China. It should be easy to pick out the individual from there.

    --
    Can anyone tell me how to set my sig on Slashdot?
    1. Re:I know who it is! by Oswald · · Score: 1

      Or it would be, if only they knew his hair color.

  25. US and Chineese Methods by twitter · · Score: 1

    wouldn't it be easier to look up the IP address and persuade the ISP to hand over the user details?

    Requiring the ISP to keep records with "wiretapping" laws and then getting the details is the US method. Farming out the collation of records to a company like Choice Point goes beyond the laws and is both cheap and efficient.

    In China, the regime is the ISP and they have the best equipment and methods that US companies could provide.

    --

    Friends don't help friends install M$ junk.

  26. M$ Apologists by twitter · · Score: 2, Informative

    No, it's not OK when Google does things that might be used to harm people who have done nothing wrong.

    This thread, however, is about the much nastier things that M$ does gleefully. I'd enjoy it if you compared the details.

    --

    Friends don't help friends install M$ junk.

  27. can't detect age - only maturity ... by petes_PoV · · Score: 2, Insightful

    ... which is a completely different and only slightly correlated attribute.

    --
    politicians are like babies' nappies: they should both be changed regularly and for the same reasons
  28. Strange age guessing by saibot834 · · Score: 1

    Microsoft's new algorithms correctly guessed the gender of a Web surfer 80% of the time, and his or her age 60% of the time.

    A bit strange in my opinion. "Guessing the gender correctly" has already 50% even if you don't have any data about the user. So there is not much improvement here.
    But the age... if you really guess the age thats more difficult. If we say we have everyone in the Internet up to the age of 100, you have a 1% change of guessing the age - much less than 50%.
    So even if you improve only up to 60%; getting from 1% to 60% is much more than from 50% to 80%.
    Perhaps they mean "guessing the age with a tolerance of 10 years"

  29. Re:Nahhh by jae471 · · Score: 1
    But you can spell it with a pound and euro sign. see: Goog£€. Proof that Google wants to conquer to World!!!!

    Not as good as Micro$oft, though

  30. I wonder ... by PPH · · Score: 1

    ...what sort of profile they'd get running that on /. posts?

    --
    Have gnu, will travel.
    1. Re:I wonder ... by Bugs42 · · Score: 2, Funny

      99% male

      75% 18-30 years old

      100% single and living in Mom's basement

      Hey, look, I got even more data than MS! I'll take a check now, please.

      --
      Programmer: an ingenious device that converts caffeine into code.
  31. Not a problem by Jessta · · Score: 1

    This doesn't sounds like a problem at all.
    To use it the regime would have to know about other sites the user had visited to input that information in to the algo.
    If they can already uniquely identify a users across multiple sites then they would already know who they were.
    All this is useful for is processing information for marketing after using your phishing detecting(IE7, google toolbar) software to spy on your users.

    --
    ...and that is all I have to say about that.
    http://jessta.id.au
  32. Try visiting planet earth sometime by Freed · · Score: 1

    >So, throwing Microsoft name on the mix only reinforces my point, this submission is nothing but a flamebait

    Nonsense. Microsoft's power and influence alone justifies concern about whatever they do. The same could be said of China. Your point is thus totally lost on me.

  33. Using Google while logged in? by sherriw · · Score: 1

    Are there a lot of people who, for example stay logged in to Google while checking email, searching the web, looking up maps etc? I log out every time. No personalized search for me thank you very much. Speaking of... lemme clear my cookies. hehehe

  34. bad URL by ftide · · Score: 1
  35. Marketing is the bane of our society. by SupremoMan · · Score: 1

    Unfortunately it is TOO effective. As a result, Americans spend way more money than they should. People stoped making informed purchases, they just buy the product of whoever has the most effective ad. This translates 100% into politics. The public no longer care about budget deficits, nor do they care about the candidate's stance on all issues. They are more likely to get behind a candidate with the best ads.

  36. In Soviet America, Microsoft has citizen comrades by WillAffleckUW · · Score: 1

    willingly give up all rights of privacy so they can be good citizen comrades in the willing partnership of the Corporate State and the citizen comrades!

    All power to the Rights Fuhrer Bill Gates and Citizen Comrade Chiefs who free us from worry about nasty anonymity!

    --
    -- Tigger warning: This post may contain tiggers! --
  37. Screw Communism, Democracy is more of a Threat! by lordSaurontheGreat · · Score: 1

    Screw communism (no offense, read what I have to say). What do you think Democracies are going to do with this? I can imagine congressmen buying $60,000.00 copies of this product to determine exactly who's in their voting region, match up the IPs on the internet, and then use that to further harass us every time an election comes around! It's madness! At least in Communist states they put you out of your misery and do you in! Here is America they keep you alive so they can keep exploiting you... "It's the worst system of government - except for all others tried."

    --
    Consider yourself spoken to.
  38. Re:It's now or never for stopping totalitarianism by DeadChobi · · Score: 1

    He who controls the past controls the future, and he who controls the present controls the past. In a world where all information can be changed at the flick of a switch the only thing you can ever be sure of is that which you witness with your own eyes. Everything outside of yourself can and frequently is rewritten to the whims of whatever regime happens to be in power at the time.

    --
    SRSLY.
  39. "Informative?" by mattgreen · · Score: 1

    Incredible. Thank you for telling me that it isn't OK when Google does things to harm people.

    Can you show me where Microsoft does these things gleefully? Proof would consist of evil cackles recorded in MP3 files. Good luck.

    (I'm no apologist, I only make fun of the groupthink around here.)

  40. On the Internet, Somebody Knows You're a Dog by z-j-y · · Score: 1

    Don't bitch about it.