Slashdot Mirror


Dataland: the Emerging Dystopia

An anonymous reader writes "Winston Smith, the protagonist of George Orwell's novel 1984, resorted to hiding the bushes with his lover in a failed attempt to escape the government's ubiquitous surveillance. Orwell was concerned with totalitarianism and explicit thought control enforced by police action. While that is still very much an issue for many of the world's residents, here in the West there is an unsettling feeling about a more subtle form of thought manipulation, as more and more of our activities are watched, cataloged, and analyzed by more and more institutions — governments, businesses, non-profits, political parties, mostly for predictive purposes. At least we have a name for it now: 'Dataland', a term suggested by Kate Crawford of Microsoft Research, who studies the sociological effects of networking technologies. Crawford has been written up in Slashdot before. She's criticized the indiscriminate adoption of Big Data analytics on several grounds, including the loss of anonymity, erroneous conclusions from skewed datasets, and the prospect of secret discrimination."

52 of 81 comments (clear)

  1. Dataland or: How I Learned to Stop Worrying... by laejoh · · Score: 2

    I, for instance, follow the advice of gman003, me too I am an ordained minister in Norse Paganism (Reformed), a registered card-carrying Communist, a decorated veteran of the Third Punic War, the second in line to the throne of Emperor Norton I, and the true assassin of Archduke Ferdinand. Big data gets pretty useless once it's full of nonsense.

    1. Re:Dataland or: How I Learned to Stop Worrying... by Samantha+Wright · · Score: 4, Insightful

      Sadly not; at best you can only exclude yourself from certain demographics. That does you no good if they're looking for those demographics. The genie isn't back in the bottle.

      At worst, the category "random/unclassifiable" gets flagged as suspicious in itself. (And no points for being an avowed Communist, even if you are reverent towards the Protector of Mexico.)

      --
      Bio questions? Ask me to start a Q&A journal. Computer analogies available for most topics!
    2. Re:Dataland or: How I Learned to Stop Worrying... by Samantha+Wright · · Score: 1

      ...in fact, on that point, you may've gotten yourself flagged as an anarchist already. Smooth.

      --
      Bio questions? Ask me to start a Q&A journal. Computer analogies available for most topics!
    3. Re:Dataland or: How I Learned to Stop Worrying... by Gen_Music · · Score: 1

      Unfortunately, due to the fact that you are speaking in English in 2013 on the internet, the possible chances of that are 0.

      And yes, I can write a program that could work that out for everyone's statements so I'm sure as hell that the NSA could, especially with all the rest of the accounts you have attached to your email address and social networks to correlate with..

    4. Re:Dataland or: How I Learned to Stop Worrying... by HiThere · · Score: 1

      What are the tenets of Reformed Norse Paganism? Are they related to Reformed Druidism?

      --

      I think we've pushed this "anyone can grow up to be president" thing too far.
    5. Re:Dataland or: How I Learned to Stop Worrying... by citizenr · · Score: 1

      No. Actually you are an outlier, your record sends all kinds of red flags.

      --
      Who logs in to gdm? Not I, said the duck.
    6. Re:Dataland or: How I Learned to Stop Worrying... by Jah-Wren+Ryel · · Score: 3, Interesting

      Sadly not; at best you can only exclude yourself from certain demographics.

      I've come to the conclusion that the best way (as an individual) to handle this sort of thing is to create personas for different contexts. You'll need fake ids, but you won't be using them for anything technically illegal (no fraud, no underage drinking). You just show them to people/systems that want the info to track you - like loyalty cards (that you then only use with cash).

      That way you end up with a handful of distinct personas that all have data trails but only have data trails in specific contexts so that cross-referencing is impossible.

      --
      When information is power, privacy is freedom.
    7. Re:Dataland or: How I Learned to Stop Worrying... by Anonymous Coward · · Score: 3, Interesting

      Are you sure that cross checking is impossible?

      https://panopticlick.eff.org/

      And that's just one example. Truly big data will be essentially impossible to hide from completely. It doesn't need to reach a 100% positive result before people start treating it like it is, and that's only one possible problem that we should fully expect to arise from this.

      Here's another that could make your idea less effective as well:

      http://www.nytimes.com/2013/07/15/business/attention-shopper-stores-are-tracking-your-cell.html?pagewanted=all&_r=0

    8. Re:Dataland or: How I Learned to Stop Worrying... by Jah-Wren+Ryel · · Score: 1

      Are you sure that cross checking is impossible?

      Make that "out of the ordinary" - if cross-referencing for different personas is not the normal use of these databases, then the decision to try to cross-reference is going to be an extra-ordinary situation which will require an extra-ordinary amount of effort.

      Of course, the more people who do adopt personas, the more likely it is that Big Data will come up with standard tools to accomplish that cross-referencing.

      On the other hand, if it gets to the point where so many people are doing that sort of thing, that means the problems have become so well-known that there is probably enough political will to change the system through law.

      --
      When information is power, privacy is freedom.
    9. Re:Dataland or: How I Learned to Stop Worrying... by Enter+the+Shoggoth · · Score: 1

      Sadly not; at best you can only exclude yourself from certain demographics.

      I've come to the conclusion that the best way (as an individual) to handle this sort of thing is to create personas for different contexts. You'll need fake ids, but you won't be using them for anything technically illegal (no fraud, no underage drinking). You just show them to people/systems that want the info to track you - like loyalty cards (that you then only use with cash).

      That way you end up with a handful of distinct personas that all have data trails but only have data trails in specific contexts so that cross-referencing is impossible.

      I agree somewhat however you might want to check on the legality of this in some jurisdictions.

      --
      Andy Warhol got it right / Everybody gets the limelight
      Andy Warhol got it wrong / Fifteen minutes is too long.
    10. Re:Dataland or: How I Learned to Stop Worrying... by fisted · · Score: 1

      Oh, look. It's /this/ idiot again.
      Protip: Tick AC box next time.

  2. Bushes? by rudy_wayne · · Score: 1

    "Winston Smith . . . . . resorted to hiding the bushes with his lover "

    I don't remember any bushes in that story.

    1. Re:Bushes? by Anonymous Coward · · Score: 5, Funny

      "Winston Smith . . . . . resorted to hiding the bushes with his lover "

      I don't remember any bushes in that story.

      You know, neither did I, so I checked and there was a new paged taped in with bushes in the story. I guess I was wrong, there were always bushes in the story.

    2. Re:Bushes? by hutsell · · Score: 1

      "Winston Smith . . . . . resorted to hiding the bushes with his lover "

      I don't remember any bushes in that story.

      You know, neither did I, so I checked and there was a new paged taped in with bushes in the story. I guess I was wrong, there were always bushes in the story.

      Congratulations citizen; you have at last achieved a victory over yourself and will begin to realize that all is well now.

      --
      Yesterday's Weirdness is Tomorrow's Reason Why
    3. Re:Bushes? by SeaFox · · Score: 4, Funny

      "Winston Smith . . . . . resorted to hiding the bushes with his lover "

      I don't remember any bushes in that story.

      Exactly. Because Winston hid them, you never saw them.

    4. Re:Bushes? by Virtucon · · Score: 1
      --
      Harrison's Postulate - "For every action there is an equal and opposite criticism"
    5. Re: Bushes? by echnaton192 · · Score: 1

      Clever. Chapeau!

    6. Re: Bushes? by ExFCER · · Score: 1

      Mod up +1

  3. Re:Microsoft by Chompjil · · Score: 1

    It has begun, no one can stop it now

    --
    People once told me 68K ram was all we needed,
  4. Predictive purposes? by m00sh · · Score: 4, Informative

    Reminds me of Google's data collection on its hard disk failures and hiring good programmers.

    They couldn't find any sort of predictive factor. GPA, brain teasers etc had zero correlation. There was no hiring person that had statistically better performance at hiring good programmers.

    There are some things that are just random.

    Perhaps being able to predict accurately is the flying car of our generation. Or, perhaps some will say the answer is more data.

    1. Re:Predictive purposes? by rudy_wayne · · Score: 2, Insightful

      Reminds me of Google's data collection on its hard disk failures and hiring good programmers.

      They couldn't find any sort of predictive factor. GPA, brain teasers etc had zero correlation. There was no hiring person that had statistically better performance at hiring good programmers.

      People still cling to the idea of using the past to predict the future.

      Years ago I knew a guy who played the lottery a lot. He kept a list of all the previous winning numbers and spent countless hours studying the numbers looking for patterns that would allow him to predict future winning numbers. It never worked.

      All of this data collection is essentially the same thing.

    2. Re:Predictive purposes? by Anonymous Coward · · Score: 2, Interesting

      > He kept a list of all the previous winning numbers and spent countless hours studying the numbers looking for patterns that would allow him to predict future >winning numbers. It never worked.

      Amazing.....that is the exact job description of a stock analyst.

    3. Re:Predictive purposes? by Mirey · · Score: 5, Insightful

      It's not really the same. Human behaviour is not inherently random. The lottery is. If I've bought a coffee every day for the last year, it's quite likely I'll buy one tomorrow. I thought everyone knew about Bayes?

    4. Re:Predictive purposes? by Samantha+Wright · · Score: 4, Insightful

      Well, no; lottery numbers are known to be random. With human behaviour you at least have the underlying assumption that there are habits being picked up on. If all big data studies were as fruitless as your friend, the investments into the necessary infrastructure and algorithms wouldn't have made it nearly as far as they have. They do, however, find a lot of stupid correlations.

      But much more importantly, the desire to find these correlations is potentially profound in its ability to damage society. The whole scheme is an effort to cheat the normal boundary of personal space in order to optimize business and surveillance efficiency. If this erosion spreads into everyday interactions between people, it'll be the end of trust. To fix it, we'd need who-knows-how-many Hollywood blockbusters about noble savages re-teaching the West how to act like decent human beings.

      Perhaps if these businesses and government agencies were more willing to act like your friend and actually accept that life involves risk, we wouldn't be heading down this slippery slope so quickly.

      --
      Bio questions? Ask me to start a Q&A journal. Computer analogies available for most topics!
    5. Re:Predictive purposes? by amaurea · · Score: 5, Insightful

      "Using the past to predict the future" is what we usually call "learning". Even goldfish and flies to it, and it has brought us all our science and technology. Why do people exit the door at the ground floor rather than windows 5 stories up? Because past experiences has taught us that things fall down, and that falling far is harmful. Why do you type words rather than random chains of letters? Because you predict from past data that people in the future will be able to read and understand them. Even the fact that lottery numbers are impossible to predict is a prediction about the future we make based on physical understanding (which we have learnt from data from the past) coupled with data about how the lottery process works.

      You probably didn't mean to make as strong a statement as what you did but you basically said the single most anti-intellectual thing is is possible to say.

    6. Re:Predictive purposes? by PracticalM · · Score: 1

      http://www.forbes.com/sites/kashmirhill/2012/02/16/how-target-figured-out-a-teen-girl-was-pregnant-before-her-father-did/ ...says no. When people got creeped out by the pregnancy targeted ads, they mixed them with random coupons and people used the pregnancy coupons.

    7. Re:Predictive purposes? by HiThere · · Score: 1

      It does (used to) work in Roulette. But not well enough to beat the house. (It had to do with uneven wear on the support of the wheel. And it was specific to each individual wheel. I think they eliminated that possibility by occasionally polishing the bearings, or perhaps it was the axel.)

      --

      I think we've pushed this "anyone can grow up to be president" thing too far.
    8. Re:Predictive purposes? by radarskiy · · Score: 1

      "People still cling to the idea of using the past to predict the future."

      Do you think the sun will rise in the east tomorrow?

    9. Re:Predictive purposes? by m00sh · · Score: 3, Insightful

      It's not really the same. Human behaviour is not inherently random. The lottery is. If I've bought a coffee every day for the last year, it's quite likely I'll buy one tomorrow. I thought everyone knew about Bayes?

      It is not fully predictive though.

      One day, for whatever reason, you will stop buying coffee. That approximate day that will happen is not predictable because it is random.

      So, you buying coffee tomorrow is quite predictable. You buying coffee 5 years from now is not.

    10. Re:Predictive purposes? by m00sh · · Score: 1

      Well, no; lottery numbers are known to be random. With human behaviour you at least have the underlying assumption that there are habits being picked up on. If all big data studies were as fruitless as your friend, the investments into the necessary infrastructure and algorithms wouldn't have made it nearly as far as they have. They do, however, find a lot of stupid correlations.

      But much more importantly, the desire to find these correlations is potentially profound in its ability to damage society. The whole scheme is an effort to cheat the normal boundary of personal space in order to optimize business and surveillance efficiency. If this erosion spreads into everyday interactions between people, it'll be the end of trust. To fix it, we'd need who-knows-how-many Hollywood blockbusters about noble savages re-teaching the West how to act like decent human beings.

      Perhaps if these businesses and government agencies were more willing to act like your friend and actually accept that life involves risk, we wouldn't be heading down this slippery slope so quickly.

      This is the problem. People think there is a pattern in everything when there are things that are random. Finding surprising correlations here and there is completely different from being all-predictive.

    11. Re:Predictive purposes? by Anonymous Coward · · Score: 1

      I predict the sun will rise in the horizon tomorrow.

      And no, I'm not interested in arguing semantics. Somehow, statistics is used in science to establish positive tests. Ever heard of "outside 3 standard deviations"?

      Statistics cannot 100% predict individual outcomes, but in many cases, the averages of yesterday will be similar to the averages tomorrow.

      Anyways, statistics is old. Agent models and simulating potential outcomes from different potential agent actions and mixes of such, is where it begins to become interesting.

    12. Re:Predictive purposes? by Dr+Max · · Score: 1

      I don't play the lottery, but you can win. Trick is to pick all the unlucky numbers (you don't predict the result but you can predict the human side of the equation) this way you don't have to split any winnings, because nobody else uses those numbers. A guy proved it works using past lottery results, and pretending to buy the unlucky numbers in bulk, after 15 years of past results he came out on top by around $100 000 i think.

      --
      Rocket Surgeon.
    13. Re:Predictive purposes? by Samantha+Wright · · Score: 1

      Sounds plausible, if profoundly inefficient and not really worthwhile.

      --
      Bio questions? Ask me to start a Q&A journal. Computer analogies available for most topics!
    14. Re:Predictive purposes? by sjames · · Score: 1

      The problem is when the associated policies make it predictive after the fact. Some random correlation becomes the bank's new high risk loan flag so they up the interest rates and clamp down on any amount of grace. So people matching become more likely to be foreclosed on, so the flag is predictive.

    15. Re:Predictive purposes? by lennier · · Score: 1

      And no, I'm not interested in arguing semantics.

      Oh, so you're just arguing syntax then?

      (Sir Tim Berners-Lee kills a SPARQL query whenever someone uses the word "semantic" to mean "lacking information content". Luckily, no-one notices.)

      --
      You are not a brain: http://books.google.com/books?id=2oV61CeDx-YC
    16. Re:Predictive purposes? by intermodal · · Score: 1

      It's not really about prediction though. It's about manipulation. You can much more easily manipulate the future than predict it with a hands-off approach to its creation.

      --
      In SOVIET RUSSIA... erm...NSA AMERICA, the Internet logs onto YOU!
    17. Re:Predictive purposes? by tlhIngan · · Score: 1

      People still cling to the idea of using the past to predict the future.

      Years ago I knew a guy who played the lottery a lot. He kept a list of all the previous winning numbers and spent countless hours studying the numbers looking for patterns that would allow him to predict future winning numbers. It never worked.

      All of this data collection is essentially the same thing.

      For things that are not entirely random, the past is a great predictor of the future. Things like trends and other factors do regularly show up time and time again - leading to the adage that those who don't know history are doomed to repeat it. Because it's likely it's happened before.

      Thing is, you cannot take individual details and extrapolate from that (e.g., you bought coffee the past year, you'll buy one tomorrow).

      The goal of dataland is to accumulate data and run behaviorial analysis - stuff like if you bought coffee yesterday AND the week before AND something else AND that other thing, you're 99.99% certain to buy a coffee today.

      Of course, we don't know what "something else" and "that other thing" are. Which is why they have large data sets to mine data from.

      It also leads to interesting correlations that may not explain why they're related, but that the two things pretty much go together with practical certainty.

      And yes, there's no true predictability to it - it's a chaotic system, and humans do add some randomness, but there's more than an even chance.

  5. A data score? by Anonymous Coward · · Score: 2, Insightful

    Currently banks and lenders can ask a clearing house about our finacial activities and get a standardized credit score that can then be used to assess the risk of making a loan.

    With ubiqitous data collection, we are already seeing the sale of lists of users who might be interested in a given marketing campaign, i.e. Target sending pregnacy sales/coupons to teens whose parents didn't even know yet.

    It seems that at some point, organizations will want to know "How good is the available data on a given individual?" For a millenial who posts every minute of their lives online, it is likely that the available data is very high and pretty reliable. For a non-technical individual who carries out most of their activities off-line on a cash basis, the available data is pretty sparse and not very reliable.

    Will data collection, big data, and continuous surveliance by business and governments lead to a data score similar to ones credit score? Will people be refused jobs/clearance based in part on this score? How would such a score and organizational behavior affect our society?

    1. Re:A data score? by fuzzyfuzzyfungus · · Score: 3, Interesting

      The nice thing about inaccuracy is that (as long as you are tactful in your exploitation of the data) the user will never know if you fuck up; and if it becomes 'common knowledge' that people shrouded in mystery are usually passed over in favor of transparent choices, we'll probably start seeing advice on 'building persona', just as we currently have people interacting with financial institutions purely for the purpose of 'building a credit score'.

  6. With big power... by gmuslera · · Score: 2

    must come big responsibility. Internet empowered all of us, we should be thankful for that. But having that power implies new rules of convivence. And abusing of that power just because you can always have undesirable consequences in the future.

    1. Re:With big power... by AnotherAnonymousUser · · Score: 1

      Not when you post as an AC ;)...

  7. You have no record? by fustakrakich · · Score: 4, Insightful

    The absence of data is even more suspicious. No facebook, gmail or Linkedin account? Not carrying your cell phone or laptop when entering the country? What are you trying to hide? I hope I'm wrong, but I don't think you can put enough chaff into the system to make a difference. The botnets can sort it out pretty quickly.

    --
    “He’s not deformed, he’s just drunk!”
  8. Microsoft should stifle by Mister+Liberty · · Score: 1

    They are the perpetrators of 'Dataland', and in bed with the NSA.
    Avoid them -- they are worse than a cancer.

  9. PROTIPS by Anonymous Coward · · Score: 4, Informative

    +Turn On Mobile Phone only a few times a day to check for new calls.
    + Use Cash as far as possible
    + TOR

    1. Re:PROTIPS by coolsnowmen · · Score: 1

      Sometimes when people suggest behaviour, I think of: "what happens when everyone does that?"
      Just like if everyone ran ad-block, whole cool parts of the internet would soon go dark; if everyone only turned on their phone 3 times a day, we'ld never really be able to call anyone in the way some of us have come to appreciate / expect today.

  10. My greatest fear by radarskiy · · Score: 1

    Even more than true data getting out of some database, I fear false data getting in and accumulating. Someday, I will not be able to prove who I am because I won't be able to verify all of the false facts that have been stored about me.

  11. If it bothers you by koan · · Score: 1

    Don't participate.

    --
    "If any question why we died, Tell them because our fathers lied."
  12. Prediction Troubles. Manipulation Destroys by jimbrooking · · Score: 4, Insightful

    We wring our hands at the accursed sellers and buyers of our browsing habits. We glibly ignore what happens when we sit for a few hours in front of a television screen. Knowing our browsing habits gets us targeted ads. Getting our minds in a receptive mood by showing the trash that passes for content on commercial TV, then cramming crafty advertising into those receptive minds impels us to do things we wouldn't be predicted to do, which is manipulation.

    Why do Americans lust after 2-ton gas-guzzlers to taxi the kids to school and fetch a couple of bags of groceries from the supermarket? Why does PHaRMA spend untold billions advertising expensive drugs that, in many cases, are no more effective than over-the-counter remedies? Why do so many of our people live in McMansions so expensive they are a paycheck away from foreclosure? Because advertising to minds pried open by "must-see" TV works.

    The TV tells them what they want and how to get it - no money down, pennies per week. And this relentless barrage of hard, soft, and subliminal sales messages passes into the TV-watcher's mind with nothing getting in the way like critical thinking, priorities, or social or environmental concerns.

    We ought to be more worried about what 10-20 hours watching TV every week is doing to us and our society than whether Google is showing us an ad for suntan lotion after we've booked a trip to the Caribbean.

  13. Re:Prediction Troubles. Manipulation Destroys by Anonymous Coward · · Score: 2, Insightful

    This stuff you're talking about is all behavior driven by the need to keep up appearances. Guys in the US have to drive trucks or they're not going to get laid. You need to own a large mcmansion or there is obviously something wrong with your finances. Even the pharma stuff is like that. You should hear my relatives bragging about how many prescription pills they need to stay alive.
    Pop culture is the driver. You need to be conversant on americal idol or the latest failures of your football team at the water cooler, or risk spending lunches by yourself. Conversely, publicly worrying about the NSA makes you some sort of conspiracy paranoiac and once again, solo lunches.

    Ads have something to do with all this, but are certainly not the cause. Media manipulation is present, but the cause of all this is peer pressure.

  14. The world would become a nice place for all. by Anonymous Coward · · Score: 1

    If we all just turned off cookies.

  15. Thought control is happening right now! by hexpill · · Score: 2

    Are you kidding me? We have two political parties in the USA with very powerful propaganda arms designed to tell people exactly what to think. They use half-truths, flawed statistics, and exaggerations. These lies are not difficult to refute, yet people willingly accept the lies just because they're so willing to gravitate to the political echo chambers that are Fox News and MSNBC, among others. In a republic, if a representative is able to control what his/her constituents think, he has essentially become dictator-by-deception. Thought control is here, it's just that people have become willing recipients of the propaganda.

  16. Dataland by Dabido · · Score: 1

    It's double plus good that Kate Crawford should report for re-education. Inventing a word that did not come from the Ministry of Truth is rebellion. She is obviously a friend of Emmanuel Goldstein. Hail Big Brother.

    --
    Sure enough, the cow costume was hanging up next to the superhero outfit and sailors uniform. (S,Spud)