Ask Slashdot: What Does Your Data Mean To Google? (google.com)
shanen writes: Due to the recent kerfuffles, I decided to try again to see what Google had on me. This time I succeeded and failed, in contrast to the previous pure failures. Yes, I did find Google's takeout website and downloaded all of "my data," but no, it means nothing to me. Here are a few sub-questions I couldn't answer:
1. Much more data than I ever created, so where did the rest come from?
2. How does the data relate to the characteristic vector that Google uses to characterize me?
3. What tools do Googlers use to make sense of the data?
Lots more questions, but those are the ones that are most bugging me right now. Question 2. is probably heaviest among them, since I've read that the vector has 700 dimensions... So do you have any answers? Or better questions? Or your own takeout experiences to share? Oh yeah, one more thing. Based on my own troubled experience with the download process, it is clear that Google doesn't really want us to download the so-called "our own" data. My Question 4. is now: "What is Google hiding about me from me?"
1. Much more data than I ever created, so where did the rest come from?
2. How does the data relate to the characteristic vector that Google uses to characterize me?
3. What tools do Googlers use to make sense of the data?
Lots more questions, but those are the ones that are most bugging me right now. Question 2. is probably heaviest among them, since I've read that the vector has 700 dimensions... So do you have any answers? Or better questions? Or your own takeout experiences to share? Oh yeah, one more thing. Based on my own troubled experience with the download process, it is clear that Google doesn't really want us to download the so-called "our own" data. My Question 4. is now: "What is Google hiding about me from me?"
Uh? What question are you trying to answer? And how does that question relate to any of the questions I posed? At first I thought you were trying to say something about derived data, but now I have no idea...
However, one of the categories of data I was looking for was data about me from other sources. For example, in terms of marketing my data to the advertisers, such external data as my credit history would seem to be highly relevant. Perhaps I can find my credit report somewhere in there?
In the original questions I left out one of the peculiarities I already discovered. A lot of "my" data that the google sent me was actually links to other places where I had posted things. In other cases the links seemed completely unrelated to me, as with a Google Play app to some game I don't believe I've ever downloaded or played.
Freedom = (Meaningful - Coerced) Choice != (Speech | Beer^2), and sad sock puppets' bad mods avail them naught.
I think I agree with you as far as you went, but in that case part of the information I am asking about is the context to interpret the shape of the categorization space and where I am within it. That is also in terms of the relationships to the parts of my data that contributed to my location and to the accuracy of that location. The google can reveal a lot about the space without exposing any of the individuals within it.
Perhaps a more concrete example will help? For example, can the google look at the vectors of spouses to assess how well their marriages are liable to work? Just asking for a friend, since I'm pretty sure my wife would NOT let me look at her data. She'll barely tell me when breakfast is ready.
Freedom = (Meaningful - Coerced) Choice != (Speech | Beer^2), and sad sock puppets' bad mods avail them naught.
Me thinks you [Lanthanide] are projecting, but I will confess that I never did understand how my own parents stayed together. My condolences to your much better half. Or perhaps better to respond with some variation of the old grading joke: "I was one of the students who made the dean's list possible!"
That was just minor tit for even more minor tat. The most appropriate response would probably be to ask "Don't you have anything to say on any aspect of the actual topic at hand?" If you know nothing and have nothing to say, then you can always say nothing.
I actually did consider raising the issue of using personality characterization for marriage guidance and counseling. I would not be at all surprised to find out that some branch of the google is exploring related business opportunities. However my own interests these days are probably much more mundane. I'm just trying to figure out who's treading on my freedom.
By the way, I don't think the google is the worst abuser of our personal information. In a sense, the google's motives are pure insofar as they are focused on the money. Almost every question about what the google is doing with our information comes back to the answer "... because they think it will increase their profits."
Freedom = (Meaningful - Coerced) Choice != (Speech | Beer^2), and sad sock puppets' bad mods avail them naught.
Highly principled stand, and I congratulate you [HermMonster] for your energy and enthusiasm and even for your efforts, but I think you are deluding yourself. One reason is that by attempting to hide yourself you would actually be attracting attention to yourself. Quite possibly, you are even rendering yourself a marked man and the FBI is following you around trying to figure out what you are trying to hide.
More seriously, some of the services cannot be used without leakage. Let me take an innocent example, the case of using a private browser window to evade a paywall. This is something I've started doing fairly routinely when using Google News. I've made the calculation that I'm willing to let the google use my identity to recommend stories that I'm interested in, but what am I actually hiding there? I'm willing to assume that the paywalled website is fooled into thinking that I've never been seen before (by the website), but am I actually fooling the google? I don't think so precisely because it is clearly in the google's interest to detect the link translation process.
In concrete terms of the data that I just downloaded from the google, I suspect that there are browser histories in there, including information on direct and indirect links to other websites. The new derivative question is whether or not the google is reporting on this to the paywalled websites?
So far this topic seems to be generating lots of new questions in my mind, and I haven't found many (any?) answers.
Freedom = (Meaningful - Coerced) Choice != (Speech | Beer^2), and sad sock puppets' bad mods avail them naught.
2. Google doesn't have all that data unified. The takeout project is actually the most unified view of your data.
3. Googlers in general doesn't have access to your data. Systems do, and use it in an automated fashion. There are break glass access for some engineers for some types of troubleshooting - but this triggers alarms.
In general, during my > 5 years at Google, I realized it's a company I'll trust with my data for many years to come. The "Data Liberation Front" who ensures that data takeout is available is huge. Also, GDPR in Europe ensures that data takeout needs to be very easy for many years to come. Google was just years ahead of the law there.