Ask Slashdot: What Does Your Data Mean To Google? (google.com)
shanen writes: Due to the recent kerfuffles, I decided to try again to see what Google had on me. This time I succeeded and failed, in contrast to the previous pure failures. Yes, I did find Google's takeout website and downloaded all of "my data," but no, it means nothing to me. Here are a few sub-questions I couldn't answer:
1. Much more data than I ever created, so where did the rest come from?
2. How does the data relate to the characteristic vector that Google uses to characterize me?
3. What tools do Googlers use to make sense of the data?
Lots more questions, but those are the ones that are most bugging me right now. Question 2. is probably heaviest among them, since I've read that the vector has 700 dimensions... So do you have any answers? Or better questions? Or your own takeout experiences to share? Oh yeah, one more thing. Based on my own troubled experience with the download process, it is clear that Google doesn't really want us to download the so-called "our own" data. My Question 4. is now: "What is Google hiding about me from me?"
1. Much more data than I ever created, so where did the rest come from?
2. How does the data relate to the characteristic vector that Google uses to characterize me?
3. What tools do Googlers use to make sense of the data?
Lots more questions, but those are the ones that are most bugging me right now. Question 2. is probably heaviest among them, since I've read that the vector has 700 dimensions... So do you have any answers? Or better questions? Or your own takeout experiences to share? Oh yeah, one more thing. Based on my own troubled experience with the download process, it is clear that Google doesn't really want us to download the so-called "our own" data. My Question 4. is now: "What is Google hiding about me from me?"
My question is ; who else is getting data about me from Google? Does Google sell it outright? I suppose that is their business model, but it would be nice to know how my metadata is distributed.
Wow they invented LG G Flex, such innovation such wow
Google maps, Google Earth, keeping their word (My email account) and the rest they offer us.
That's what Google and these other groups don't want you to know.
whatever they didn't get from you.
Seriously, do you really think that with anything short of a court order or an order from Congress (or maybe a gun pointed at their heads) they're really going to show you how much actual data they have collected on you? When you signed up for their 'services' using your real name, you handed them the Keys to the Kingdom, regardless of any agreement (that you likely never read in the first place). The only way to win this game was to have not played in the first place.
The 700 dimensions vector (if it's true) is not something you can make sense of. It's an embedding vector that represents your characteristics in relation to all the other people. Each individual dimension doesn't have a meaning.
I used the provided link to "download all your data" and had it save a "takeout" ZIP file on my Google Drive. I then tried adding a few files to drive and removing them then "really" removing them. In both cases a "removed" file (in the Trashcan but not "really" removed) did not appear in the Takeout archive. I then created a new Takeout archive and had it send it as an email to my gmail account. In both cases it's everything from my drive, calendar, all emails, contacts, bookmarks, photos, etc.
In the expanded ZIP under the root "Takeout" dir there's an "index.html" with details on all the files. The 2nd archive i created even contained the first archive in it's entirety from the "Takeout" folder on my Drive.
Are you seeing something other than this?
The Russians have won. They have made the world a cesspool of distrust, greed, fear and hate.
You tell me NSA collaborators! I assume it means lots of money, and a chance to pretend you in the SS with there cool cloths, FASCISTS
Google doesn't sell it outright. They are aggregating from data brokers and other sources.
You can cut into the data broker model by subscribing to a service like DeleteMe, but it's expensive and not a silver bullet by any means. But doing that + using a privacy-friendly e-mail provider + using a secure messenger + securing your browser with ad/tracker blockers + seriously limiting what you put on social media + using DuckDuckGo or Startpage for search + using a VPN...
If you do ALL that you'll have pretty strong protection.
What does Google mean to my data?
I cant believe we have deteriorated as to let a corporation stalk us
With Google Chrome you can turn many of their tracking features off although if you are feeling paranoid there are other web browsers you can use. It does get more difficult to control or stop information being sent to one or more interested parties if the operating system you are using is configured by default to do so and you can't blame Google Chrome for that.
Like it or not any site, you visit with a web browser will log your information as metadata. Under normal circumstances, metadata is only used for debugging purposes unless a court order is presented to the appropriate managers, (ah the good old days) however depending on the privacy policies of the company that metadata can be sold to interested parties.
It must be noted that most computers even from the 1950's onward logged metadata which as I have explained before is extremely useful for debugging purposes. Under normal circumstances, metadata was only kept for a few days or months (depends on company policy), however, it appears metadata can be used for other purposes and depending which country you live in there may be government policies in place that require retention of metadata for years.
BTW. I run Linux as my primary operating system and I have instant access to four web browsers, those are Google Chrome, Firefox, Konqueror and Qupzilla. There are other browsers I could install (takes about a minute or two) but I choose not to. No matter which browser I use any site I visit will log my activity as metadata even if I am using incognito settings. At least I don't have to worry that my operating system is sending data to interested parties.
There ain't no such thing as proprietary standards only proprietary formats. Standards are by definition open.
First of all, thanks to all the people who have provided thoughtful or useful ideas. I'm about to make the attempt to read everything (except for the ACs, and I'm even considering looking at them this one time), but right now I want to add a few thoughts from my early reflections on the first comments I saw... I'm going to put them in the form of additional questions I wish I could answer from the voluminous, even overwhelming data that the google sent me:
(5) Where is the evidence that I'm a good person who deserves more success? The stuff I'd be glad to show to a prospective employer, for example.
(6) Where is the proof of what a prick I am? Assuming I want to be a better person (and I strongly suspect that many of us are lying to ourselves on that topic), I want to see that evidence to change or challenge. (To change myself or to challenge the evidence.)
(7) Is there anything in there that I should actually be afraid of? Things that my enemies could use against me, especially if they are more motivated to find those things than I am?
(8) How could I find evidence of strengths that I don't even know I have?
So far I'm just being highly reflective, but I've thought of at least two question beyond that low level...
(9) What is the economic value of my personal information to the google?
(10) Where are the pieces the googlers are cashing in on?
Now I'm going to look over the entire thing seeking more clues and provocations... Seems to be my duty as the "instigator" of this discussion in some sense of instigate...
Freedom = (Meaningful - Coerced) Choice != (Speech | Beer^2), and sad sock puppets' bad mods avail them naught.
Sorry, Trax3001BBS, but I have to conclude that you are a terrible writer. Perhaps indifferent to communicating? If so, why write at all?
I'm really trying to strain my imagination for some meaning in any of your comments. Perhaps your last comment is supposed to mean that you think I'm advocating on behalf of Facebook in some sense of its superiority to the google? If so, I would say that I basically have the same questions (and concerns) about the Facebook data, even though there was so much less of it. At least based on Facebook's claim to have three orders of magnitude less data about me...
Freedom = (Meaningful - Coerced) Choice != (Speech | Beer^2), and sad sock puppets' bad mods avail them naught.
So far in my explorations of the data I haven't seen any browser history data, though I strongly suspect the google is collecting it. Are you saying that it isn't anywhere in the archive? Is the google claiming that this is some sort of derived information that belongs to the google, not me?
My hypothesis is that it's in there somewhere in some form, but I just don't know how to look for it. I certainly can't prove it isn't there.
Freedom = (Meaningful - Coerced) Choice != (Speech | Beer^2), and sad sock puppets' bad mods avail them naught.
It seems that you are leading into some of the sensitive topics that were touched upon earlier in the context of VPNs... However, to say more would be to draw exactly the sort of attention that I don't want to attract?
Freedom = (Meaningful - Coerced) Choice != (Speech | Beer^2), and sad sock puppets' bad mods avail them naught.
Really? Every search, every location, every image, every chat...? I'm a queer leftist atheist. Those three vectors alone (I'm white) could get me thrown in jail in some parts of the country. I'm a threat to the right-wing establishment, and I feel very scared about my online trail. I've been off social media and I'm using TOR to de-fingerprint even this session (fingerprinting a host computer is trivial despite the facade of "privacy" settings in browsers).
I think if you were more at-risk demographic, you wouldn't be so cavalier about what Google knows about you.
2. Google doesn't have all that data unified. The takeout project is actually the most unified view of your data.
3. Googlers in general doesn't have access to your data. Systems do, and use it in an automated fashion. There are break glass access for some engineers for some types of troubleshooting - but this triggers alarms.
In general, during my > 5 years at Google, I realized it's a company I'll trust with my data for many years to come. The "Data Liberation Front" who ensures that data takeout is available is huge. Also, GDPR in Europe ensures that data takeout needs to be very easy for many years to come. Google was just years ahead of the law there.
Google's panopticon means the Stasi know every detail of my life.
Is that you?
The main thing to understand here is that there are two types of data:
- Your raw data
- Their 'derived data'
This 'Derived data' (as the databroker industry calls it) is where the real value is. These algorithmically formed 'opinions' about you are the valuable distilled product they sell. In the USA this derived data doesn't belong to you. It's protected as a form of corporate free speech.
In the EU this is a little different, as these 'opinions' are also considered personal data. The question is to what extent you get access to it. For example, the threshold for personal data is when a piece of data can be traced back to less than 11 people. So the trick here is to create opinions about small groups of which you are a part. For example: knowing that someone with cancer lives in one of three adjacent houses, that is not considered personal data.
Z^-1
Freedom = (Meaningful - Coerced) Choice != (Speech | Beer^2), and sad sock puppets' bad mods avail them naught.
You only looked at the first part of the results? However, I think what you saw from Personality Insights was an example of GIGO. You picked problematic input. Not just the effect of multiple authors, but also Trump's YUGELY garbled delivery of whatever he was supposed to say mashed into whatever popped into his head from moment to moment. Largely incoherent input, and yet some parts of the results make sense. Empathetic? Yes, but in a twisted way. I actually think that Trump is strong on the "humanist" dimension, but strongly negative in his polarity. The only person who matters to Trump is himself, and I'm sort of unsurprised that the results got confused on that dimension because Trump does have extremely high empathy for the people who matter, which is only himself.
(Me? It's the idealistic dimension that dominates... But the world is mostly run by the materialists.)
In my largest experiment, I actually used my side of some long email exchanges with a particularly close friend, and the results were surprising and not surprising... At the time it seemed like too much effort, but perhaps I should try feeding it different samples to see how consistent my personality is when I'm writing to different people? Or how much I've changed over times? I think I didn't want to go there because the next step is analyzing THEIR sides of the conversations, perhaps on the excuse of seeing what sorts of people my friends tend to be...
Freedom = (Meaningful - Coerced) Choice != (Speech | Beer^2), and sad sock puppets' bad mods avail them naught.
Or maybe there is no answer along the lines I was seeking?
Anyway, I do want to thank the constructive contributors, even though I didn't learn the kinds of things I was hoping to learn. I did learn a few new things and got a few new ideas, but mostly I feel like I framed the topic incorrectly. Is it evidence of too much Japanese influence to feel like an apology is in order?
However, Slashdot marches on, and this "story" has pretty much expired already... The google and our private data held by the data is not expiring.
Freedom = (Meaningful - Coerced) Choice != (Speech | Beer^2), and sad sock puppets' bad mods avail them naught.
China is now actively hiring semiconductor professionals and they are paying huge salaries for those who are experienced
At last count China still needs at least 400,000 semiconductor professionals
If you ever faced age discrimination on your current career in the semiconductor field, please consider China as your next move
An added bonus --- China does not have the SJW problem
Z^-2
Freedom = (Meaningful - Coerced) Choice != (Speech | Beer^2), and sad sock puppets' bad mods avail them naught.