Why is this Company Tracking Where You Are on Thanksgiving? (theoutline.com)
Earlier this week, several publications published a holiday-themed data study about how families that voted for opposite parties spent less time together on Thanksgiving, especially in areas that saw heavy political advertising. The data came from a company called SafeGraph that supplied publications with 17 trillion location markets for 10 million smartphones. A report looks at the bigger picture: The data wasn't just staggering in sheer quantity. It also appears to be extremely granular. Researchers "used this data to identify individuals' home locations, which they defined as the places people were most often located between the hours of 1 and 4 a.m.," wrote The Washington Post. The researchers also looked at where people were between 1 p.m. and 5 p.m. on Thanksgiving Day in order to see if they spent that time at home or traveled, presumably to be with friends or family. "Even better, the cellphone data shows you exactly when those travelers arrived at a Thanksgiving location and when they left," the Post story says. To be clear: This means SafeGraph is looking at an individual device and tracking where its owner is going throughout their day. A common defense from companies that creepily collect massive amounts of data is that the data is only analyzed in aggregate; for example, Google's database BigQuery, which allows organizations to upload big data sets and then query them quickly, promises that all its public data sets are "fully anonymized" and "contain no personally-identifying information." In multiple press releases from SafeGraph's partners, the company's location data is referred to as "anonymized," but in this case they seem to be interpreting the concept of anonymity quite liberally given the specificity of the data.
Does the headline refer to Google or to Facebook?
#DeleteChrome
It's the Russians, who got you to vote for orange Hitler, thus dividing the country and now they are checking how badly they divided your families as well.
So if you celebrate with your parents, they'll adapt their ads until you don't talk anymore to your dad and uncle after the 2018 vote.
I don't have any idiots in my family stupid enough to vote for Trump. (At least they don't own up to it.) And I'm talking extended family as well. I can't figure out who actually DID vote for him; I really think there was some kind of fraud. I'm in one of the swing states that tipped for him, and none of my relatives or coworkers voted for him. I don't live in some kind of bubble. It doesn't pass the smell test.
I firewall every app, including google apps on my phone using the free firewall NetGuard. I also have data turned off and data limits set to zero. I also keep my phone in airplane mode, although that would be impractical for most people. This assures that I am never tracked by google, our Nazi snooping government, etc.
All these claims surfacing about Hollywood and Politicians having inappropriate relations with women from 20 years ago. Imagine the amount of blackmail dirt they will have in the next 20 years. Everything you do, say, and part of how you think (at least online) is being tracked and saved. It may not come back to haunt you but get rich, famous, or powerful enough and you might just find yourself writing checks to people to keep quite because you left your phone on when you went to a location that becomes unpopular 20 years in the future.
"Why is this Company Tracking Where You Are on Thanksgiving?"
Shitty privacy laws from shitty paid-for public "servants". Anything else is a distraction from that issue.
Populus vult decipi, ergo decipiatur...
"Force shits upon Reason's back." - Poor Richard's Almanac
It's not just Thanksgiving, that was just an interesting data point -- like all of the other ones...
If you don't have the data, you can't scan it. But if you do, you can squeeze the data so hard that a 0 becomes a 1.
If the universe is someone's simulation -- does that mean the stars are just stuck pixels?
When they drill down to see more "granular data" on me they find that the rabbit hole goes deep indeed. =)
Why did the chicken cross the road? Because Elon Musk put an AI chip in its head.
From the OP:
"A common defense from companies that creepily collect massive amounts of data is that the data is only analyzed in aggregate; for example, Google's database BigQuery, which allows organizations to upload big data sets and then query them quickly, promises that all its public data sets are 'fully anonymized' and 'contain no personally-identifying information.' "
I think it is critically important that we [as the data subjects ] recognise an important distinction.
This statement would be equally true if the company:-
1. Collected all the data with maximum resolution
2. Stored that data in a maximum resolution data set
3. Created a transformation process that took the maximum resolution data, "anonymized it" as it was loaded into a queryable database
4. Ran queries of the database...
The point being that the wording is so specious and so perfect for leading you to jump to the wrong conclusion. In other words, unless the company actually comes out with, "We do not store or otherwise retain access to your data in original or non-anonymized form - and you can come audit us so we can prove it to you", then they are not to be trusted.
And remember, anything that is captured - even if not used as part of the company's commercial offering - can be subpoenad or demanded via NSL.
And if your company is doing something that is right on the edge of being shut down by i.e. privacy laws, then maybe one way of staying just inside the line of acceptability [to government] is to offer to share what you've got if they ask...
None of this is safe. None of it.
If you have nothing to hide why are you so concerned about us having all these details about you?
In other news researchers also said that 65.237 percent of moms wished that their children away at college would call home more often. For children whose names begin with B, the percentage of mothers who want their children to call home is 71.237. For moms whose first name begins with L, the percentage is 73.543. The researches assured us that the data is anonymized.
But the article doesn't tell us what collects the data and instead advises turning off location services.
Anonymous doesn't mean that I can't track you from your house to your work. It's that I don't know who you are when you get to either destinations. We build an incredible database of information about anonymous people, unfortunately with enough information we can often de-anonymise them.
Same with Bitcoin being anonymous. Just because I know exactly how much money is in your wallet and exactly where you spend it doesn't make it less anonymous.
if your phone is your closest companion, maybe this life is not meant for you
Why is this Company Tracking Where You Are on Thanksgiving?
Because you are disclosing the data.
Anything you give away will be logged. That is the modern world. If you don't want them to collect it, do not leak the data to begin with.
Hey, you turned on location services on your phone, you installed the shiny free app (for which you will pay with your privacy and being tracked), and you have decided that's OK. And you have elected officials who refuse to do anything about real privacy legislation to ensure that greedy douchebags can do exactly this, because your elected officials represent the corporations and not you.
Turn off location services, uninstall these apps which primarily exist to perform analytics, and stop using this shit.
Or, shut the fuck up, and realize you've signed up for exactly this kind of shit. Welcome to the future, where Big Brother is an opt in program that everyone is keen to use, and all of the idiots have opted in ... and where the government can now demand this information they would not either be able to legally or practically collect.
This is a self inflicted societal problem, and those of us who see little value in shiny baubles that track your every location sincerely don't have any fucking sympathy.
The entire business model of those free apps is collecting and selling your personal information. And if you gave them access to location services 24x7, don't be even remotely surprised they can do this.
Analytics is the scariest form of Big Brother you can imagine, because as long as people are running these pointless apps, this is only going to get worse.
(And I see after all these years the ability to paste actual content into Slashdot is still broken ... and which has some of these very same kind of trackers on it.)
Because they can.
to safegraph. If the data is say held by Google and they allow only certain aggregate queries to be done but never give you anything but the aggregate answer then Safegraph won't know what happened in individual houses. This gets very tricky though. You have to have some thresholds about how small an area you can give a report on.
For example - The Canadian credit bureaus will sell reports based on postal code (a postal code is a side of a street, between intersections), that give the high, low and median score. Now if there were under a certain number of people in that postal code we didn't give the information (This was a decision made by the programmers, legally the company could) but what about the case where the high and the low score were almost the same? In such a case, revealing the high and the low essentially revealed everyone's score.
They just asked Google.
Sent as ripples into the electromagnetic field. No single photon has been harmed in the process.
The author of the article might want to learn what words mean. They do typically have meanings, you know.
Anonymous data is data not identified with a particular person. It does not mean cannot be identified with a particular person. It also does not mean the data cannot be associated with itself over time.
Five-digit ZIP code areas are pretty big and are not particularly indicative of an individual. Cell tower coverage is typically more detailed than a five-digit ZIP code. ZIP code of residence is trivial to determine from mobile phone records: it’s where your phone spends the majority of the day. ZIP code of work place is also fairly easy to determine: it’s where your phone spends the majority of the day when it’s not at home. Associate these two ZIP codes, though, and the association is unique for about 90 to 95% of the US population. Therefore knowing these two ZIP codes means you have isolated an individual. All anonymity means is that this information, by itself, does not tell you who that individual is. You can find out, though, with a subpoena, not even a warrant—or a friendly employee of the wireless carrier—or if you have someone specific in mind and you know or can find out where they live and work.
It is useful to consider how powerful location data is. A phone goes to a cancer clinic twice a week but not five times a week in 8-hour blocks? The phone owner has cancer. A phone goes to an ob-gyn twice in a single month? The phone owner is pregnant. A phone goes to an ob-gyn once a month for three months running? The phone owner is trying to get pregnant. A phone goes to a particular church most Sunday mornings? The phone owner belongs to the denomination of that church. Two phones are sporadically at the same motel at the same time (even if the particular motel changes)? The phone owners are having an affair. And on and on it goes.
Because de-anonymizing data is so trivial, having access only to anonymous or anonymized data protects against absolutely nothing.
And yet in this particular story, anonymity was retained. You can identify households from individual location data alone, which the study did. You can identify likely political leanings from individual location data alone, which the study did. You don’t need to attach names to the individuals to study the individuals, and this study did not.
Anonymity does not mean you as an individual cannot be identified. It just means you haven’t been—yet.
At least it is when you're talking about individual citizens. But if it's the government or a corporation, stalking is apparently perfectly acceptable.
It defines "home" as where I am most often from 1 - 4am?? That's just crazy talk ...
"Ahh! I see you're in that indeterminate Schrodinger state where - oh, uh
Anonymous = without names. Location, dates, homes, family connections, and times of day aren't names.
Congrats on shitty laws.
Anonymity does not mean you as an individual cannot be identified. It just means you haven't been - yet.
Excellent post, but I think you err on the side of too much optimism. Anonymity doesn't mean you haven't been identified yet; it means you have not been told you were identified yet.
Send you phone to another country, round trip.
Money.
Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
I'd like some more justification for "90-95% of the population will make up a unique (home,work) ZIP pair." (paraphrased)
If we assume the estimated 330M population of the US, and *around* 43k ZIP codes, that 76% of the population is adult, and the relatively crazy assumption that ALL of them are working, let's do some combinatorics...
Yes, there exists sufficient space in Binomial[43001,2] (Mathematica/Wolfram Alpha notation) to *permit* every adult to have a unique pair. The estimate gives 251M adults and about 925M k-multicombinations (43000 pick 2, with repetition permitted) of (ZIP,ZIP) pairs.
Lots of people live and work in the same ZIP, though, especially among non-white-collar jobs, or live and work in the one or two residential ZIP codes and commercial ZIP codes in smaller areas.
That said, I'm not claiming it's not easy-ish to take "anonymous" data and reattach it to a person. It's all too easy.
Or should I just turn off location tracking for everything. I think I just answered my own question.
How do they avoid tracking in countries where privacy laws are more strict? It seems to me that the data they collect does not conform to European privacy laws.
About the first google hit refers to the study. See https://books.google.com/books...
If you're on Facebook, you're already being tracked. Something that "Chaos Monkeys: Obscene Fortune and Random Failure in Silicon Valley" makes clear when Facebook had the author set up their advertising system.
Sure, data is anonymized, but it still contains the location where you sleep and where you work (the later being just what is missing to sort out people living in the same building)
Any reasonable person knows that anonymous does not mean "We Know Everything about you but pretend to not know your name so we can equivocate that you are anonymous."
To fail to disclose that you know everything about someone, and share that data, is failure to disclose material facts. And is therefore, fraud.