Google's Sidewalk Labs Plans To Sell Location Data On Millions of Cellphones (theintercept.com)
An anonymous reader quotes a report from The Intercept: Most of the data collected by urban planners is messy, complex, and difficult to represent. It looks nothing like the smooth graphs and clean charts of city life in urban simulator games like "SimCity." A new initiative from Sidewalk Labs, the city-building subsidiary of Google's parent company Alphabet, has set out to change that. The program, known as Replica, offers planning agencies the ability to model an entire city's patterns of movement. Like "SimCity," Replica's "user-friendly" tool deploys statistical simulations to give a comprehensive view of how, when, and where people travel in urban areas. It's an appealing prospect for planners making critical decisions about transportation and land use. In recent months, transportation authorities in Kansas City, Portland, and the Chicago area have signed up to glean its insights. The only catch: They're not completely sure where the data is coming from.
Typical urban planners rely on processes like surveys and trip counters that are often time-consuming, labor-intensive, and outdated. Replica, instead, uses real-time mobile location data. As Nick Bowden of Sidewalk Labs has explained, "Replica provides a full set of baseline travel measures that are very difficult to gather and maintain today, including the total number of people on a highway or local street network, what mode they're using (car, transit, bike, or foot), and their trip purpose (commuting to work, going shopping, heading to school)." To make these measurements, the program gathers and de-identifies the location of cellphone users, which it obtains from unspecified third-party vendors. It then models this anonymized data in simulations -- creating a synthetic population that faithfully replicates a city's real-world patterns but that "obscures the real-world travel habits of individual people," as Bowden told The Intercept. The program comes at a time of growing unease with how tech companies use and share our personal data -- and raises new questions about Google's encroachment on the physical world.
Typical urban planners rely on processes like surveys and trip counters that are often time-consuming, labor-intensive, and outdated. Replica, instead, uses real-time mobile location data. As Nick Bowden of Sidewalk Labs has explained, "Replica provides a full set of baseline travel measures that are very difficult to gather and maintain today, including the total number of people on a highway or local street network, what mode they're using (car, transit, bike, or foot), and their trip purpose (commuting to work, going shopping, heading to school)." To make these measurements, the program gathers and de-identifies the location of cellphone users, which it obtains from unspecified third-party vendors. It then models this anonymized data in simulations -- creating a synthetic population that faithfully replicates a city's real-world patterns but that "obscures the real-world travel habits of individual people," as Bowden told The Intercept. The program comes at a time of growing unease with how tech companies use and share our personal data -- and raises new questions about Google's encroachment on the physical world.
But I thought AT&T, Sprint and T-Mobile stated that they'll no longer sell location data...
Is it anonymized to the point where they can't see who's parking in which driveway or walking into which home? It may be technically "anonymous", but if locations are sufficiently accurate, any POS with a mind to it can "deanonymize" it relatively quickly.
TFA: "Any location data that Sidewalk Labs receives is already de-identified (using methods such as aggregation, differential privacy techniques, or outright removal of unique behaviors)"
Differential privacy is a rigorous mathematical definition of privacy. In the simplest setting, consider an algorithm that analyzes a dataset and computes statistics about it (such as the data's mean, variance, median, mode, etc.). Such an algorithm is said to be differentially private if by looking at the output, one cannot tell whether any individual's data was included in the original dataset or not. In other words, the guarantee of a differentially private algorithm is that its behavior hardly changes when a single individual joins or leaves the dataset -- anything the algorithm might output on a database containing some individual's information is almost as likely to have come from a database without that individual's information. Most notably, this guarantee holds for any individual and any dataset. Therefore, regardless of how eccentric any single individual's details are, and regardless of the details of anyone else in the database, the guarantee of differential privacy still holds. This gives a formal guarantee that individual-level information about participants in the database is not leaked. https://privacytools.seas.harv...
TFA makes it sound sinister, but this is exactly what people signed up for. When turn on your new Android phone for the first time it asks if you want to turn on location history and gives you the privacy policy, which states that anonymized data may be used to build tools like this.
Also note that they don't sell your data, that would make it worthless. They provide a GUI that lets city planners visualize it, similar to how advertisers can select certain interest groups to show ads to but can't access the underlying data used to assign people to those groups. Google isn't about to give away it's USP.
const int one = 65536; (Silvermoon, Texture.cs)
SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
Precisely. In my mind, this is the cost of Waze providing real-time traffic, construction and police reports. Of course, I turn the app off when I'm not actively using it.
Being a Google-controlled company, the news would have been if they had decided NOT to sell that data.
I hope this practice get squashed under avalanche of privacy-related lawsuits.
What Evoogle doing with this is in effect asserting that if they can track any electronic device that you have on you, then they can associate it with your identity and sell resulting location data to the highest bidder in any form without you having any say in this. They don't need to actually have any business relationship or agreement with you, it is sufficient that they can fingerprint and identify your electronic device to own your data.
> Differential privacy is a rigorous mathematical definition of privacy ...
That was informative, thank you.
Data which has been anonymized poorly, if the raw data is distributed rather than statistics, can sometimes be de-anonymized. I see differential privacy mathematically guarantees that the statistics they provide cannot be de-anonymized back to data about individuals.
What you said is true, but not relevant.
Google is distributing statistics about large populations, not tokenized data about individuals.
Tokenized data (raw data with names replaced by numbers) can sometimes be de-anonymized. That's not what Google is doing.
Differential privacy is a rigorous mathematical definition of privacy. ... (supporting nonsense deleted) ...
This gives a formal guarantee that individual-level information about participants in the database is not leaked.
This is getting old.
The issue isn't what is done with data stolen continuously in real-time from millions of people the issue is the theft in the first place.
If someone broken into your house and stole all of your shit... whether they donated it all to a worthwhile charity or pawned it all for crack is irrelevant.
Differential privacy is a rigorous mathematical definition of privacy.
And the word "or" means you have no idea if they did that, or not.
Technically, this is not correct. Many people with phones are tagging you in pictures, correlating your purchases with theirs, and their home "ring" cameras are illegally recording you in public places, dumping it all into a database, which correlates with your facial recognition data and walk/stride patterns.
You're being tracked too.
-- Tigger warning: This post may contain tiggers! --