Slashdot Mirror


Google's Sidewalk Labs Plans To Sell Location Data On Millions of Cellphones (theintercept.com)

An anonymous reader quotes a report from The Intercept: Most of the data collected by urban planners is messy, complex, and difficult to represent. It looks nothing like the smooth graphs and clean charts of city life in urban simulator games like "SimCity." A new initiative from Sidewalk Labs, the city-building subsidiary of Google's parent company Alphabet, has set out to change that. The program, known as Replica, offers planning agencies the ability to model an entire city's patterns of movement. Like "SimCity," Replica's "user-friendly" tool deploys statistical simulations to give a comprehensive view of how, when, and where people travel in urban areas. It's an appealing prospect for planners making critical decisions about transportation and land use. In recent months, transportation authorities in Kansas City, Portland, and the Chicago area have signed up to glean its insights. The only catch: They're not completely sure where the data is coming from.

Typical urban planners rely on processes like surveys and trip counters that are often time-consuming, labor-intensive, and outdated. Replica, instead, uses real-time mobile location data. As Nick Bowden of Sidewalk Labs has explained, "Replica provides a full set of baseline travel measures that are very difficult to gather and maintain today, including the total number of people on a highway or local street network, what mode they're using (car, transit, bike, or foot), and their trip purpose (commuting to work, going shopping, heading to school)." To make these measurements, the program gathers and de-identifies the location of cellphone users, which it obtains from unspecified third-party vendors. It then models this anonymized data in simulations -- creating a synthetic population that faithfully replicates a city's real-world patterns but that "obscures the real-world travel habits of individual people," as Bowden told The Intercept. The program comes at a time of growing unease with how tech companies use and share our personal data -- and raises new questions about Google's encroachment on the physical world.

14 of 100 comments (clear)

  1. Look what we have here by fortythirteen · · Score: 4, Insightful

    Sidewalk Labs explains that Replica’s data is purchased from telecommunications companies and companies that aggregate mobile location data from different apps.

    But I thought AT&T, Sprint and T-Mobile stated that they'll no longer sell location data...

    1. Re: Look what we have here by Anonymous Coward · · Score: 3, Interesting

      Even if it did matter it still would not matter because no professional data firm would put PII at risk anyway and would always keep data in the proper hands. Of course, feel free to keep preaching otherwise to yourself if you like wasting your own time in bouts of what ifs.

      Oh, you mean like this data firm?

      "Facebook and Twitter hold a huge amount of users' personal data while LinkedIn includes users' professional data. Data from real-estate site Zillow was also roped in to create these consolidated user profiles. Researchers believe these profiles containing sensitive and personally identifiable information is highly coveted and targeted by hackers."

      Or, perhaps this one? I mean, it's Google, right? They've never had this problem before, right?

      Oh, wait! Maybe you mean this one!

      I believe we, as a society, and as a global people, need to put Google, and others who hoover up and trade in peoples' data in their place. We need to get up off our collective butts, find or create an alternative to the service(s) offered by them. Start with Google. Bankrupt them and bury them. Fast and hard. Perhaps that will teach the others like them to think twice before engaging in this chicanery.

  2. Re:Anonymized by b0s0z0ku · · Score: 4, Informative

    Is it anonymized to the point where they can't see who's parking in which driveway or walking into which home? It may be technically "anonymous", but if locations are sufficiently accurate, any POS with a mind to it can "deanonymize" it relatively quickly.

  3. Re:Anonymized by evendiagram · · Score: 4, Informative

    TFA: "Any location data that Sidewalk Labs receives is already de-identified (using methods such as aggregation, differential privacy techniques, or outright removal of unique behaviors)"

    Differential privacy is a rigorous mathematical definition of privacy. In the simplest setting, consider an algorithm that analyzes a dataset and computes statistics about it (such as the data's mean, variance, median, mode, etc.). Such an algorithm is said to be differentially private if by looking at the output, one cannot tell whether any individual's data was included in the original dataset or not. In other words, the guarantee of a differentially private algorithm is that its behavior hardly changes when a single individual joins or leaves the dataset -- anything the algorithm might output on a database containing some individual's information is almost as likely to have come from a database without that individual's information. Most notably, this guarantee holds for any individual and any dataset. Therefore, regardless of how eccentric any single individual's details are, and regardless of the details of anyone else in the database, the guarantee of differential privacy still holds. This gives a formal guarantee that individual-level information about participants in the database is not leaked. https://privacytools.seas.harv...

  4. Re:What else would you expect? by AmiMoJo · · Score: 4, Interesting

    TFA makes it sound sinister, but this is exactly what people signed up for. When turn on your new Android phone for the first time it asks if you want to turn on location history and gives you the privacy policy, which states that anonymized data may be used to build tools like this.

    Also note that they don't sell your data, that would make it worthless. They provide a GUI that lets city planners visualize it, similar to how advertisers can select certain interest groups to show ads to but can't access the underlying data used to assign people to those groups. Google isn't about to give away it's USP.

    --
    const int one = 65536; (Silvermoon, Texture.cs)
    SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
  5. Re:What else would you expect? by sh00z · · Score: 2

    Precisely. In my mind, this is the cost of Waze providing real-time traffic, construction and police reports. Of course, I turn the app off when I'm not actively using it.

  6. This is not news by OneHundredAndTen · · Score: 4, Insightful

    Being a Google-controlled company, the news would have been if they had decided NOT to sell that data.

  7. Invasive tracking by sinij · · Score: 4, Insightful

    I hope this practice get squashed under avalanche of privacy-related lawsuits.

    What Evoogle doing with this is in effect asserting that if they can track any electronic device that you have on you, then they can associate it with your identity and sell resulting location data to the highest bidder in any form without you having any say in this. They don't need to actually have any business relationship or agreement with you, it is sufficient that they can fingerprint and identify your electronic device to own your data.

    1. Re:Invasive tracking by thegarbz · · Score: 2

      I hope this practice get squashed under avalanche of privacy-related lawsuits.

      It won't, let me explain why:

      1. You expressly agreed to this data being collected and also being used in far worse ways than this.
      2. No data is being sold, only a aggregated results based on data is being sold, and even then only access to this data rather than the raw dataset itself.
      3. No individuals can be identified from this data so there's no privacy related effects on anyone.
      4. The high bar for privacy in the USA relies on someone being materially impacted. Far worse privacy breaches have gotten nowhere with lawsuits.

      then they can associate it with your identity and sell resulting location data to the highest bidder in any form without you having any say in this.

      5. Except Google has never and is not now selling any information or even provided information that individually identifies a specific person without that being in control of the user.
      6. You have an express say in this through the use of a Google account with an Android device and your location services being active. Furthermore Google provides you complete insight and control over your location data including the ability to delete it from its timeline service.

      They don't need to actually have any business relationship or agreement with you, it is sufficient that they can fingerprint and identify your electronic device to own your data.

      Actually it's quite the opposite. Without the business relationship or agreement with you they would have fuck all location data that makes these services possible which is precisely why they expressly ask you to agree to this service when you first power on your phone.

      What Evoogle doing

      Now the real question is that statistically based on your UID you're an adult, however based on your speech patterns you more closely match those 13 year olds who used to think replacing the S in Microsoft with a $ was somehow "cool". Does your father know you're using his Slashdot account?

  8. Mod informative by raymorris · · Score: 2

    > Differential privacy is a rigorous mathematical definition of privacy ...

    That was informative, thank you.

    Data which has been anonymized poorly, if the raw data is distributed rather than statistics, can sometimes be de-anonymized. I see differential privacy mathematically guarantees that the statistics they provide cannot be de-anonymized back to data about individuals.

  9. If (false) { . (true but not relevant) by raymorris · · Score: 3, Informative

    What you said is true, but not relevant.

    Google is distributing statistics about large populations, not tokenized data about individuals.

    Tokenized data (raw data with names replaced by numbers) can sometimes be de-anonymized. That's not what Google is doing.

  10. Re:Anonymized by WaffleMonster · · Score: 2

    Differential privacy is a rigorous mathematical definition of privacy. ... (supporting nonsense deleted) ...

    This gives a formal guarantee that individual-level information about participants in the database is not leaked.

    This is getting old.

    The issue isn't what is done with data stolen continuously in real-time from millions of people the issue is the theft in the first place.

    If someone broken into your house and stole all of your shit... whether they donated it all to a worthwhile charity or pawned it all for crack is irrelevant.

  11. Re:Anonymized by Aighearach · · Score: 2

    Differential privacy is a rigorous mathematical definition of privacy.

    And the word "or" means you have no idea if they did that, or not.

  12. Re:The Ultimate Anonymizer by WillAffleckUW · · Score: 2

    Technically, this is not correct. Many people with phones are tagging you in pictures, correlating your purchases with theirs, and their home "ring" cameras are illegally recording you in public places, dumping it all into a database, which correlates with your facial recognition data and walk/stride patterns.

    You're being tracked too.

    --
    -- Tigger warning: This post may contain tiggers! --