Google Makes It Harder For Marketers To Collect User Data

← Back to Stories (view on slashdot.org)

Google Makes It Harder For Marketers To Collect User Data

Posted by timothy on Sunday December 15, 2013 @07:51AM from the no-peeking-now dept.

cagraham writes "In a seemingly minor update, Google announced that all Gmail images will now be cached on their own servers, before being displayed to users. This means that users won't have to click to download images in every email now — they'll just automatically be shown. For marketers, however, the change has serious implications. Because each user won't download the images from a third-party server, marketers won't be able to see open-rates, log IP addresses, or gather information on user location and browser type. Google says the changes are intended to enhance user privacy and security."

33 of 195 comments (clear)

Min score:

Reason:

Sort:

And google will retain that info exclusively. by Spamalope · 2013-12-15 07:54 · Score: 5, Insightful

While I applaud the move, it is about competitive advantage for Google.
1. Re:And google will retain that info exclusively. by dotancohen · 2013-12-15 07:56 · Score: 3, Informative
  
  While I applaud the move, it is about competitive advantage for Google.
  Google already knew which emails you have or haven't read. So does every other email client, web-based or IMAP / POP3.
  
  --
  It is dangerous to be right when the government is wrong.
2. Re:And google will retain that info exclusively. by jaseuk · 2013-12-15 08:01 · Score: 4, Insightful
  
  Yes and the point the summary misses, is that the images are used to verify that you have received and viewed the e-mail. This is far more important than browser types / locations etc.
  It also prevents some evil things, such as first time you hit the page you get a drive by, the second time (with cookie set) you get the actual image and all seems fine.
  Jason.
3. Re:And google will retain that info exclusively. by pradeepsekar · 2013-12-15 08:16 · Score: 5, Insightful
  
  The article does not state of all images would be cached automatically even if you have not read your mail. It only says that images would be served through a Google proxy server, which caches the images.
  So if Google proxies and caches the images when you open the mail, there is no protection added from marketers, except for the fact that Google can scan the images for exploits.
  And if Google proxies and caches the images as soon as the service receives the mail, marketers can verify if the address is a valid gmail address or not by just sending mails and waiting for Google to cache the image. Expect more spam if this is the case.
  There will be true protection from email tracking only if Google caches the images in all emails it receives, even if the email address is invalid - and that would increase the load on Google servers quite a bit.
4. Re:And google will retain that info exclusively. by EvanED · 2013-12-15 08:19 · Score: 3, Informative
  
  As Google *always* cache the image, the sender does not knows anymore when or even if the image was viewed and, so, doesn't knows anymore if the email was opened.
  If they have specific knowledge about Gmail. Unfortunately, mailers that don't would make the more dangerous assumption (that you read the mail) under that behavior.
  But anyway, even that's not true because under Gmail's new setup, the first download will still come when the user opens the mail and loads the images. At least, that's the best information I can find on this. I also saw a comment somewhere a couple of days ago by someone who claimed to have tested that behavior, and checked that the load of the image came when the mail was opened.
5. Re:And google will retain that info exclusively. by KiloByte · 2013-12-15 08:45 · Score: 4, Insightful
  
  And if Google proxies and caches the images as soon as the service receives the mail, marketers can verify if the address is a valid gmail address or not by just sending mails and waiting for Google to cache the image. Expect more spam if this is the case.
  Verifying that foobar@gmail.com is a valid address doesn't give spammers any real information: the namespace is so full even most pwgen outputs point to existing names, as long as you don't have embedded numbers (on gmail, addresses seem to have numbers at the end).
  Thus, that check can be quite simplified to "does a Markov chain say this string of letters is pronounceable?". Not a big benefit to a spammer. On the other hand, they don't get told anything about the recipient anymore.
  While for a small mail provider this change might leak some info, for Gmail it seems to be nearly entirely positive.
  I for one don't use Gmail for privacy reasons, and don't fetch remote images, but good luck training aunt Lucy about that.
  
  --
  The creatures outside looked from Alt-Right to Antifa; but already it was impossible to say which was which.
6. Re:And google will retain that info exclusively. by hairyfeet · 2013-12-15 09:26 · Score: 3, Interesting
  
  Do those even work anymore on anything other than XP? Because I fix PCs 6 days a week and I haven't seen one in years. The way most folks get infected nowadays is 1.- "Hey its your (insert friend's name) on (insert IM) and I found this great new thing that made my PC faster, just (click this link,push this button)". 2.- "You want to see teh lesbians? To watch this hot video just run 'IzNotViruzIzCodec.exe'". 3.- (insert friend name) just sent you an e-card for (insert holiday), just go here and receive your e-card!" 4.- "Oh noes, you have teh viruz OMG! Run 'IzNotViruzIzCleaner.exe' to get rid of it". That last one works well on old folks BTW
  As for TFA yet again another change that fucks the user or takes a valuable tool away from the user while giving Google more power....are we even surprised anymore? the only nice thing about Google in the last year is only the hardcore Googleaid drinkers buy the "Do no evil" "don't be evil" horseshit, the rest of the world can see its as much bullshit as "think different" and "where do you want to go today", Google has become just as nasty as the other two and in some ways worse.
  
  --
  ACs don't waste your time replying, your posts are never seen by me.
7. Re:And google will retain that info exclusively. by icebike · 2013-12-15 10:40 · Score: 3, Interesting
  
  While I applaud the move, it is about competitive advantage for Google.
  If you applaud this you haven't thought it out very far.
  Almost ever SPAM has small uniquely named images embedded. Often single pixel images.
  These are encoded to your email address. If you fetch this image, your email address is VERIFIED. You just did the spammer a favor.
  If you were reading the email with a mail client, you would NEVER fetch these, because 1) spam is spam, and 2) most
  email clients don't download images by default and most email recipients are just fine with that.
  With Google pre-fetching all of these, every GMAIL address id Verified for the Spammers.
  Its not a well thought out scheme at all. No sensible person would read Gmail with a web browser from now on.
  The wise choice is to use a traditional Email Client, (something like Thunderbird, Kmail, k-9 mail, Evolution, etc), and set them not to load images at all.
  
  --
  Sig Battery depleted. Reverting to safe mode.
8. Re:And google will retain that info exclusively. by icebike · 2013-12-15 10:59 · Score: 3, Insightful
  
  Marketeers already know the address exists the moment they get a 200 on the RCPT TO: header. Spammers, using botnets, generally don't care about the maildelivery itself, for these the autodownload of images is extra information.
  Spammers do everything in their power not to get bounce messages. They do everything they can to not personally contact your (google's) mail server.
  The fact that uniquely encoded image URLs are embedded in virtually ALL spam and UCE should be proof enough for you that you haven't thought your argument through. Go look at your email raw view someday.
  
  --
  Sig Battery depleted. Reverting to safe mode.
9. Re:And google will retain that info exclusively. by Anonymous Coward · 2013-12-15 11:32 · Score: 4, Interesting
  
  This.
  I work for an email marketing company. Since our customers are very keen on not being mixed in with spam, we (and I think I speak for most of our competitors in this respect) take care to ensure only legit (confirmed double opt-in) email accounts are listed, to keep our servers' reputation perfect. Understand that it is in the best interest of legit senders to make customers WANT to recieve their emails. Open images and the statistics they create are primarily used to fine-tune the emails sent.
  These open pixel images have practically no value to spammers (hence very few spammers actually use them); sending out spam over botnets, they don't care if an email address exists. They might care if a batch of several thousand email addresses no longer exists, but tracking and logging individual recipients... that's damn expensive if you're sending to millions of email addresses.
  This cache won't hurt spammers.
  It hurts companies you have subscribed to receive email messages (I sure hope you trust the average Hotmail user's taste, since emails will change to suit their needs).
  And I dare bet that pretty soon, Google will start selling this information, and then everybody will be hurt.
10. Re:And google will retain that info exclusively. by StripedCow · 2013-12-15 11:50 · Score: 3, Interesting
  
  The solution is simple:
  if(connection.ip_address in google_ip_addresses) write(connection, "Sorry Google, only the user may open this image!");
  
  --
  If Pandora's box is destined to be opened, *I* want to be the one to open it.
11. Re:And google will retain that info exclusively. by icebike · 2013-12-15 12:17 · Score: 3, Informative
  
  Typically SMTP servers won't accept delivery if the recipient is unknown, although I suppose when you write your own SMTP server you can do pretty much as you want. I can't see any up-side for google to process mail for NO ONE.
  If they don't accept the body of the message they wouldn't get the URLs.
  
  --
  Sig Battery depleted. Reverting to safe mode.
12. Re:And google will retain that info exclusively. by tlhIngan · 2013-12-15 17:46 · Score: 3, Interesting
  
  Verifying that foobar@gmail.com is a valid address doesn't give spammers any real information: the namespace is so full even most pwgen outputs point to existing names, as long as you don't have embedded numbers (on gmail, addresses seem to have numbers at the end).
  Actually, it does. Because it tells the spammers that the recipient opened the email
  Google doesn't fetch the image until you open the email. And the moment you do, Google just confirmed that the email was read. And that information is very valuable.
  There are two ways Google can fix it:
  1) Set "Don't load images" back as default again, as it is now and in every email client.
  2) Simply load every image, so valuable information like that isn't revealed - the marketer just pays for bandwidth and gets zero information - they don't even know if the image is read. No storage requirements as Google can re-write the email to self-contain all the images.
  Of course, Google is probably going into email marketing - given how Gmail has sprouted that "Promotions" tab (yes, you can turn it off, but don't you think it immediately foreshadows something? It's not Spam, but "Promotions" - what, spam that someone paid to bypass Google's filters?). And they don't need competition - best way to squash it is to starve out the existing marketers.
  And of course, since Google's in the information business, selling that information is very valuable - Google knows what you like, so they can sell targeted ads into your inbox.
13. Re:And google will retain that info exclusively. by mrt_2394871 · 2013-12-15 22:14 · Score: 4, Insightful
  
  If you want to know if I've read an email:
  request a return receipt
  If I want to give you that information, I will.
  Goodness, there's an existing, non-scummy way of working all this out which preserves user expectations of privacy and provides you with the information you actually want, not a poor proxy of it.
14. Re:And google will retain that info exclusively. by gsslay · 2013-12-15 22:47 · Score: 4, Insightful
  
  How would you feel about your customers sending tracking images to you with orders/complaints/queries? Just to "fine-tune" whether they deal with you again? I imagine it could be statistically enlightening to see how quickly you open emails, how often, and how long the response takes. Not so keen?
  I appreciate your efforts to ensure that your emails lists are on target and not spammy, many companies are not so diligent. (Particularly with confirmed opt-ins.) But you have no automatic right to collate any further information about your customers unless they intentionally provide it. Tracking images are sneaky and most certainly not used by your customers intentionally. There is a reasonable expectation of privacy when reading your own email on your own computer.
  You're right about two things though. The days are long gone when spammers cared about whether an address was valid or not. They are not incurring any costs spamming to invalid addresses. All they care about is how many suckers they hook with a response. And yes, the cached image hits are yet more information being sucked up by google, that will inevitably be sold in some way in the future.
Cutting into their business by Anonymous Coward · 2013-12-15 07:56 · Score: 5, Funny

Of course they're cracking down hard - stealing user data is Google's job...they don't like the competition.
They do see open rates by Anonymous Coward · 2013-12-15 07:58 · Score: 3, Informative

The cache system honors no-cache headers. As long as your images are served no-cache, you do see exactly when the email was opened, since the GMail servers refetch it every time. If each user gets a unique URL, you know exactly who opened the email.
1. Re:They do see open rates by Anonymous Coward · 2013-12-15 09:16 · Score: 5, Interesting
  
  Multiple tests by multiple individuals have shown that they do NOT honor any of the various no-cache headers.
  Tracking unique users is still easy (using a unique URL) - but tracking how many times they opened the email, or where they opened it from (IP address) or on what platform is now lost.
2. Re:They do see open rates by StripedCow · 2013-12-15 12:02 · Score: 3, Insightful
  
  Tracking unique users is still easy (using a unique URL)
  Not if google simply opens all e-mail behind the scenes, regardless of whether the user exists or not.
  
  --
  If Pandora's box is destined to be opened, *I* want to be the one to open it.
Possible? by Tim12s · 2013-12-15 07:59 · Score: 3, Insightful

Well, pulling all the images certainly solves the problem of having to display emails with images. The only reason we (I) don't click the display-images button is because the images allow us to be tracked, the images may have some sort of exploit (rare). Originally this used to be due to limited download speeds.
I suspect caching the images allow pre-processing of the images and therefore making the whole system more secure by default. Images could therefore be displayed in full by default with images, preferably with some large images being intelligently excluded by default.
Google could release a mass marketing email API/gateway and monetise that allowing marketeers access to data regardless of whether you open the images/email or not. This is slightly more valuable information.
1. Re:Possible? by symbolset · 2013-12-15 08:50 · Score: 4, Informative
  
  Image formats have been used to compromise browsers in the past, so automatically loading images in your webmail or email client is a bad idea. Fortunately this is just a change from the default behavior so you can turn it off in the options.
  In fact, Microsoft just patched a .tiff image format exploit last Tuesday.
  
  --
  Help stamp out iliturcy.
What this is really by Rosco+P.+Coltrane · 2013-12-15 08:02 · Score: 3, Interesting

is a monopoly tightening its grip on the market it monopolizes.

--
"A door is what a dog is perpetually on the wrong side of" - Ogden Nash
Harder for **Other** Marketers by perpenso · 2013-12-15 08:06 · Score: 4, Insightful

Yeah. The move is to make things harder for **other** marketers. For the marketer named Google it confers advantages.
change or same mistake I made about announcement? by patrixmyth · 2013-12-15 08:08 · Score: 4, Informative

Is this a new change, because after I saw the google announcement, I saw a report that they would share all that data about loading of images with marketers. End result: safer images, but just as much information for marketers, as along as they make nice with Google as 'official' email marketers. Would love to be wrong. Here's my source, Ars Technica article.
http://arstechnica.com/security/2013/12/dear-gmailer-i-know-what-you-read-last-summer-and-last-night-and-today/

--
"Don't you know you're going to shock the monkey?"- Peter Gabriel
Awesome for spam/tracking by saikou · 2013-12-15 08:09 · Score: 4, Insightful

Actually, this is rather awesome for spam/tracking of "real" addresses.
Before silly users could refuse to load external tracking pixels with unique IDs, assigned to each email.
And now? It's auto-downloaded for everyone. Yay!
While absence of IP address, Referral (if tracking image was loaded via https) and Browser info is sad, "everyone now auto-loads images" waaaay outweighs it :P You won't hide from confirming that email address that easily ;)

--
Hyperom.com
1. Re:Awesome for spam/tracking by Stonent1 · 2013-12-15 08:26 · Score: 3, Interesting
  
  Now all the spammers will get their servers overloaded. If they send out millions of e-mails and they all immediately get "opened" by google trying to pull in the picture data.
Re:And when the next JPEG or PNG exploit comes alo by Anonymous Coward · 2013-12-15 08:17 · Score: 3, Interesting

No, because Google will scan the images for viruses and common inconsistencies, then convert them to raw pixel data, using there decoding libraries that don't have these exploits, and then re-encode them into consistent and buffer-overflow-free images, that will work on any old and/or bug-riddled operating system or browser used by the recipient.
I hope google will also re-sacale images when people embed 3000 DPI company logo's in HTML-emails.
Summary is wrong wrong wrong by Dynedain · 2013-12-15 09:46 · Score: 4, Informative

This summary is garbage and complete misrepresents the implications of Gmail's change. (I already researched this last week and developed a solution to avoid cacheing with in-progress email images that might get replaced with final versions)
Every singe email marketing system already uses a unique image URL to identify a given recipient. This is frequently called a "tracking pixel" because it's usually a 1px transparent gif stuck in the corner of an email where it won't be distracting. In fact, this method has been used for web tracking as well for many years. It's how Google Analytics originally worked.
Since these unique images will still get loaded when an email is opened in Gmail, marketers will still be able to track your opens. What they won't see, however, is how many times you re-opened the email. And since the image gets cached and requested through Gmail's proxy, marketers won't get information about your machine like browser, IP address, etc. But if you click-through on a link, or you visited their site before (highly likely if you're on their mailing list) then they have most of that info anyways.
This caching by Gmail is primarily to speed up Gmail since it means images can be loaded and shared on Google's Content Delivery Network which is almost certainly faster than servers owned by the email campaign provider for image hosting.

--
I'm out of my mind right now, but feel free to leave a message.....
Worse, Google now blocks steganography too by Anonymous Coward · 2013-12-15 10:38 · Score: 3, Interesting

I'm surprised that everyone is focused only on how this affects advertisers. That might be just a decoy excuse for the modifications.
A far more fundamental change is that Google will now be transcoding all images, which inherently blocks the sender's ability to transmit steganographically hidden information with plausible deniability. I bet the NSA has been requesting Google to do that for ages, as it must have been an extreme headache to have to scan all images just to find the few with a hidden payload. No such payloads now.
Spooks aside, the effect of this on photography will probably be far more dramatic for the general population, since photographers often transmit precisely controlled images. Google's new transcoding means that Gmail is no longer suitable for sending bit-perfect images of known properties or quality, so we're going to have to put our images in archives from now on, which will be a pain to view.
It seems that Gmail is becoming strictly a conduit for advertising. Google is at least consistent in their evil.
1. Re: Worse, Google now blocks steganography too by Anonymous Coward · 2013-12-15 12:12 · Score: 4, Informative
  
  They're not caching attached images, they're caching linked images.
Re:Hah by TheRaven64 · 2013-12-15 10:50 · Score: 3, Interesting

Not necessarily. A lot of email virus scanners will pre-fetch images and follow links in emails, for example. They'll do it even if they're just forwarding the mail to another server, and sometimes before the mail even gets to the delivery agent.

--
I am TheRaven on Soylent News
Re:The fix that breaks things by Cassini2 · 2013-12-15 11:17 · Score: 4, Interesting

If I were google, I would download images in all incoming messages regardless if they are intended for real email boxes or not. This would let them know which websites are being used for spam. The spam detector could use this information by pattern matching every image (regardless of relabling or website copying), and mark spam accordingly.
Re:Hah by Ash+Vince · 2013-12-16 02:53 · Score: 3, Insightful

Yep and in fact despite what I said earlier, this could be worse. If google pre-fetch every image for instance, then this could have some horrid consequences. Such as confirming e-mail addresses.
Jason
You all seem to assume you are the first people to realise this, ten to one says some Google engineer also realised this and so is just going to get the software to do a hit on the sending or linked server for every image, even if the email address it was sent to does not exist. Then, they can use the content of that image as an additional way to help identify unsolicited email.

--
I dont read /. to RTFA, I read /. to offend people in ignorance.