Google Makes It Harder For Marketers To Collect User Data
cagraham writes "In a seemingly minor update, Google announced that all Gmail images will now be cached on their own servers, before being displayed to users. This means that users won't have to click to download images in every email now — they'll just automatically be shown. For marketers, however, the change has serious implications. Because each user won't download the images from a third-party server, marketers won't be able to see open-rates, log IP addresses, or gather information on user location and browser type. Google says the changes are intended to enhance user privacy and security."
While I applaud the move, it is about competitive advantage for Google.
Of course they're cracking down hard - stealing user data is Google's job...they don't like the competition.
The cache system honors no-cache headers. As long as your images are served no-cache, you do see exactly when the email was opened, since the GMail servers refetch it every time. If each user gets a unique URL, you know exactly who opened the email.
Well, pulling all the images certainly solves the problem of having to display emails with images. The only reason we (I) don't click the display-images button is because the images allow us to be tracked, the images may have some sort of exploit (rare). Originally this used to be due to limited download speeds.
I suspect caching the images allow pre-processing of the images and therefore making the whole system more secure by default. Images could therefore be displayed in full by default with images, preferably with some large images being intelligently excluded by default.
Google could release a mass marketing email API/gateway and monetise that allowing marketeers access to data regardless of whether you open the images/email or not. This is slightly more valuable information.
continue: }
You'll get hit automatically! It's a win-win!
#DeleteChrome
http://arstechnica.com/security/2013/12/dear-gmailer-i-know-what-you-read-last-summer-and-last-night-and-today/
As long as you're giving a unique url to each user who you email, this actually makes open-rate calculations a lot more accurate, doesn't it? Instead of a large percentage of your users never seeing the image, they'll all get loaded.
Sure you can't track cookies, get IP addresses, or any of that anymore...
I'm assuming Google is only downloading images of emails that people open. If Google is downloading every image of every email they get, then never mind.
cached when you open the email or cached if you are not even logged in? this could either verify active email addresses for spammers or hinder spammers, which is it?
This fixes: opening ratio, opening time, user's IP.
This breaks: spammers will now have confirmation is the @gmail email is valid or not.
img source = "img/target/example.com/0xDEADBEEF.png"
Yes, target@example.com received our email.
We don't know where he was, what tool he used and anything more.
ID: the nose did not occur naturally, how would we wear glasses otherwise? (apologies to Voltaire)
is a monopoly tightening its grip on the market it monopolizes.
"A door is what a dog is perpetually on the wrong side of" - Ogden Nash
If I could likely deduce that inside our local software you owned an item with the id 9, I could email you:
Because the request goes out with your authorization cookie it'll executes successfully.
This is why you should only accept post requests for actions that change data and use xsrf tokens (that aren't stored in cookies, local storage, etc).
Yeah. The move is to make things harder for **other** marketers. For the marketer named Google it confers advantages.
Google Makes it Harder for the Competition to Collect User Data
Is this a new change, because after I saw the google announcement, I saw a report that they would share all that data about loading of images with marketers. End result: safer images, but just as much information for marketers, as along as they make nice with Google as 'official' email marketers. Would love to be wrong. Here's my source, Ars Technica article.
http://arstechnica.com/security/2013/12/dear-gmailer-i-know-what-you-read-last-summer-and-last-night-and-today/
"Don't you know you're going to shock the monkey?"- Peter Gabriel
Wrong on one point: Users that click on "show images" will still download the image to Google's servers, letting marketers know if an email is ever opened or not. The one thing the marketer won't get is the user's IP address (because it's Google's cache server downloading the image, rather than the end user).
Actually, this is rather awesome for spam/tracking of "real" addresses.
Before silly users could refuse to load external tracking pixels with unique IDs, assigned to each email.
And now? It's auto-downloaded for everyone. Yay!
While absence of IP address, Referral (if tracking image was loaded via https) and Browser info is sad, "everyone now auto-loads images" waaaay outweighs it :P You won't hide from confirming that email address that easily ;)
Hyperom.com
Can't believe the stupidity of this story.
You mean, like, attachments? Those are part of the email anyway.
Or are we talking about this weird new HTML-email thing I've been hearing so much about? Who even uses that crap. :P
No, because Google will scan the images for viruses and common inconsistencies, then convert them to raw pixel data, using there decoding libraries that don't have these exploits, and then re-encode them into consistent and buffer-overflow-free images, that will work on any old and/or bug-riddled operating system or browser used by the recipient.
I hope google will also re-sacale images when people embed 3000 DPI company logo's in HTML-emails.
If Google is smart, they'll download approximately 1 copy of each image, ignoring the tracking ID in the URL.
"Most successful tech company in the world" suggests that they may in fact be smart.
Tho, im sure they will do the tracking for their own purposes, this will help reduce 'bad things' from questionable sources. As always, its a trade-off.
---- Booth was a patriot ----
pop3 doesn't tell anyone which messages you read, only that you received them.
You make the tracking ID part of the image name. Set up a cgi to always return the same image regardless of what it is called. Use a fake hashed etag thingy so they are always different.
Google has to download the image to see if it is the same, marketing mission accomplished.
e.g. http://examplemarketing.com/images/gjdfkadfdhkhkfdhkdsfhkhfdsqiuqr.gif
Oh. Please send royalties to A.C. @ Slashdot.
Google is a bit late. Facebook has been doing this in posts to its site since day one. It makes a huge difference in
security, privacy and speed... finally.
Yeah, because it will take all of two seconds to generate a unique tracking bug per email address. Two 32 bits pixels offers a lot of data storage.
Marketers will at least know that the user opened the email because the images were loaded somewhere. See MailChimp's post on the subject. This means that you can not longer look at a message even once without the marketer knowing that you did.
GMail will be fetching the images by default but only after the user opens the mail. So it's an improvement because the user's browser and IP address will be hidden (as it will be Google's servers doing the fetching) and it's a step back because it is tracking images will work by default. If you want the old behaviour of not showing images you will need to opt into it so only those who explicitly don't want to be tracked will remain anonymous.
Sources: Wired, Ars Technica
So, presumably they don't actually rewrite the message as such, just change the way it's displayed in the web interface (through an intermediate proxy). Rewriting the message would break all those nice email verification systems, no?
So what about those people using IMAP and not GMail's web interface? Presumably, it's business as usual.
Fact is, if I don't want you to be able to know when I've loaded your images, I won't load your images unless I think they are vital. Which is why my mail-client doesn't download any images by default anyway.
I see this as a good thing - Google are protecting users who are dumb enough to use the web interface for email and rely on it, but not touching anyone who would do things properly anyway.
Of course, the data about the cached access will be sold by Google.
This summary is garbage and complete misrepresents the implications of Gmail's change. (I already researched this last week and developed a solution to avoid cacheing with in-progress email images that might get replaced with final versions)
Every singe email marketing system already uses a unique image URL to identify a given recipient. This is frequently called a "tracking pixel" because it's usually a 1px transparent gif stuck in the corner of an email where it won't be distracting. In fact, this method has been used for web tracking as well for many years. It's how Google Analytics originally worked.
Since these unique images will still get loaded when an email is opened in Gmail, marketers will still be able to track your opens. What they won't see, however, is how many times you re-opened the email. And since the image gets cached and requested through Gmail's proxy, marketers won't get information about your machine like browser, IP address, etc. But if you click-through on a link, or you visited their site before (highly likely if you're on their mailing list) then they have most of that info anyways.
This caching by Gmail is primarily to speed up Gmail since it means images can be loaded and shared on Google's Content Delivery Network which is almost certainly faster than servers owned by the email campaign provider for image hosting.
I'm out of my mind right now, but feel free to leave a message.....
They want to make it harder for those entities to collect that data, because Google wants to collect it and sell it back to them at a premium.
I've done testing with my own emails with a link to my own server. The image is still only downloaded once you view the email. The only thing that is any different is that the request comes from google instead of the user's IP address. This prevents getting or reading cookie data during the image request, but does nothing to prevent image-based tracking of email opens. For image content on non-unique URLs this could mean better loading speeds, but won't do anything to make email load faster for unique images.
Nothing stops them from pre-caching these images in the future, but for now it isn't quite as catastrophic for the email marketers are some article suggest.
-- If it aint broke, fix it till it is. --
From the OP: "Google says the changes are intended to enhance user privacy and security."
I find this lie from google/doubleclick insanely funny yet darkly cynical.
To enhance user privacy and security, don't use services from this huge ad broker which has a small army of lobbyists working Washington to prevent laws that would harness our privacy, and which works with the NSA to rape our liberty and privacy. If you use gmail, you should have no expectations of privacy or security whatsoever. That would be insane. It is everything their prime directive is not - i.e. make money of your privacy.
No, you are completely misunderstanding that article.
Before mail clients stopped loading images by default, it was possible to embed a "web bug" image in an email. Essentially a transparent non-image that is referenced with a unique ID for each user. When the email was viewed, the mail client would request this web bug, and their server could record a) that this particular user opened the email, b) when they opened it, and c) whatever information they could glean from a normal HTTP request - where in the world you are, what software you are using to read the email, what language you have your mail client configured to use, etc.
If at any point you click "Load images", you will be sending this information to whomever sent the email. It's just that by default this would not occur in the majority of mail clients.
Gmail are switching to proxying the images and loading them by default. This means that email senders will get a) and b) by default. You can remedy this by switching your Gmail settings back to the old default of not loading images by default.
However because they are proxying the requests for the images, the people sending emails no longer get access to c) - things like your IP address, location, software, etc.
You seem to have invented some kind of nefarious arrangement between email marketers and Google, but that appears nowhere in the article you link to. It does not describe Google sharing data at all. All the article describes is the fact that by default, email marketers can now get a) and b) by using web bugs - this is something you don't need an agreement with Google to use, it's a natural consequence of the technology in question. It's your browser that shares the data, and it does so by performing a normal HTTP request - this is information you send to each and every website you visit. There's no http://google.com/download-private-data-muhahaha.zip link that email marketers now have access to.
This change improves privacy and has no loss of privacy if you change your settings to not load images by default. If you leave the settings at their defaults, you gain privacy in some ways and lose it in others.
Bogtha Bogtha Bogtha
I'm surprised that everyone is focused only on how this affects advertisers. That might be just a decoy excuse for the modifications.
A far more fundamental change is that Google will now be transcoding all images, which inherently blocks the sender's ability to transmit steganographically hidden information with plausible deniability. I bet the NSA has been requesting Google to do that for ages, as it must have been an extreme headache to have to scan all images just to find the few with a hidden payload. No such payloads now.
Spooks aside, the effect of this on photography will probably be far more dramatic for the general population, since photographers often transmit precisely controlled images. Google's new transcoding means that Gmail is no longer suitable for sending bit-perfect images of known properties or quality, so we're going to have to put our images in archives from now on, which will be a pain to view.
It seems that Gmail is becoming strictly a conduit for advertising. Google is at least consistent in their evil.
The only way this can be of any use is if you trust the servers the images are moving to (tip, don't trust Google). Google has sucked for a long time, this is just more of the same.
[...] I hope google will also re-sacale images when people embed 3000 DPI company logo's in HTML-emails.
I HATE when companies do that. My friend started working for a company that automatically adds a footer to every page. It consists of their logo and some green space to fill the rest of the image (it is set up like this: but the logo is on the left http://candacereese.files.wordpress.com/2011/02/creesej_speaker-footer-e1296867208417.jpg). The logo itself is sized to be 1.5 by 1.5 however, the actual image file downloaded is, no joke, 6000x6000 pixels and is 3MB (I tried pngcrush and it reduced it to 88KB). The green space is a block of solid green 6000 high by 18000 long and is absolutely massive and again pngcrush got it under 200KB. What a waste considering they dynamically size it using css to fill the rest of the page and be 1.5 inches high on the screen and it could literally be a single pixel png stretched out or be colored green by CSS with no image necessary.
You're too fixated on the old definitions of monopoly, which aren't really very helpful in this new world of digital services.
The questions that are asked in an anti-trust investigation aren't about whether the accused meets the definition of "monopoly" but whether the methods pursued by the company block the competing operation of others in the same space.
And what do you think the answer to that question is? Google has an almost complete stranglehold on the Internet advertising sector, and it's not necessarily because of evil actions. They're simply too good at this "free services --> advertising revenue" game, so good that they've captured most of the target audience. It's a monopoly of the black hole type --- it has attracted everything to it and there's no matter left to attract.
When someone has a free Gmail account already, there's almost no possibility of attracting them to your competing services, since you can't compete with free, and that means that your advertising business can't capture the needed eyeballs. It's effectively a monolithic sector now, and it's owned by a single player.
The old definitions don't work, yet the anti-competitive nature of what's going on is unquestionable.
I read somewhere that salesforce did this years ago to allow people to track who actually read emails. I then renamed SalesForce to UsedCarSalesForce as that is a pure scumbag thing to do. I am a huge fan of some kind of privacy law where a company may not collect data that people haven't had clearly pointed out is being collected with the option to opt-in. You will notice opt-in as the operative word. Thus I don't even want my power company being able to sell my data even in aggregate and say that my neighbourhood uses more power than a different neighbourhood. When I see that "trusted-third-parties" thing it just ticks me off.
I'm sure marketeers can still access the data. Only now they'll have to pay Google for it.
They rarely care about the actual person. They only care that their garbage is being read, and therefore potentially sold.
After all, it's not like they care about spamming to the appropriate market. They only care that they are actually sending it to someone (even that's a borderline stretch, but it's the entire purpose of the images).
So do they want privacy or not?
On one hand they're claiming to serve up images by proxy to protect users privacy, on the other hand, they're using Google+ and youtube to force users to display their real name.
We had the issue where Google started forcibly customizing google services for you based on you signing up for Google+. When I signed up a couple years ago, it broke my news archive search, because it would only search news sites in Korea, and in Korean despite having everything in English and my account being created in Canada (I happen to be in Korea). While several months later that was actually fixed, they also went ahead and first removed the insanely useful timeline from the archive, and then just recently killed off the archive entirely, because who could ever want to read news more than 30 days old.
Butchering services, heavy handed user manipulation, my patience with google is quickly wearing thin.
What Google is doing is a complete hypocrisy: will will protect everyone from 'something' but we do exactly the same that we are protecting everyone from.
I sincerely don't see any major advantages for the end user with this move.
What I can see is a major advantage for Google that this way has the data from everyone and doesn't allow anyone else to do exactly the same they do.
If Google does this, why doesn't Google stops collecting data from the emails for ad targetting?? Everyone knows Google looks at emails to gather data for ad purposes. A beacon of a third party email is less intrusive than harvesting data directly from email content. Major hypocrites!
I will consider cancel my email account after this.
What Google is doing is a complete hypocrisy: will will protect everyone from 'something' but we do exactly the same that we are protecting everyone from.
I sincerely don't see any major advantages for the end user with this move.
What I can see is a major advantage for Google that this way has the data from everyone and doesn't allow anyone else to do exactly the same they do.
If Google does this, why doesn't Google stops collecting data from the emails for ad targetting?? Everyone knows Google looks at emails to gather data for ad purposes. A beacon of a third party email is less intrusive than harvesting data directly from email content. Major hypocrites!
Privacy my ass.
Theywill just resell the stats to the marketers.
Google will continue to vacuum everything you do, they are just getting rid of the competition, who will now have to pay Google for the info. No favors here, folks. We benefit inadvertently and temporarily, however.
Because the image URLs are rewritten to the Google proxy server, you can no longer copy and paste rich text with embedded images from Gmail into another application. That really sucks. I need to do that all the time for a certain client. It looks like I'm now going to have to get a separate webmail service just to be able to copy and paste rich text. Ridiculous.
Easily defeated. Google loads a couple of the images, detects they're all the same, detects they're placed the same in the message, determines the pattern and replaces all images fitting that pattern with the first copy downloaded from their cache.
Yeah. I love how everyone in this thread is pointing out obvious problems which are actually not that hard to solve.
It's not like Google's engineers know anything about email, spam or caching servers...
Google gives you the OPTION to either have images AUTOMAGICALLY displayed now or NOT. I still went with not as AFAICT ATT it was across the board and I don't ALWAYS want images displayed in my email when accessed from phone/tablet.