Google Cache Makes Murdoch's K-12 Site Look Obscene
theodp writes "Rupert Murdoch's Amplify Education site is all about the kids, so it's understandable that the site's Terms of Use bans abusive, pornographic, obscene, and vulgar content. But if one uses Google to do a site search of Amplify.com (e.g., site:amplify.com donkey) you may get quite an unexpected eye-opener (redacted, but still NSFW). So, does someone at Amplify really want to "@&^$" your "a**"? Of course not. But this does serve as a cautionary tale of the perils of buying a second-hand domain name when pages of the shuttered site may live on in cache-land. Prior to its conversion to a site for kids' education, Amplify.com was a social sharing product that allowed users to clip favorite sites from the web and add their own commentary. Google does note that removed content may still show up in Google's search results in certain situations (removal requests can be made)."
Update: 04/08 17:04 GMT by T :
Stephanie Chang writes (in a comment below):
"Hi, I’m the editor of Amplify.com. We purchased our domain name in February 2012 and took ownership of the site in July 2012 for use as our company's home page. Prior to that, the domain was used by its previous owners as a social-sharing site. As a result, some old content dating back to the previous domain ownership still shows up as cached on certain search engines. Amplify Education, Inc. did not produce the cached content in question nor do we in any way endorse it. We’re working with Google and other search providers to make sure caches of our site are up to date. In the meantime, we apologize to anyone whose attempts to locate information on amplifying donkeys resulted in a negative browsing experience."
Thanks! I'm looking for suggestions for other good porn sites - preferably free, although I'm not opposed to paying a little if the content is particularly high quality. Good job /.!
And his foray into education is even more so.
If you want to establish your own site after you take it over, always throw a deny-all robots.txt to clear out it's google cache and archive.org entries for a couple of weeks
Rupert Murdoch doesn't want to f_ck your a_s - he wants to f_ck your access to impartial press.
Google's webmaster tools can limit issues like this.
Beware of those who profit off the docile and persecute the unbelievers.
Google's webmaster tools can limit issues like this.
As can wary domain buyers who know to look at a domain's history as part of the valuation.
When the foot seeks the place of the head, the line is crossed. Know your place. Keep your place. Be a shoe.
...if the previous residents of my house liked to decorate the windows with pentagrams? Or do people understand that different people live at the same address at different times?
Look - I know many of you are *philosophically* opposed to copyright. Fine. Whatever.
But put that aside for a moment and take it as a given that copyright exists. I really struggle understanding how it is legal, other than 'google has expensive lawyers' that one private entity can take the contents of another website, store them, and then essentially re-publish them for money (google is largely advertiser supported, even if the adverts dont necessarily directly appear on the cached page). This is especially true when you see here that google's actions can directly reputationally harm one of those whose site is is "caching."
Really, under what theory is this remotely consistent with copyright as we understand it? It passes no test of "fair use" whatsoever.
Notice that my comments aren't in any way related to whether the google caching is useful from a user's standpoint. Of course it is or can be. But, from a copyright standpoint, I have a lot of sympathy with righsholders.
Note also that the argument that "getting your stuff copied" should be an opt-out situation (as in "well, you can always put a robots.txt" or "you can always do steps x y and z") i find weak. this is what in essence we have with the DMCA and youtube - and you see it takes all of an instant from the time that any given video is taken down to the time that it's up again in some other form Rightsholders have to have *full time* people involved in policing sites like youtube.. something just isnt right about that.
If google is indeed providing a useful service to company/site x/y/z, then company x/y/z should be able to welcome such caching by an opt-in robots.txt. the current situation, given how copyright law actually exists in most jurisdiction, is perverse.
...and so is the idea of him doing anything remotely helpful for children. Wanting to "fuck someone's ass" is nothing compared to the man himself.
Whitehouse.com off limits or something? Doesn't fit the narrative/snipe I suppose.
That's why it's amusing. And how long should a cache hold an old site? You want to be able to pull it up years later, but don't want to pull it up years later. Maybe clear on transfer of ownership? But then, how can I refer to the "old" site if the cache is wiped on transfer?
So, if this is a "Google issue" then what's the solution? You'll piss off someone somewhere, no matter what you do. That's not a Google issue, that's an issue with all opinions.
Learn to love Alaska
What is the point of replacing 'fucking' by 'f****ng', 'ass' by 'a**' and 'dick' by 'd**k'? In the context you can still clearly tell what the words being used are.
All search engines have to take a copy of page to run it through their ML algorithms to create the index thats used to serve a users query. And this copy is also used sometimes to generate the content in the snippet.
WebArchive could be a even serious problem here.
Google's cache are temporary. WebArchive aims to be eternal. :-)
Lisias@Earth.SolarSystem.OrionArm.MilkyWay.Local.Virgo.Universe.org
I don't think google cache should necessarily be used for looking into a websites history especially beyond an ownership change when the site is completely different. That's something for the Internet Archive project. I think people should be able to request any previously cached pages be removed (can they already? the notion of pages being removed on request was vague enough that I don't know if it's ona per-page basis or can be per site) and updated with modern content. It doesn't need to be an immediate process, just a queue that Google goes through.
It's like the linking bullshit, we all know that if Murdoch wants to stop Google indexing his propaganda all he need do is fix his robots.txt. Same deal here, the process/facts are irrelevant when you are trying to paint the enemy as an irresponsible pornographer, a brazen thief, a despicable leach, or whatever bad news story he can dream up where Google are trampling all over his delicate sense of entitlement.
And did you exchange a walk on part in the war for a lead role in a cage? - Pink Floyd.
This comment really made me laugh. Thank you.
http://archive.org/about/faqs.php#14
You spell "a***", not "a**". For f***'s sake, learn English!
I hadn't the slightest objection to his spending his time planning massacres for the bourgeoisie... (P.G. Wodehouse)
How long? "Until next page update, or as short as possible after it occurs".
Google Cache is a backup resource for the case the site is temporarily down, not a history resource to show how the site looked like before. If the site goes missing, keep until it's deemed gone permanently. But if the site changes, apply the changes and don't hoard expired pages.
45 5F E1 04 22 CA 29 C4 93 3F 95 05 2B 79 2A B2
Hi, I’m the editor of Amplify.com. We purchased our domain name in February 2012 and took ownership of the site in July 2012 for use as our company's home page. Prior to that, the domain was used by its previous owners as a social-sharing site. As a result, some old content dating back to the previous domain ownership still shows up as cached on certain search engines. Amplify Education, Inc. did not produce the cached content in question nor do we in any way endorse it. We’re working with Google and other search providers to make sure caches of our site are up to date. In the meantime, we apologize to anyone whose attempts to locate information on amplifying donkeys resulted in a negative browsing experience.
And now the fun is over - all animal-related site searches at amplify.com seem to be disabled by google :(
We will, as soon as someone demonstrates that Her Majesty's trivial dialect matters to anyone outside of the Airstrip One.
"resulted in a negative browsing experience"
Bellend