Slashdot Mirror


Google Redesigns Image Search, Raises Copyright and Hosting Concerns

An anonymous reader writes "Google has recently announced changes to its image search. The search provides larger views of the images with direct links to the full-sized source image. Although this new layout is being praised by users for its intuitiveness, it has raised concerns amongst image copyright holders and webmasters. Large images can now easily be seen and downloaded directly from the Google image search results without sending visitors to the hosting website. Webmasters have expressed concerns about a decrease in traffic and an increase in bandwidth usage since this change was rolled out. Some have set up a petition requesting Google remove the direct links to the images."

37 of 203 comments (clear)

  1. Re:does not compute by Georules · · Score: 4, Informative

    More people being linked directly to the high resolution image, but less people actually visiting the website. This isn't really that confusing.

  2. Re:What? by 0123456 · · Score: 2

    In fact, it causes reduced bandwidth usage because you don't have to download some stupid ad-filled (and possibly malware-infested) web page that you don't want to see, the way the old image search did.

    If they don't like it, block any requests with a Google referrer string.

  3. Solves a annoying problem. by stevenh2 · · Score: 4, Interesting

    Some websites use a annoying script that redirects people when they click a image.

    1. Re:Solves a annoying problem. by icebike · · Score: 2

      Yeah, they want you to go their payment page and sign up for unlimited access.

      --
      Sig Battery depleted. Reverting to safe mode.
  4. Re:does not compute by m1ndcrash · · Score: 3, Informative

    It's called hot linking or leeching and it has been a headache forever. You want to show content + ads but your server is used just to pull an image, thus no traffic and high bandwidth.

    Fighting the good fight:
    RewriteCond %{HTTP_REFERER} !^$
    RewriteCond %{HTTP_REFERER} !^http://(www\.)?cyberciti.biz/.*$ [NC]
    RewriteRule ^.*\.(bmp|tif|gif|jpg|jpeg|jpe|png)$ - [F]

  5. I'm Sofa King We Tod Did by future+assassin · · Score: 2
    --
    by TheSpoom (715771) Uncaring Linux user here. I have nothing to add to this but please continue. *munches popcorn*
    1. Re:I'm Sofa King We Tod Did by VortexCortex · · Score: 4, Interesting

      So, your answer is that because google has decided it has the right to redistribute copyrighted images in full resolution in most cases, that everyone else on the web should go to Google and opt out of their caching system? Site owners are in coorperation with google, we like google when they don't do fucked up illegal things... We see thumbnails as "fair use", maybe. We don't mind much as long as the users end up on our site to see the image. Google understands advert revenue funded websites... They are one. So, it's really hard to understand users who want free stuff saying that we have to change our business practices, and maybe not even give them free stuff (or make it harder to find free stuff) simply because a bigger free stuff provider decides they can get away with infringing copyrights of everyone.

      Your solution is not a solution. A real solution will be to address the issues. Hell, maybe while google is processing the images to reduce their resolution and run heuristic matching algorithms for their other-sizes and search terms feature, they can water-mark them with the domain name of the site they downloaded the image from.

      Or, let's simply turn your moronic suggestion on it's ear. Why don't we all just say: Hey Google, If you want the feature to work that way, you needed to GET PERMISSION FROM EVERYONE BEFORE INFRINGING THEIR COPYRIGHTS. Fuck you and your opt-out "let's piss off everyone, then apologize until we get our way", Facebook feature roll-out model.

    2. Re:I'm Sofa King We Tod Did by msheekhah · · Score: 3, Insightful

      Google is acting on feedback from ITS customers. You are not its customers. If you want to protect your context, then do a little research and take care of it. It's not difficult. The information is freely available on the internet how to block hotlinking. And think of the logistics, google will never ask site owners what they want. There are too many of you and not enough of them. That's why you have the option to keep Google off your site. Use it.

      --
      Mark Anthony Collins
    3. Re:I'm Sofa King We Tod Did by stephanruby · · Score: 2

      Hey Google, If you want the feature to work that way, you needed to GET PERMISSION FROM EVERYONE BEFORE INFRINGING THEIR COPYRIGHTS. Fuck you and your opt-out "let's piss off everyone, then apologize until we get our way", Facebook feature roll-out model.

      This was tried before. It just wasn't practical at all. If search engines had waited until they got permission from everyone before they could index everyone's public content, most public government sites, most public newspaper sites, most public personal web sites, etc. would have been excluded by default.

      The advantage and the problem with the http protocol is that it's copy-agnostic. And if you really want to control the dissemination of your content, you better put it behind a wall of some kind. Don't post it publicly and then complain that people/bots made copies of it. That just goes against the nature of the public internet.

      Also, don't imply that you need to set a different opt-out text/xml file for each search engine. If you do an opt-out for Google, it will work equally well for other search engines. And doing a granular optout is actually very little work to do for someone who's trying to make money from their own web site.

      And finally, please don't try to take advantage of the public nature of the public internet and then complain about that very nature. The internet was created for sharing content. There is a reason you're on the public decentralized internet now, and not on the privately centralized walled garden of AOL, Prodigy, or Minitel.

  6. Re:does not compute by rudy_wayne · · Score: 5, Insightful

    If you even read the summary, let alone TFA you'll see:

    "The search provides larger views of the images with direct links to the full-sized source image."

    Yes, I did read TFA. And nowhere does it explain how you can have decreased traffic but increased bandwidth usage. Because it's not possible. Decreased traffic = decreased bandwidth usage.

    Here's the real problem (quote from TFA):

    When people get the full resolution image, they have no reason to click to go to the URL.

    Dear "Webmaster", nobody cares about your shitty website packed full of annoying ads. Get over it already.

  7. Re:does not compute by icebike · · Score: 4, Informative

    Lots of sites put hi-rez images on file, and link to them via a thumb nail.
    The majority of visitors don't request the hi-rez images, at least not all of them.

    But posting a link to a high-rez image can get your bandwidth slammed, serving images, but nobody requesting the web pages. Especially if its porn, or happens to hit the search topic of the moment. Without the ability to serve ads, these websites make no money.

    Of course, if the complainers had an actual clue, they could just put a robots.txt file in their image storage, which Google seems to honor.

    --
    Sig Battery depleted. Reverting to safe mode.
  8. Re:does not compute by miserere+nobis · · Score: 3, Informative

    It isn't as obvious as you make it sound. Scenario 1: Google links to your page. People who want your image click through, your server throws them the whole page plus the high resolution image. Scenario 2: Google links only to your image. People who want your image download just that, your server sends them just that. All else being the same, scenario 2 is less bandwidth, not more, because you'd be serving the same image either way, but in one case with and in the other case without all the other stuff on the page as well. It's entirely possible for it to add up to more, but this depends on how the new search affects people's usage of the results- it requires that more people actually click to view the full-resolution image as a result of the changes. That's a likely, but not necessary outcome.

  9. If It's Copyright That They're Worried About by mk1004 · · Score: 5, Interesting

    IIRC, jpeg images allow header data that includes copyright info. If you don't care about use of the image, leave it blank. If you do, insert the copyright info. Google's bot can look for copyright data and if it finds it, it can link to the original html page. Otherwise, it can give a link for a direct download.

    I think there was something on /. awhile back that talked about some system for the owner to indicate how an image could be used, e.g. commercial, non-commercial, free and so on. Couldn't find it on a quick search, but that might be another option to tell Google how to handle an image.

    --
    I can mend the break of day, heal a broken heart, and provide temporary relief to nymphomaniacs.
  10. Referer Header! by TheNinjaroach · · Score: 4, Insightful

    If webmasters don't want people "stealing" photos without viewing directly on their website, they are more than welcome to instruct their web servers to not display images to freeloaders. Look at the referer header, if the request didn't originate from your site, then don't serve it.

    --
    I went to eat some animal crackers and the box said, "Do not eat if seal is broken." I opened the box and sure enough..
    1. Re:Referer Header! by ShaunC · · Score: 2

      Heh. Some years ago, BlackPlanet.com (basically "MySpace for African-Americans" at the time) actually hotlinked an image on a site of mine into their templating system. It wasn't just random users, this pic was built straight into their publishing platform, meaning tens of thousands of users were selecting this particular image on my server to be part of the theme of their BlackPlanet page.

      Didn't take them too long after my RewriteRule to rehost it on their own server.

      --
      Thanks to the War on Drugs, it's easier to buy meth than it is to buy cold medicine!
    2. Re:Referer Header! by gmhowell · · Score: 2

      Didn't take them too long after my RewriteRule to rehost it on their own server.

      lemonparty.jpg served for those referred from their site?

      --
      Jesus was all right but his disciples were thick and ordinary. -John Lennon
  11. Re:What? by icebike · · Score: 4, Informative

    In fact, it causes reduced bandwidth usage because you don't have to download some stupid ad-filled (and possibly malware-infested) web page that you don't want to see, the way the old image search did.

    If they don't like it, block any requests with a Google referrer string.

    This has been answered in the branch above. You can easily exceed your hosted bandwidth quota (with zero ad-generated revenue) by having a high-rez photo from your site pop up in a google image search, especially in a situation where something you have on file becames the topic of a high number of searches.
    Even if you don't serve that photo normally on your web pages, but simply provide a button or thumbnail to click for the small percentage of viewers that want to see the high-res.

    Most visitors don't click the high-rez button or thumbnail. The few that do, don't matter. Until Google indexes it, then all bets are off.

    Some (failed) web designers only put the high-rez image in, then shrink it into a box via the html IMG tag. (Then they wonder why people complain that their web loads slowly). These guys would see very little difference in this case, unless of course Google sees a surge of searches that just happen to find your Nattily Portman collection.

    --
    Sig Battery depleted. Reverting to safe mode.
  12. Simple by bobthesungeek76036 · · Score: 2

    # cd
    # cat - > robots.txt
    User-agent: *
    Disallow: /
    <crtl-D>
    #

    Problem solved!

    --
    Karma: Bad
  13. If this kind of image mining is a problem by fyngyrz · · Score: 5, Interesting

    If you're running a website with Apache, you can configure Apache to look at the HTTP_REFERER header and see where the web surfer was when they made the request for the image. If they weren't on your website, (or if they don't provide the header, an act to be widely discouraged), just re-direct them to your home page instead of serving the image.

    I would think that other web servers could do the same thing, one way or another.

    For most people, it costs money -- perhaps not a huge amount, but still, real money -- to put up a website and serve content to the world. The expectation, if not agreement, is that you'll look at the site's content on the site.

    The webmaster's position is no more hostile than that of the deep miner: There are expectations, but no promises.

    Google's search goes far beyond fair use, as far as I'm concerned.

    --
    I've fallen off your lawn, and I can't get up.
    1. Re:If this kind of image mining is a problem by fyngyrz · · Score: 3, Insightful

      Not a problem. No header, no pages, and off you go to somewhere else. You don't trust me -- then I don't trust you.

      --
      I've fallen off your lawn, and I can't get up.
  14. Re:does not compute by Dancindan84 · · Score: 5, Insightful

    You used to get traffic actually visiting your site. That meant full page loads, but a lot of that is text which is low bandwidth. You now have less traffic (unique IPs hitting your site), but they're JUST downloading hi-res images which leads to a net increase in bandwidth.

    Also, ads don't have to be shitty and annoying. Slashdot uses ads, and even though I can I don't turn them off because they're relatively passive. Hosting and bandwidth cost money, and a lot of sites rely on small ad revenue to help offset those costs.

    --
    "Always forgive your enemies; nothing annoys them so much." - Oscar Wilde
  15. Re:What? by sjames · · Score: 2

    Because google goes directly to the full sized image, not the thumbnail on the web page. Grabbing the image directly creates no impressions, so the bandwidth burned per impression shoots up.

  16. Re:Be careful, Google by tbird81 · · Score: 2

    You can opt out.

  17. Re:does not compute by drinkypoo · · Score: 2

    Google always offered links directly to the original image, though it did load the actual site in the background. And you've always been able to prevent the direct image links by referer control.

    --
    "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
  18. Re:robots.txt by fatphil · · Score: 4, Funny

    > Google does NOT behave itself.

    It's also a bit dumb. It's been playing my webserver at a variant of reversi for the last 12 months (one of the links at the end of each game is to start a new game, which it duly follows...)

    --
    Also FatPhil on SoylentNews, id 863
  19. Re:does not compute by GrumpySteen · · Score: 4, Informative

    What's going on is fairly obvious if you read the article linked in the sentence "Webmasters have expressed concerns about a decrease in traffic and an increase in bandwidth usage since this change was rolled out."

    The article says nothing about an increase in bandwidth usage. The anonymous reader who submitted the article obviously just made that part up, as anonymous people on /. do, without regard for whether it made sense or accurately reflected the link being given.

  20. I can see both sides by Miamicanes · · Score: 3, Interesting

    On one hand, I think the site owners deserve the traffic. On the other hand, it seems like at least a quarter of the pages end up being dead when I click on them, or redirect to sites attempting to install malware on old versions of Firefox, or seemingly have nothing whatsoever to do with the image that's supposedly there.

    A compromise might be to allow users to open the referring page in context immediately, open the cached page (with live content) after a 2-second delay, and allow users to grab the full-sized image directly from Google's cache after a 10-second CAPTCHA-guarded delay. Then, users would have every incentive to try viewing the page in context, falling back to the cached page if the original page ends up being down/borked/whatever, and being able to grab the cached image if all else fails.

    Going a step further, Google could come up with some free digital watermarking scheme that allows a 48-bit (give or take) payload to be encoded into the image at a user-selected strength (allowing him to balance robustness, file size, and visibility... pick any two of the three).

    The upper few bits (let's say, 4) would indicate the version. Initially, it would be 0001.

    The next 40(give or take) bits would be globally-unique, and allow somebody who knows the value to obtain meta info about you in a sensible manner. If they're all 0, it means you're using a generic permissions watermark that doesn't identify ownership, but simply restricts use.

    The lower 4 bits specify explicit restrictions

    * do not contextually-index
    * do not cache full-sized image
    * do not perform face recognition of any kind
    * do not index for similarity to other images

    A value of "0000" would allow search engines to index the image, unless you restricted them in some industry-standard way via metadata referenced to your unique id. For the generic value with all 0s, 0000 means "go ahead and index this".

    A value of "1111" would indicate that the image, when encoded with a 4-bit watermark, should not be indexed in any way, shape, or form, regardless of future extensions to the standard that might define additional permissions, and regardless of what any indirectly-referenced meta-info might or might not say. Let's call this the "Stop Facebook from Permissions Creep in a GPLv3-like manner" anti-permission.

  21. Sample? by Antony+T+Curtis · · Score: 2

    I think that if I was a photographer, I would be OK with Google caching full quality images as long as they put their own annoying watermark all over it with the URL where the image came from clearly visible.

    --
    No sig. Move along - nothing to see here.
  22. Re:Copyrighted contents ... by Anonymous Coward · · Score: 2, Insightful

    Retard.. things are copywritten automatically when they are published. What you just suggested is - never publish anything online.

    Your post is covered by copyright
    Linux - copyright
    slashdot's html - copyright

    Do you know what copyright is?

    Why is slashdot filled with retards these days.

  23. Re:Copyrighted contents ... by Anonymous Coward · · Score: 2, Informative

    wow. "Retard.. things are copywritten.." ..."Do you know what a copyright is"
    When I try to type copywritten, it get a red underline. My PC doesn't know what copywrite is.

  24. Re:does not compute by Ford+Prefect · · Score: 3, Interesting

    Dear "Webmaster", nobody cares about your shitty website packed full of annoying ads. Get over it already.

    If someone clicks the Google Image Search 'high-resolution' link for one of my photos from Flickr, they get a medium-resolution version with no description, attribution or copyright information. (Example search page here.

    If they go to the ad-free Flickr page, they get links to much higher resolution versions, associated images and also get informed that it's under a super-open Creative Commons Attribution licence.

    --
    Tedious Bloggy Stuff - hooray?
  25. Re:does not compute by Stiletto · · Score: 3, Insightful

    What's "Bing"?

  26. Re:does not compute by Stiletto · · Score: 2

    So, use robots.txt to remove yourself from their search listings. Problem solved.

  27. Re:does not compute by fluffy99 · · Score: 2

    Really the same issue webmasters had with deep-linking where Google sends the searcher straight to the page they wanted without having to wade through the front end of the website. And yes, the same mitigation techniques such as robots.txt and refferring block apply with the same drawbacks of those searchers not bothering with the site that's making things more difficult for them.

  28. Re:Be careful, Google by BradleyUffner · · Score: 2

    Yes, and the folks on slashdot are really big on opt-out instead of opt-in... ..oh wait.. no they fucking arent. The folks on slashdot fucking hate opt-out, and rightly fucking so.

    Posting your content on a publicly accessible URL IS opt-in.

  29. Re:What? by Anonymous Coward · · Score: 2, Insightful

    I'd go further than this, honestly, I'm sick of people whining about this sort of thing.

    The internet was created for one purpose - information sharing, if you don't want your information shared then get it off the web, otherwise don't cry when it is shared.

    Yes that may mean there's a cost to you, in terms of hosting, but that's part of what the web spirit always was - that people share information for free at their time and expense, or as part of their employment (i.e. academics sharing data).

    I'm sick of these people who believe they have a god given right to make money from the web and deserve legal protection as such. I'm not saying you shouldn't be able to make money, but making money should be upto you to figure out without expecting the whole of the purpose and intent of the web and it's design to revolve around what you want.

    Booohooo, people can link to content on your site. Get over it, that's how it was designed, that's how it was meant to be, don't like it? Then stick your content behind some passworded paywall or whatever, if it's on the public web it should be fair game, that's the whole point of it. It's the same as the newspapers whinging about Google quoting and linking their content - again, Google is doing nothing wrong, it's using the web EXACTLY as it was intended, if they don't like it they should get off the web and see how that suits them.

  30. Re:Copyrighted contents ... by alexo · · Score: 2

    Retard.. things are copywritten (sic) automatically when they are published.

    Why is slashdot filled with retards these days.

    I think you got your answer.