Content-Aware Image Resizing

Posted by kdawson on Saturday August 25, 2007 @11:30AM from the got-a-nice-gui-too dept.

An anonymous reader writes "At the SIGGRAPH 2007 conference in San Diego, two Israeli professors, Shai Avidan and Ariel Shamir, have demonstrated a new method to shrink images. The method is called 'Seam Carving for Content-Aware Image Resizing' (PDF paper here) and it figures out which parts of an image are less significant. This makes it possible to change the aspect ratio of an image without making the content look skewed or stretched out. There is a video demonstration up on YouTube."

The paper via ACM by xenocide2 · 2007-08-25 11:38 · Score: 4, Informative

The author's website was pegged serving that 20MB PDF before Slashdot got ahold of it; I doubt it'll survive now. The paper is also hosted by the ACM, if you're a subscriber.

--
I Browse at +4 Flamebait
Open Source Sysadmin
1. Re:The paper via ACM by spydir31 · 2007-08-25 12:09 · Score: 4, Informative
  
  The Coral Cache has it also.
2. Re:The paper via ACM by Anonymous Coward · 2007-08-25 12:23 · Score: 5, Informative
  
  I used a lossy compression algorithm on their paper and got this...
  
  Shrink image:
  Step 1: Run an edge detection algorithm.
  Step 2: Find minimal energy (least amount of edges crossed) path from top to bottom or left to right (graph-cut algorithm).
  Step 3: Remove pixels along that path.
  Step 4: Repeat steps 2 and 3 as necessary.
  
  Extend image:
  Step 1: Run an edge detection algorithm.
  Step 2: Find minimal energy (least amount of edges crossed) path from top to bottom or left to right (graph-cut algorithm).
  Step 3: Insert pixels along that path (interpolated from neighbors)
  Step 4: Repeat steps 2 and 3 as necessary.
  
  Remove objects:
  Step 1: Run an edge detection algorithm.
  Step 2: Mask object by giving its pixels low/negative energy values.
  Step 3: Find minimal energy (least amount of edges crossed) path from top to bottom or left to right (graph-cut algorithm).
  Step 4: Remove pixels along that path.
  Step 5: Repeat steps 3 and 4 as necessary.
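The "Shrink image" steps above can be sketched in Python. This is only a minimal sketch under stated assumptions, not the authors' code: the image is a list of rows of grayscale numbers, the energy is a plain gradient magnitude standing in for "edge detection", the seam search is a simple dynamic-programming pass (swapped in here for the "graph-cut" wording), and the energy is recomputed after every removal. All function names are made up for illustration.

```python
def energy_map(img):
    """Gradient-magnitude energy |dx| + |dy|, with borders clamped."""
    h, w = len(img), len(img[0])
    energy = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            dx = img[y][min(x + 1, w - 1)] - img[y][max(x - 1, 0)]
            dy = img[min(y + 1, h - 1)][x] - img[max(y - 1, 0)][x]
            energy[y][x] = abs(dx) + abs(dy)
    return energy


def find_vertical_seam(energy):
    """Return one column index per row for the cheapest top-to-bottom path."""
    h, w = len(energy), len(energy[0])
    cost = [row[:] for row in energy]
    # Forward pass: each cell adds the cheapest of its (up to) three
    # neighbours in the row above.
    for y in range(1, h):
        for x in range(w):
            lo, hi = max(x - 1, 0), min(x + 1, w - 1)
            cost[y][x] += min(cost[y - 1][lo:hi + 1])
    # Backward pass: start at the cheapest bottom cell and walk upwards.
    x = min(range(w), key=lambda i: cost[h - 1][i])
    seam = [x]
    for y in range(h - 1, 0, -1):
        lo, hi = max(x - 1, 0), min(x + 1, w - 1)
        x = min(range(lo, hi + 1), key=lambda i: cost[y - 1][i])
        seam.append(x)
    seam.reverse()
    return seam


def remove_vertical_seam(img, seam):
    """Drop the seam pixel from every row, narrowing the image by one."""
    return [row[:x] + row[x + 1:] for row, x in zip(img, seam)]


def shrink_width(img, n):
    """Remove n vertical seams, recomputing the energy each time."""
    for _ in range(n):
        seam = find_vertical_seam(energy_map(img))
        img = remove_vertical_seam(img, seam)
    return img


if __name__ == "__main__":
    # A bright two-pixel stripe on a flat background.
    picture = [[0, 0, 9, 9, 0, 0] for _ in range(3)]
    for row in shrink_width(picture, 2):
        print(row)
```

Horizontal seams would work the same way on the transposed image. On the toy picture the flat background columns are removed and the stripe is left alone.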
3. Re:The paper via ACM by Anonymous Coward · 2007-08-25 13:44 · Score: 5, Insightful
  
  I think you've got it except for a small detail in the "Remove objects", which the narrator alludes to around timestamp 4:01 of the video. You might want to add:
  
  Step 6: Extend image to match original size using the previous extend image algorithm
  
  (Of course, I leave the obligatory Profit step as an exercise for the reader).
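The masking and extend steps discussed in this subthread could look roughly like the Python sketch below. It is a guess at the mechanics rather than the paper's implementation: `apply_removal_mask`, `insert_vertical_seam`, and the `penalty` value are hypothetical names and numbers, the image and energy are lists of rows of numbers, and the seam is a list holding one column index per row.

```python
def apply_removal_mask(energy, mask, penalty=-1000):
    """Give masked pixels a very low energy so seams are drawn through them.

    `mask` has the same shape as `energy`; truthy entries mark the object
    to remove. The penalty value is an arbitrary illustrative choice.
    """
    return [
        [penalty if marked else e for e, marked in zip(energy_row, mask_row)]
        for energy_row, mask_row in zip(energy, mask)
    ]


def insert_vertical_seam(img, seam):
    """Widen the image by one column along the given seam.

    Next to each seam pixel, insert the average of that pixel and its
    right-hand neighbour (the seam pixel itself at the right border).
    """
    widened = []
    for row, x in zip(img, seam):
        right = row[min(x + 1, len(row) - 1)]
        new_pixel = (row[x] + right) / 2
        widened.append(row[:x + 1] + [new_pixel] + row[x + 1:])
    return widened


if __name__ == "__main__":
    print(apply_removal_mask([[3, 4], [5, 6]], [[0, 1], [0, 0]]))
    print(insert_vertical_seam([[0, 10, 20], [0, 10, 20]], [0, 2]))
```

One caveat with this naive version: if you insert the single cheapest seam over and over, the search will tend to pick the same spot each time and smear it, so a real implementation would have to choose several distinct seams before inserting.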
nice! by White+Shade · 2007-08-25 11:40 · Score: 4, Interesting

It seems like a little bit of work is left to make it as completely automated as you would need to have it just "always work" on any platform or device, but it seems like they're already working on that...

Other than that though, that's pretty awesome... I'm sure there's more instances where it doesn't look right than what they showed, but it's definitely cool how well it works as it stands!

I can imagine it would be extremely useful for ex-boyfriends or ex-girlfriends; just load up all their photos of them and their ex, wave the magic eraser, and *boom* you don't have to delete all your old vacation shots ;)

I wonder how well it would work for the porn industry too; nice automatic resizing of breasts without ruining the picture! Fetishists will be SO happy! :)

--
ìì!
1. Re:nice! by aliquis · 2007-08-25 12:23 · Score: 5, Funny
  
  I've never understood this hate-your-ex thing. The person was part of your life for some time, but you have decided to hate them and want to erase them from it?
  Better never to get a partner at all, then, if you are going to hate the person once it no longer works.
  
  But then I'm a regular slashdot visitor and don't have any exes, so what do I know.
Whao by Arthur+B. · 2007-08-25 11:59 · Score: 4, Funny

Ths s rly gret !

--
\u262D = \u5350
Gimp! by larry+bagina · 2007-08-25 12:13 · Score: 5, Interesting

Although they demonstrated on Windows, a friend of mine is one of their graduate students and was peripherally involved. He said it was originally developed as a GIMP plugin, but moved to a separate Windows app to show off the realtime resizing, etc. Hopefully they'll release the GIMP plugin? More likely Adobe will write them a check and license it to make sure that never happens.

--
Do you even lift?
These aren't the 'roids you're looking for.
Does Anyone Find It Ironic by szyzyg · 2007-08-25 12:25 · Score: 4, Funny

I find a small irony in the fact that the video is posted on YouTube, a site which stretches and squeezes video to fit into a 4:3 aspect ratio.
Re:Slightly Strange by Nutty_Irishman · 2007-08-25 12:34 · Score: 4, Insightful

I think you're missing the point of their method, which is to provide realistic images during rescaling that aren't corrupted by blind interpolation (equal averaging). In downscaling, it preserves the parts of the image that would lose information (e.g. complex textures, people) while removing the parts that would not (sky, water, sand). Sky, water, and sand will still look like sky, water, and sand whether they're at 1/4 or 10x resolution; people, however, look much different if you try to downscale or upscale them (they would appear blurry and hard to distinguish). The same works in reverse. The sky is still going to look like the sky whether you scale it to 10x or 5x; it would still look natural. Trees, on the other hand, would not. Once you start to scale up the trees, you would expect to see different characteristics: leaves, branches, etc. Any type of scaling up of a tree would make it seem very blurry and unnatural (lacking leaves, branches, etc.), because you cannot create additional information that isn't present in the original image. Therefore, the most natural-looking result comes from enlarging the sky.

It's not perfect, of course. I'm guessing that if you had a picture of two people next to each other, one in a solid-colored shirt and the other in a striped shirt, the solid-colored shirt guy would get skinnier than the striped one when shrinking, and the reverse when enlarging. However, it's a neat idea, and I look forward to reading the paper.
Re:A picture speaks a thousand words... by Fred+Ferrigno · 2007-08-25 12:58 · Score: 5, Informative

It's not removing any more pixels than normal resizing or cropping would, it's just doing it such that the least important ones are removed first. Instead of:

he uic bownfoxjumed verthelaz yelowdog

You get:

Th qik brwn fx jmpd ovr th lzy ylo dog

Which reduces the total size by the same amount, but retains more information than treating every bit of information the same.
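The letter analogy can be turned into a toy Python sketch. Everything here is invented for illustration: the two function names, and in particular the "importance" rule (vowels inside a word count least, spaces count most) are assumptions chosen to mimic the example above, not anything taken from the paper.

```python
def drop_uniform(text, n):
    """Blind resize: remove n characters at evenly spaced positions."""
    step = len(text) / n
    drop = {int(i * step) for i in range(n)}
    return "".join(c for i, c in enumerate(text) if i not in drop)


def drop_least_important(text, n):
    """Content-aware resize: remove the n least important characters."""

    def importance(i):
        c = text[i]
        if c == " ":
            return 2  # word boundaries carry the most structure
        starts_word = i == 0 or text[i - 1] == " "
        if c.lower() in "aeiou" and not starts_word:
            return 0  # vowels inside a word are the easiest to guess
        return 1

    # sorted() is stable, so equally unimportant characters go left to right.
    order = sorted(range(len(text)), key=importance)
    drop = set(order[:n])
    return "".join(c for i, c in enumerate(text) if i not in drop)


if __name__ == "__main__":
    sentence = "the quick brown fox"
    print(drop_uniform(sentence, 4))
    print(drop_least_important(sentence, 4))
```

Both calls return a string four characters shorter than the input; only the choice of which characters go differs.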
My Implementation by The+New+Andy · 2007-08-25 13:35 · Score: 5, Interesting

I thought it was pretty cool, so I made my own version after seeing the video. It obviously won't be as awesome as theirs, but if you want to play around with it, you can get my C source. It is GPL3.
Re:A picture speaks a thousand words... by random735 · 2007-08-25 15:20 · Score: 5, Interesting

while this is technically true, you're also rearranging the relative positioning of those pixels. cropping something out doesn't change the relationship of what is left in the photo (though it may remove critical details).

if you have 3 people in a picture and you crop it down to 2, you've erased a person, but you haven't changed who is seated next to whom. if you use this method and the middle person is erased, you make it appear as though the outer two people were in fact seated next to each other when they weren't.

we are used to the idea that a picture can be cropped (mentally considering what might be just outside the frame). We aren't yet used to the concept that the photo has effectively been cut and pasted together to create new relationships between the objects in the photo (though of course photoshop is getting us there).

to continue your analogy, if we take:
the quick brown fox jumped over the lazy dog

and drop letters, we can create:
the cow jumped over the dog

whereas "cropping" might let us say:
the quick brown fox jumped

I think it's clear that one of these is more misleading than the other, though in both cases you're just removing information. (in one case, some of that information happens to be spaces between letters/words)
Re:Not ready for Prime Time by pclminion · 2007-08-25 16:23 · Score: 4, Insightful

It has nothing to do with edge detection. The algorithm simply detects paths of minimal gradient which lead from one side of the image to the opposite side. This can be used to produce a "pretty picture" which shows the edges -- but this is merely fallout.

They showed what I thought were several realistic photos with complex backgrounds, and the algorithm did well overall, except on structures where people are closely attuned to exact detail -- such as human faces. If we weren't innately wired to process faces in incredible detail, we wouldn't even notice the distortion.

So it's not perfect. Can you show me something in this world that is? And I don't think there has been any mention of "prime time" application, whatever that means.
Re:Great - We can do this, but should we? by vasanth · 2007-08-25 20:47 · Score: 4, Insightful

Your comment reads like a tabloid headline. Just because a technology could be used for negative purposes does not mean that it should not be developed. If your reasoning were applied, we would all still be living in caves.

By your reasoning:
Cars can be used by criminals to travel faster.
A knife can be used to kill.
Electricity can be used to kill.
Computers can be used by the government to collect more information about us effectively.

Is that really what we want?

See the flaw in the logic?