Particle Swarm Optimization for Picture Analysis

Wow by Izabael_DaJinn · 2008-02-03 19:27 · Score: 5, Insightful

I love an article on digital imaging technology that has no pictures. This is 2008. Send out your press release with a photo...of something...anything.

--
Careful What You Wish For....

Re:Wow by Smordnys+s'regrepsA · 2008-02-03 19:57 · Score: 4, Funny

Let me just point out your sig

Careful What You Wish For..
So, did you realize an optimized goatse fits your wish for a picture of "something...anything"?

--
Just -1, Troll talking to another.
Re:Wow by RuBLed · 2008-02-03 20:12 · Score: 0, Troll

Done, here it is, optimized and at high resolution...
Re:Wow by SlashWombat · 2008-02-03 22:30 · Score: 2, Insightful

A pictures worth a thousand words. Strikes me that what they are implying is that take a CCTV (MJPEG/MPEG) and correlating the differing images (frames/fields really). I dont think that manipulation of one CCTV image over and over will ever produce results like that seen on CSI!
Re:Wow by rucs_hack · 2008-02-03 22:44 · Score: 1

I love an article on digital imaging technology that has no pictures. This is 2008. Send out your press release with a photo...of something...anything.

What do you think this is.. /b/?
Re:Wow by Anonymous Coward · 2008-02-04 03:09 · Score: 0

Just keep pressing the 'Enhance' button until the image looks like the guy at the top of your suspect list. It's like magic!
Re:Wow by MobileTatsu-NJG · 2008-02-04 04:51 · Score: 3, Interesting

A pictures worth a thousand words.

Strikes me that what they are implying is that take a CCTV (MJPEG/MPEG) and correlating the differing images (frames/fields really). I dont think that manipulation of one CCTV image over and over will ever produce results like that seen on CSI! Well, c'mon, CSI's trying to entertain an audience here. That said, video does offer some potential for enhancement. I've seen technology that can take a sequence of images, extract the motion out of them, and use that to work out a higher resolution image by watching how the pixels shift in color. There are cases where a car could drive into frame but the resolution is too low to make out the license plate. But if the motion of that car is extracted, and assuming that motion is actually of a useful vector, they can watch how the pixels shimmer and figure out what the color of the pixels in between them was supposed to be. When the image is reconstructed, the license plate could be read.

It's not as magical or practical like they show on CSI, but there are cases where it can be done. Heck, Hollywood uses technology like that to slow down video like the bullet time effect in the Matrix. There's a lot you can do with motion vectors.

--

"I like to lick butts!" by MobileTatsu-NJG (#32700246) (Score:5, Informative)
Re:Wow by Anonymous Coward · 2008-02-04 07:48 · Score: 0

That's funny you mention that because the firefox RSS feed reader actually cut of the article title as 'Particle swarm optimization for picture anal' which immediatly got the clear image of goatse in my head. Damn you internet! *going to look at some tubgirl pictures now*

pics or it didn't happen by Anonymous Coward · 2008-02-03 19:28 · Score: 0

tfa doesn't actually show that this does anything, so all we have is AI people making big promises. Am I missing something out of the ordinary here?

Re:pics or it didn't happen by Anonymous Coward · 2008-02-03 21:03 · Score: 0

Yes, you missed the fact that it was Roland, so you gotta expect that tfa is flawed in some manner.
Re:pics or it didn't happen by Wiseman1024 · 2008-02-04 02:19 · Score: 1

Ha ha ha, you ought to mod parent Funny. ohnoitsroland

--
I was about to say 13256278887989457651018865901401704640, but it appears this number is private property.

The only problem... by arrrrg · 2008-02-03 19:30 · Score: 4, Informative

with PSO, ant colony optimization, genetic algorithms, etc. is that they take tons of computational effort, and typically work no better than (or significantly worse than) much more efficient direct optimization methods. Wake me up if they show good results (esp. that didn't take a year of computer time to construct).

P.S. IAAAIR (I am an AI researcher, albeit not in computer vision)

Re:The only problem... by TapeCutter · 2008-02-03 19:46 · Score: 3, Informative

"the fittest end up at the front of the swarm until a single individual that is the most effectively enhanced"

Actually I think the biggest problem with any of these techniques is finding an algorithmic definition of 'fittest' and 'effectively', the rest can be solved by throwing money at the computation.

--
And did you exchange a walk on part in the war for a lead role in a cage? - Pink Floyd.
Re:The only problem... by Anonymous Coward · 2008-02-03 20:16 · Score: 2, Interesting

Yeah, and as such, you should know that this is just a manifestation of the No Free Lunch Theorem. Basically, they're trying to perfect one case at the expense of the others, resulting in possibly hundreds of poor matches to get one really good match. While this isn't the typical route that I'd try to take, it does have its own interesting applications.
Re:The only problem... by NickBoje · 2008-02-03 20:27 · Score: 3, Informative

You are hundred and one percent right. PSO works mainly with the help of two arbitrary coefficients which are highly oscillatory. Main effort is involved in selecting those coefficient values, accurately. Very good technique but very few good applications solved ...,
Re:The only problem... by Mr2cents · 2008-02-03 21:04 · Score: 3, Insightful

I wonder why they call it a "swarm approach". I'm always suspicious toward people using the latest buzzword, especially if what they are doing sounds like "sorting". The interesting part is the criterion they use, not how they sort the images.

--
"It's too bad that stupidity isn't painful." - Anton LaVey
Re:The only problem... by Anonymous Coward · 2008-02-03 21:08 · Score: 1, Interesting

True, but only for problems where "efficient direct optimization methods" are known. If you have a high dimensional search space and a multimodal objective function or say a multi-objective optimization problem - what then?
Re:The only problem... by somersault · 2008-02-03 21:26 · Score: 3, Funny

Yep because it makes lots of financial sense to have a few supercomputers plugged into your TV so that you can get your contrast setup correctly..

--
which is totally what she said
Re:The only problem... by TapeCutter · 2008-02-03 21:32 · Score: 1

How did you get 'makes lots of financial sense' from what I said?

--
And did you exchange a walk on part in the war for a lead role in a cage? - Pink Floyd.
Re:The only problem... by corgi · 2008-02-03 21:55 · Score: 1

From the synopsis: " Despite its potential it relies on only simple mathematics and does not need powerful computers to run, which means software applications based on PSO would not be limited only to academic researchers and those with access to supercomputers."
There is an excellent treatise on a mathematical foundations of PSO in a book Fundamentals of Computational Swarm Intelligence by A.P. Engelbrecht.
Re:The only problem... by should_be_linear · 2008-02-03 21:59 · Score: 1

Solomon's problem R112 of vehicle routing with time windows (VRPTW) has best solution found using Ant Colony (ACO) algorithm. You can bet researchers thrown *all* known algorithms (Tabu, annealing, genetic, ...) at it, still Ant wins. In other instances ACO has similar results to other algorithms while, from programmer perspective, is much simpler and more elegant.

--
839*929
Re:The only problem... by kvezach · 2008-02-03 22:07 · Score: 1

Yeah, that's why NASA uses genetic algorithms for antenna design instead of doing it manually, and estimates that it takes less work.

Err...
Re:The only problem... by rucs_hack · 2008-02-03 22:56 · Score: 2, Informative

with PSO, ant colony optimization, genetic algorithms, etc. is that they take tons of computational effort, and typically work no better than (or significantly worse than) much more efficient direct optimization methods. Wake me up if they show good results (esp. that didn't take a year of computer time to construct).

Oh god, not another 'Bayesian methods for everything' guy..

Genetic algorithms have major advantages over other approaches. When designed well they are easy to code, and they can get tasks done as well as, if not better then the alternative techniques. I have a GA that can outperform a neural network on a particular task (not all tasks, just one very hard pattern recognition task, not going into it though, that would result in too long a post). It outperforms NN, and is so much simpler you wouldn't believe it. I was shocked to discover how much better it was, and I wrote it.

As for computational effort, well duh..

If it wasn't a task that needed a lot of computational effort, it would hardly be interesting, probably it would be in P or something.

ACO does tend to take a while, but in my experience, most really interesting GAs can take weeks to complete a single run. As a rule what your after is the finished result, and the time taken, provided it doesn't run for more than a few weeks, is usually not much of an issue.
Re:The only problem... by somersault · 2008-02-03 22:56 · Score: 1

Just saying that it isn't something that's worth throwing money at (unless possibly it's done on the broadcaster's side rather than the viewer's side). Maybe you're talking about throwing money at finding an algorithm in the first place, but it's going to need buttloads of computation time to run the process even after you've found a suitable algorithm. Maybe once we all have 5000 core computers then this will be a worthwhile use of computation time (because there's nothing else to use it on), but at the moment it seems a very inefficient way of doing things (something that happens a lot in AI IMO, for limited applications you can often get the same results by using much simpler algorithms, you often don't need stuff like neural networks and swarm optimisation - just because you can use them, doesn't mean you should).

--
which is totally what she said
Re:The only problem... by tooyoung · 2008-02-04 02:45 · Score: 1

PSO is a hill climbing algorithm that involves a population of climbers attempting to find the best outcome of an evaluation function. PSO differs from some other types of hill climbing algorithms in that after each iteration, the population converges upon the current highest ranked individual. The idea is that by moving through the search space towards the current best value, you may inadvertently stumble upon the optimal solution. In essence, the population is acting like a 'swarm', by constantly moving towards the best known solution.
Re:The only problem... by ceoyoyo · 2008-02-04 03:34 · Score: 2, Informative

Two excellent points about why you wouldn't want to apply a GA to photography, one yours and one mine.

Mine first: you're right, GAs are easy to program, once you know the selection criteria. How do you have the computer select the best looking photo? Photoshop has for years had a feature where the computer will supply some altered images and let YOU pick the right one, but how do you give the computer a sense of esthetics?

Yours: GAs are great for finding finished products that you can then use. Both GAs and neural networks can turn up interesting algorithms etc. But they're not great algorithms for getting things done. I don't want to have to apply them to each photograph, I want to have them produce an algorithm or trained network or something of that nature that I can in turn apply to each photograph.
Re:The only problem... by serviscope_minor · 2008-02-04 04:47 · Score: 2, Insightful

IAAAIR...

Oh god, not another 'Bayesian methods for everything' guy..

I know the type, but...

I have a GA that can outperform a neural network on a particular task

Really? Sounds unlikely to me, because a NN is a function which maps inputs to outputs (sigmoid, sum, sigmoid, sum,...) and is often, but not always optimized with gradient descent. A GA on the other hand is an optimization algorithm. You could optimize an NN with a GA if you wished.

Either way, a mapping function (eg an NN) is not really comparable to a method op optimizing a mapping funcion (eg a GA).

--
SJW n. One who posts facts.
Re:The only problem... by caffeinemessiah · 2008-02-04 04:56 · Score: 1

PSO differs from some other types of hill climbing algorithms in that after each iteration, the population converges upon the current highest ranked individual. In essence, the population is acting like a 'swarm', by constantly moving towards the best known solution.

This is *exactly* the problem with this branch of computational intelligence, stuff that you see at any CI/AI conference. PSO is a minor variation of stochastic hill-climbing -- it's a friggin heuristic. There is no guarantee that it will perform better than any of the myriad other "biologically-inspired" heuristic out there, including ACO, GAs, ESs, etc. See the No free lunch theorem.
IMO (and as an earlier poster said I am an AI researcher, albeit not in CV), the whole field is trying to wean off a hangover from '89, when genetic algorithms blew people away with the fact that they were "biologically-inspired" and actually could do optimization. Now every minor heuristic variation of stochastic search has to have some cutesy Disney-like "biologically-inspired" back story -- why?? Just move on, damnit, or try to prove something about your heuristics (then again, see the NFL theorem).

--
An old-timer with old-timey ideas.
Re:The only problem... by elmartinos · 2008-02-04 06:15 · Score: 1

You are right that a specialized optimization usually produces better results than a PSO. But there are many cases where it is very time consuming to develop such an algorithm when it is good enough to just use the plain PSO. Compared to other metaheuristics, PSO does not not need lots of fitness evaluations, and it is very robust because it has few parameters. When it is good enough to use a simple off-the-shelf PSO, why develop a specialized optimizer?

--
Open Source Alternatives
Re:The only problem... by Anonymous Coward · 2008-02-04 06:24 · Score: 0

Yeah, that's why NASA uses genetic algorithms for antenna design instead of doing it manually, and estimates that it takes less work.

Some people at NASA have tried used GA for antenna design and had good results, however, the antennas used were all designed manually. You imply that NASA uses GA for all antennas, which isn't even slightly true.
Re:The only problem... by chthonicdaemon · 2008-02-04 07:44 · Score: 1

I have read several sources that say that PSO (and other stochastic algorithms) are the last resort -- what you throw at problems when they don't seem to be working any other way. Specifically, if you have no derivative information available or the derivatives are misleading, when you have interestingly shaped feasable regions or if you have many local minima, I think that PSO wins out on the 'total time to initial acceptable solution' criterion. Of course, if you are solving very similar problems repeatedly, PSO is not a good choice, but there are good reasons for using PSO. It's not just a fad.

--
Languages aren't inherently fast -- implementations are efficient
Re:The only problem... by dfn_deux · 2008-02-04 08:31 · Score: 1

Gah my mod points expired! Someone mod this up... NN!=GA

--
-*The above statement is printed entirely on recycled electrons*-
Re:The only problem... by Anonymous Coward · 2008-02-04 09:29 · Score: 0

He's probably saying that Backpropagation was less efficient than GA.

Posted anon to keep my mods.

Just wondering by Smordnys+s'regrepsA · 2008-02-03 19:30 · Score: 2, Insightful

The researchers have developed an iterative process where a swarm of images are created by a computer. These images are 'graded relative to each other, the fittest end up at the front of the swarm until a single individual that is the most effectively enhanced.'
Um... if the computer knew how to tell a good picture from a bad, couldn't it have just created a good picture in the first place? This all seems rather useless/confusing to me.

--
Just -1, Troll talking to another.

Re:Just wondering by Radish03 · 2008-02-03 19:39 · Score: 2, Interesting

It can tell good picture from bad, but it's completely relative. Sure, it can come up with a picture that's better than the original. But that's by no means the best it can do. This process continually attempts to create pictures that are better than the previous picture, apparently repeating this process until an image is found where any adjustments to it result in images that are worse in quality. Then that one is selected as the best version.
Re:Just wondering by jodelek · 2008-02-03 19:51 · Score: 0

Why are you so sceptics? Try to look at it from completely another point of view... eg. the USA's government can use that algorithm to "make-up" UFO photos... Maybe in the nearest future we will see monsters crawling on Mars's surface...
Re:Just wondering by Smordnys+s'regrepsA · 2008-02-03 19:54 · Score: 1

There has to be a basis for judgment! If it can judge a good picture from a bad picture, than it has to know *specifically* what makes that picture better. Why not use that knowledge to jump to the best picture (that it can define) from the first picture, instead of picking the best picture from thousands of pictures that are randomly created from the original? I'm saying it seems like they're doing things the hard way.

--
Just -1, Troll talking to another.
Re:Just wondering by TuringTest · 2008-02-03 22:02 · Score: 4, Insightful

it has to know *specifically* what makes that picture better. Why not use that knowledge to jump to the best picture (that it can define) from the first picture?
Because the algorithm doesn't have that kind of knowledge. In AI-based search we don't know how to define absolute functions of quality, but we know how to define (several) relative dimensions of improvement. (Disclaimer - I do this for a living).

Intelligent search is based on iteratively improving one of those dimensions, just a little bit, one at a time. This goes on until we find a solution that is as good as we can get in all dimensions at once; but we simply don't know how to combine all dimensions to create a formula that maximizes all them, because their relative improvements interact with each other in complex, chaotic ways.

--
Singularity: a belief in the "God" idea with the "demiurge" relation inverted.
Re:Just wondering by Anonymous Coward · 2008-02-03 22:34 · Score: 0

Because it's a lot easier to verify than compute? For example, it's a lot quicker to verify that factors result in a prime than it is to compute those factors.

Optimization algorithm by Anonymous Coward · 2008-02-03 19:34 · Score: 0

Is this something like an advanced version of the Metropolis-Hastings algorithm? I could certainly imagine Monte Carlo methods being employed in these types of analysis procedures.

Re:Optimization algorithm by CmdrGravy · 2008-02-03 21:55 · Score: 2, Funny

A first pass analysis certainly reveals some elements of Metropolis-Hastings may have been folded in but they do not comprise the entirety of the final solution which seems instead to be bulked up by a n'th pass reverse locality filter feeding off a more traditionally schwelpian treatment of the core triplets. Interestingly every fifth haynes cosignatory node seems to be commulated back to it's quatenary closest fit counterpart.

not a good idea by ILuvRamen · 2008-02-03 19:35 · Score: 2, Interesting

I've seen what Photoshop CS3's auto levels function does to some photos. It gets it right most of the time and when there needs to be little adjustment, it makes a little one and for really bad ones, it makes big adjustments. You could say it's judging the quality of the input image. Well it's right about 75% of the time. When it usually gets confused is when a picture is supposed to look significantly reg, green, or blue and it has no way of knowing that so it screws it up horribly while trying to tone it down. So I'm figuring any automated system won't possibly be remotely as smart as a human when it comes to sorting the "best" photos out front. It will probably just have a huge bias towards certain types of images.

--
Google's Super Secret Search Algorithm: SELECT @search_results FROM internet WHERE @search_results = 'good'

No good heuristic by randomc0de · 2008-02-03 19:36 · Score: 2, Interesting

This procedure sounds like it has the same problem as plain-old AI search - the lack of an obvious heuristic. The article says they use the number of pixels on an edge, but there's no obvious way of finding this - they've moved the computation up one step. The article is light on details so I'm sceptical. If they have a simple procedure for the fitness function, this is a great application.

--
Three rights make a left. Freedom of speech, freedom of the press, freedom of assembly.

not a good idea-poster child. by Anonymous Coward · 2008-02-03 19:52 · Score: 0

"It will probably just have a huge bias towards certain types of images."

So my handsome mug will look good in Photoshop?

yeah... by cosmocain · 2008-02-03 20:02 · Score: 2, Funny

This looks like a clever concept even if I haven't seen any results.

Hell, this needs no comment, it's funny on its own. Mod TFB +1, accidently funny.

Pics? by soulfury · 2008-02-03 20:02 · Score: 1

I'm currently googling for pics, but nothing comes up except for similarly-worded pages. Please post URL (via Coral) if you find one.

Not exactly comprehensive by vikstar · 2008-02-03 20:10 · Score: 5, Informative

For more detail, including the citation of the paper, see this http://www.primidi.com/2008/02/03.html

--
The question of whether a computer can think is no more interesting than the question of whether a submarine can swim.

Re:Not exactly comprehensive by timthephoto · 2008-02-03 22:37 · Score: 1

why is the world messing about trying to extract data clearly not present? most "decent" camera models are only 12bit anyway & use a bayer filter on a ccd sensor sigma released a camera last year without bayer stupidity, better to wait for the rest of the manufactures to catch up when we get full frame foveon sensors there will be no need for all this gueswork will there?
Re:Not exactly comprehensive by ceoyoyo · 2008-02-04 03:41 · Score: 1

Foveon isn't magic. You basically get triple the resolution in the sensor. Of course, the sensors are harder to make so for the same price you usually get about a third of the photosites....

Bayer interpolation works very well. There is no missing information.
Re:Not exactly comprehensive by timthephoto · 2008-02-04 04:15 · Score: 1

obviously there is missing info, but it's already guessed into place by the algorithms within the camera. I have been most disappointed with the output of my canon camera. when one considers that a 100iso/ASA film frame has the equivalent resolution of a 60meg sensor it is possible to know just how poor digital photo's are presently. it is so typical that the fans of everything digital will accept such poor quality compared to "old tech" it started with CD audio being 44.1k instead of the 2 x 64k considered broadcast quality in 1957, and has got steadily worse from then on.
Re:Not exactly comprehensive by ceoyoyo · 2008-02-04 06:51 · Score: 1

You have to use some pretty creative accounting to get a 35 mm film equivalent to be 60 megapixel. Most experts (and my own experience with both) put a 6-10 MP Bayer sensor as approximately equivalent in resolution to a good quality color negative film.

If you're so dissatisfied then you should probably use film.

Simply not enough information by TropicalCoder · 2008-02-03 20:15 · Score: 1

Tantalizing - but not enough to go on, so it is pretty much useless. I found the abstract here but it does little to elucidate the article.

Nothing new to see here by kegon · 2008-02-03 21:00 · Score: 3, Informative

They've reinvented genetic algorithms ?

Without seeing the details (read TFA but it's a summary and quite a bad one at that), I can't see why this would be better than a Bayesian optimisation with a photometric constraint. "The objective of the algorithm is to maximize the total number of pixels in the edges" sounds very, very simplified.

There are efficient ways of solving these things. Interesting that they invent an image processing algorithm but publish it in a non image processing journal - I wonder why that is ?

Re:Nothing new to see here by varaani · 2008-02-04 02:50 · Score: 1

PSO is quite closely related to genetic algorithms, and also Population Monte Carlo type of methods. In a computer simulation it probably doesn't matter all that much whether one "moves" existing particles in the search space or "generates" new ones based on the previous ones.

The choice of the objective function (for example, some kind of Bayesian posterior probability) is surely more important, and unless there is something very special in the structure of the optimization problem, I think that it's a bit silly to publish papers which test various optimization methods for a given problem and conclude that some of them perform better than the others. The results are typically dependent on a multitude of factors that cannot be controlled in the experiments (a significant one is the amount of effort one spends in finding the best optimization parameters), and do no progress science all that much, particularily if one does not have a hypothesis, and preferably also some analysis, _why_ some of the optimization methods are better than others.

By the way, here's a neat paper on the use of PMC in the global illumination (radiosity/raytracing/...) problem.

Nothing to defame here by Anonymous Coward · 2008-02-03 21:07 · Score: 0

"Interesting that they invent an image processing algorithm but publish it in a non image processing journal - I wonder why that is ?"

Well they could've published it on Wikipedia. And we both know how that'd be a boost.

Actual information by Rufus211 · 2008-02-03 21:10 · Score: 1

Erm, anyone have a link to anything that's actually worth reading, not a short press release? You know, maybe with some PICTURES of their image processing...

Bullshit FTA by EdIII · 2008-02-03 21:40 · Score: 4, Interesting

While reading the article I came across:

However, none comes up to the standards of the kind of image enhancement often seen in fiction, where a blurry distorted image on a screen is rendered pin-sharp at the click of a mouse. PSO, however, takes image enhancement a step closer to this ideal.

Unless I am REALLY missing something, it is next to impossible to go from a blurry distorted image to pin-sharp. Really close to impossible. It is a matter of data. If you start from blurry, you cannot actually obtain the information required to unblur it. It does not exist. Therefore, any results are fundamentally speculative. Contrast Levels are not exactly the same thing, since you are only shifting data already there. Edge enhancement, sharpness, is not actually representative of what the objects actually looked like. There is a big difference between taking a blurry box and enhancing the edges and taking somebodies face and effectively "refocusing" the image so you can see facial features more clearly. You could say this is a step closer and certainly novel approach to the problem. To actually get to science fiction levels of performance may be not actually be possible though.

Such enhancement might be useful in improving snapshots of CCTV quality for identification of individuals or vehicle number plates

Not really useful at all. At least from an evidence point of view. Since you cannot really be sure if that is the individual in the picture, the best you can approximate is closer to one of those sketches they provide. I'm not being racist, but certain races do look similar. If you took 100 Chinese people for example, and started progressively blurring their pictures, you would start to get pictures that you could not make a distinction between them, much less a definitive identification. So there had better be some corroborating evidence, since it won't take too much of an expert witness to shoot that down. So it would be better to say it could help identify possible suspects, not individuals. Burden of proof, reasonable doubt, and so on.

Another thought, even more concerning, is that if you took those 100 pictures and showed them to a test group that saw before and after shots for each individual, how effectively could they make identifications? What about a test group showed only the after shots? My point being, is that if you are predisposed towards identifying a certain individual you are more likely to do so. In fact, people remember faces in a similar way be exaggerating facial features. I believe it is referred to as face perception. So it might be possible for the human brain to identify, incorrectly, an individual from one of those blurred images. All in all, not solid enough for legal purposes, which CCTV identifications of individuals and license plates are certainly used for.

I could be wrong, but until I see actual pictures, I will have to play the part of the skeptic.

Great idea, and certainly thinking outside of the box, so they deserve respect for their work.

Re:Bullshit FTA by Garridan · 2008-02-03 22:41 · Score: 1

Soon, every surveillance camera video will be enhanced, and we'll see the face of Elvis on every criminal where there was once a blur...
Re:Bullshit FTA by lisaparratt · 2008-02-03 22:43 · Score: 2

Unless I am REALLY missing something, it is next to impossible to go from a blurry distorted image to pin-sharp. Really close to impossible. It is a matter of data. If you start from blurry, you cannot actually obtain the information required to unblur it. It does not exist. Untrue. The information is spread into the pixels over which it is blurred. With the appropriate convolution matrix, you can recover the pinsharp picture. Hackzine covered it lately: http://www.hackszine.com/blog/archive/2008/01/remove_shake_and_motion_blur_f.html
Re:Bullshit FTA by arotenbe · 2008-02-03 22:44 · Score: 1

Unless I am REALLY missing something, it is next to impossible to go from a blurry distorted image to pin-sharp. Really close to impossible. Actually, mathematically, it is completely impossible for most images. This is the same reason that any data compression algorithm must, at least some of the time, produce "compressed" files that are larger than the originals: if they didn't, there would be a many-to-one mapping, violating the pigeonhole principle.

--
Tomato wedge sperm darts that are Republican.
Re:Bullshit FTA by Garridan · 2008-02-03 22:53 · Score: 1

Have you seen the results of what you linked to? Shitty, every one. It goes from blurry, to grainy -- w00! No information (re)gained. There is no free lunch.
Re:Bullshit FTA by jamesh · 2008-02-03 23:00 · Score: 1

Untrue. The information is spread into the pixels over which it is blurred. With the appropriate convolution matrix, you can recover the pinsharp picture

All that article says is that you can make the image clearer. It even says that the zooming in that you see on the crime tv shows is not possible.

If you had a high enough resolution you might be able to apply a convolution matrix to the problem to 're-focus' it, but once you have the image in a digital form with a finite resolution, you can't do that much with it.

And what the crime tv shows do is 'zoom' in, which is different to re-focusing an image. They show an image being zoomed down to about 16x16 pixels and some magic software re-computes it into a high res image. As cool as it looks, it just can't be done. Just think of all the possible images that those 16x16 pixels could represent!
Re:Bullshit FTA by EB+FE · 2008-02-04 01:29 · Score: 2, Informative

The claim of TFA is not that their algorithm can take one blurry image and generate a less blurry image. The algorithm uses a series of pictures of the same subject (I assume something similar to bracketing exposures) and use the data from most of those images to sharpen edges in the image that already contains the most clearly defined edges. Imagining how this works is pretty simple. Suppose the best image has an edge that appears to be on pixel columns x and x+1 and those pixels have luminance values a and b. Another image of the same subject may have the corresponding edge on columns x+1 and x+2 with lum vals c and d, and so forth... Then, it's a matter of combining the information from the pixels to produce a better edge. Fair results could probably be achieved with something like a weighted average, but I'm sure the algorithm the researchers in the article use is better.

--
Vital papers will demonstrate their vitality by moving to where you can't find them.
Re:Bullshit FTA by Hellsbells · 2008-02-04 01:52 · Score: 1

If you start from blurry, you cannot actually obtain the information required to unblur it. It does not exist.

You can potentially sharpen parts of an image (to a degree) if the information exists elsewhere in the image. For instance if there are repeated elements in the image (images of text, man made structures, etc...). Human faces are also mostly symmetric.

With CCTV you also have a series of other very similar images to get information from in order to sharpen a single frame.
Re:Bullshit FTA by Chiralhydra · 2008-02-04 02:57 · Score: 1

Unless I am REALLY missing something, it is next to impossible to go from a blurry distorted image to pin-sharp.
Actually it is possible. It has been done to uncover blurred out credit card numbers, for instance. Also, in addition to the methods used in TFA, one can use fractal compression. This matches the 'shapes' in the image to individual fractals, and allows zooming in much further than originally possible without producing pixellation. This is used routinely in the publishing business with low-resolution images, with programs such as this: http://www.ononesoftware.com/detail.php?prodLine_id=2.

So, at least in principle, there's no reason why the software can't do that.
Re:Bullshit FTA by ceoyoyo · 2008-02-04 03:46 · Score: 1

That depends on how the blurring was caused. It IS possible (in theory) to sharpen an image to the point where it's MTF drops to zero. With most imaging systems the MTF drops off slowly, so there's quite a bit of sharpening that's possible. Deconvolution algorithms can work quite well in this area, particularly if you know the MTF. As the MTF drops nearer to zero you get extra noise, of course.

What's not possible is sharpening up to frequencies above where the MTF is zero. Since your imaging system multiplies the frequency spectrum of the true image by the MTF, these areas are multiplied by zero and really don't have any information in them.

CSI style surveillance camera enhancement is impossible, but you can get a surprising amount of additional detail out of a blurry photo with properly applied deconvolution.
Re:Bullshit FTA by Anonymous Coward · 2008-02-04 04:07 · Score: 0

Unless I am REALLY missing something, it is next to impossible to go from a blurry distorted image to pin-sharp. Really close to impossible. It is a matter of data. If you start from blurry, you cannot actually obtain the information required to unblur it. It does not exist. Therefore, any results are fundamentally speculative.
It is sometimes possible to enhance a blurry picture to obtain a deblurred version--it depends on what made the picture blurry in the first place and what you have to work with. If the picture has sufficient resolution and is not too noisy, and you have a good estimate of the point spread function, you can perform inverse filtering to deblur the picture. If the picture is blurred because of insufficient resolution, but you have multiple of them, it may be possible to combine the pictures to obtain one of a higher resolution.
Re:Bullshit FTA by Chris+Shannon · 2008-02-04 06:58 · Score: 1

CSI style surveillance camera enhancement is impossible, but you can get a surprising amount of additional detail out of a blurry photo with properly applied deconvolution. I agree. Check out the deconvolution examples using the Gimp Plug-in Refocus-it which is based on finding the minimum of the error function using Hopfield neural network, or Refocus which is based on a modified form of the Wiener filter, called the FIR (Finite Input Response) Wiener filter. Refocus is conveniently available as a Digikam plugin as well as a gimp plugin.
I've played with Refocus and have had some pretty good results with it, even better than unsharp mask, as the documentation states:
In practice I found that in virtually all cases the results of the FIR Wiener filter were at least as good as those of unsharp masking. The FIR Wiener filter is frequently better in restoring small details.
Unsharp masking is a high pass filter. Blurring, which is a convolution with a Gaussian, is multiplication with a DC centred Gaussian in the frequency domain, a low pass filter. Subtracting that from the original will give you the high frequency components. Unsharp masking adds more of those high frequency components back to the original. The thing about a blurred image is that the high frequencies of interest are by definition of "blurred", not present in the original, so unsharp masking can not restore them.

I've used Refocus for sharpening photographs of text for an OCR, but gocr mostly outputted garbage. I read the article with interest thinking that the fitness function of the particle swarm optimization for blurry scanned text could be the number of English words that are identified when the refocused image is run through an OCR.

--
"Follow me" the wise man said, but he walked behind.
Re:Bullshit FTA by ceoyoyo · 2008-02-04 07:30 · Score: 1

Unsharp masking is a spatial domain algorithm that basically increases local contrast on edges. It's effect might be sort of a high pass filter, but it's not in the nice simple way that a real high pass filter is.

Deconvolution, on the other hand, is a direct high pass filter. With non-blind deconvolution techniques the filter is designed to counter the low pass filter that caused the blurring. With blind techniques you usually pick some likely blurring function (like a Gaussian) and then apply it iteratively until you get the result you want.

Unsharp masking isn't really the right algorithm to compare deconvolution techniques to. It's kind of useful for PR for photo editing software because deconvolution is fairly uncommon in consumer photo editing, but deconvolution techniques should properly be compared to other deconvolution techniques: Fourier division is the simplest, or both maximum entropy and Lucy-Richardson are classic algorithms.
Re:Bullshit FTA by kegon · 2008-02-04 12:13 · Score: 1

You clearly don't know anything about image processing, but hey, this is Slashdot.
I could be wrong, but until I see actual pictures

Seeing pictures would not prove anything. A ground truth comparison is what is required.
Great idea, and certainly thinking outside of the box, so they deserve respect for their work.

No, respect for trying but it doesn't look like more than a small improvement, if that. We have to get hold of this paper and see if the results are presented in an appropriate scientific context, something not implied by the summary.
Re:Bullshit FTA by EdIII · 2008-02-04 13:25 · Score: 1

You clearly don't know anything about image processing, but hey, this is Slashdot.
Your right I don't know much about image processing other then a limited experience with Photoshop filters. However, this being Slashdot, I don't need to know what I am talking about right? Insult received.

I am not claiming I understood image processing either. I am approaching it from a logical, mathematical approach. Especially since they claim that this could be used for surveillance purposes to a legal end. Images that are that degraded in surveillance situations are certainly not the same as the Gimp examples some people post.

The poster that reminded us of the Pigeonhole principle, certainly provides an interesting mathematical foundation to my argument. Much appreciated. Accordingly, the proposed method is creating a multitude of possibilities, and then applying criteria to choose just one of the possibilities.

So I am coming from a simple position that you cannot go from LESS data to MORE data. It just does not work that way. Any data that you "add" is fundamentally speculative and its value is subjective. The very fact that it is subjective should preclude its use in any legal aspect.

If you take MP3's as an example, you cannot reclaim that information that is lost when going from a WAV file to an MP3. That data is lost forever. That is why it can be referred to as a lossy compression scheme.

When that camera is removed from the scene, you have lost the ability, to "encode" if you will, the data that represents the original scene. So all you are working with is a blurred image of a scene. So my thinking is that a blurred image is like the application of a lossy compression algorithm on the scene itself. Progressively taking away data, while maintaining the ability to represent the objects in the scene to a human eye. This can be done at different levels, and using MP3 as an example again, you can have 320K and 64K resultant files. The 320K might represent slightly blurred, while the 64K represents heavily blurred. So with respect to the scene, you might be able to tell there is a tree, but not exactly how many leaves are on it. You might be able to tell there is a person, but not who he/she is. You can see a sign, but not the characters. So the lossy compression's goal was successful. You can tell what objects are in the scene.

Since you seem to be so sophisticated and certainly imply that you understand image processing better then the average /., answer me this please:

If you have a blurred image of face, to the extent that you could not identify it, how is it possible to accurately predict (it is again, fundamentally a prediction, not a statement of fact) what pixels represent a clear image?

I'm really curious and look forward to the reply of a clearly knowledgeable person.
Re:Bullshit FTA by kegon · 2008-02-04 16:20 · Score: 1

Your right I don't know much about image processing other then a limited experience with Photoshop filters. However, this being Slashdot, I don't need to know what I am talking about right? Insult received.

I think it helps to keep things on topic if you don't make a long post speculating about things you clearly don't know. If you know you don't know, then how is it an insult ?

I am approaching it from a logical, mathematical approach

I don't see any logical or mathematical argument in your post. Now you're making an analogy to MP3 compression and adding lots of speculation, that doesn't form a logical argument either. As other posters have pointed out; given suitable knowledge or a model of the physical parameters in the scene it is possible to recover more, real image data. I would argue it is not a prediction if your model is accurate. e.g. after the incident was recorded on CCTV you went back and took some calibration images to measure lens distortion in the video etc.

I think you simply didn't like the fact that I pointed out 2 fundamental errors in your thinking. 1. Seeing the images would be enough to "know" if the algorithm was good - not scientific. 2. The idea was thinking outside of the box - check image processing literature before you make such a bold statement. Plenty of posts above have cited other works that apply.
Re:Bullshit FTA by Anonymous Coward · 2008-02-04 17:31 · Score: 0

You add information to the picture by providing an educated guess of how the blurring occurred (something which isn't too hard to make a decent guess at).
Knowing the output and an approximation of the transformation, we can make conjectures about the inputs (by applying the inverse of the blur function).
Re:Bullshit FTA by amirulbahr · 2008-02-05 12:30 · Score: 1

Unless I am REALLY missing something, it is next to impossible to go from a blurry distorted image to pin-sharp. Really close to impossible. It is a matter of data. If you start from blurry, you cannot actually obtain the information required to unblur it. It does not exist. But if you take another image of the same scene, you just captured some more information. An algorithm can attempt to combine this added information from two or more frames into a single image of the scene which has more information in it than a single frame.
I think the part you are missing is that this is about enhancing a scene using multiple images of the same scene.

This looks like a clever concept even ... by CmdrGravy · 2008-02-03 21:58 · Score: 1

This looks like a clever article alright, even if I haven't bothered reading it.

Can you say boids? by webword · 2008-02-04 01:28 · Score: 1

Totally makes me think of Craig Reynolds's "boids" -- take a look:

http://en.wikipedia.org/wiki/Boids

What's really cool is that boids force you to re-think how you define intelligence, well, at least collective intelligence. It's like watching ants at work. Love it.

--
How to Download YouTube Videos

metaheuristic by wikinerd · 2008-02-04 02:33 · Score: 1

Swarm intelligence is what I research. PSO is not really an algorithm, it is a metaheuristic. Of course when I talk with non-engineers I might also use the terms algorithm or recipe, but I would expect correct terminology on a site whose readership contains a large percentage of CS/EE degree holders.

Photogenetics by Anonymous Coward · 2008-02-04 03:29 · Score: 0

Looks like it works the same way as the old idea from Photogenetics. This time it looks like it involves less human interaction - which can be good or bad.

re by JohnVanVliet · 2008-02-04 08:59 · Score: 1

well i do get a very big laugh from CSI and there " enhanced photos " 1) start with a very good shot 2) degrade it and say it is the orig. 3) show the true orig. image as the "enhanced" one i still like greycstoration , pde_TschumperleDeriche2D , and pde_heatflow2D

--
"I don't pitch OpenSUSE Linux to my friends, i let Microsoft do it for me

FIR Wiener filter for deconvolution by Chris+Shannon · 2008-02-04 08:59 · Score: 1

Perhaps I should elaborate.

The (nonlinear) threshold setting on a digital unsharp mask algorithm cause my high pass filter analog to break down, but otherwise it's valid. So ignore the threshold, for a moment, in the unsharp mask. The implementation of the unsharp mask is in the spatial domain as you said, but (without the threshold) it has a dual in the frequency domain. The unsharp mask uses a convolution of the image with a Gaussian for blurring, followed by linear additions and subtractions. Convolution, addition and subtract all have duals in the frequency domain.

The Convolution Theorem states that convolution in one domain (e.g., spatial domain) equals point-wise multiplication in the other domain (e.g., frequency domain). Taking the Fourier transform of a Gaussian function yields another Gaussian function. The frequency domain Gaussian is a low pass filter. The subtraction takes an all pass filter (the original image) and subtracts the low pass filter (the blurred) resulting in a high pass filter. The high pass filter is then added back to the original, making the transfer function of the unsharp mask = 1+a*High PassFilter(f), where a = amount, and the High Pass Filter(f) = 1 - Low Pass Filter(f).

Unsharp masking isn't really the right algorithm to compare deconvolution techniques to. They are apples and oranges in their implementation, strengths and weakness, and mathematical foundations, but like apples and oranges, they are both used for the same purpose. Wikipedia does a fine job comparing them.

Deconvolution, on the other hand, is a direct high pass filter.
The Refocus program I was referring to uses a FIR filter. If by "direct" you mean implemented in the frequency domain then you won't agree that Refocus uses deconvolution.
"A FIR Wiener filter only uses a limited neighborhood of the source pixels and can be easily implemented as a convolution matrix." FIR Wiener filtering has the following advantages:

Low memory requirements. Only the convolution matrix must be stored.
Ease of implementation. There is no need to do a full Fourier transform.
The transformation is local. The results only depend on a small neighborhood of the original pixel.

The Unsharp Mask and the FIR Wiener filter have all these properties in common.

--
"Follow me" the wise man said, but he walked behind.

Re:FIR Wiener filter for deconvolution by ceoyoyo · 2008-02-04 09:27 · Score: 1

EVERY operation in the spatial domain has a dual in the frequency domain. Usually when you say high pass filter, you mean a particular class of operations in the frequency domain though. Unsharp mask behaves like a high pass filter, and may actually be one in the restricted sense you outlined.

On the other hand, FIR filters, I believe, have direct equivalent frequency domain filters, even if they are actually calculated in the spatial domain, no restrictions necessary. They're pure convolutions, without thresholds and such.

old news by Anonymous Coward · 2008-02-04 10:07 · Score: 0

circa 2006, a single pixel camera prototype using what sounds like a similar method was being developed and subsequently covered by slashdot here http://science.slashdot.org/article.pl?sid=06/10/19/2255239&from=rss. did it turn out that 1 pixel really wasn't enough and what we really need is brute computational force instead?

Mod Parent Funny by Anonymous Coward · 2008-02-04 11:21 · Score: 0

It is a picture of Uranus, not your anus! Good lord, are people moderating without reading now?

Btw, Uranus made me wet my pants! (from laughing)

Fear of the unknown is an amazing thing. by randomencounter · 2008-02-05 03:45 · Score: 1

We've optimized Uranus!

--
Forget diamonds, copyright is forever.

Evolutionary Algo by severoon · 2008-02-05 05:47 · Score: 1

From TFA it sounds more like an evolutionary algorithm than anything to do with swarms. It said the word swarm over and over but didn't actually describe anything to do with them...instead it talked about how to solve the traveling salesman problem.

--
but have you considered the following argument: shut up.

Swarm Sci-Fi by Woogiemonger · 2008-02-05 11:25 · Score: 1

Just a note.. If you want to read some fun swarm-centric sci-fi, pick up Crichton's "Prey", where he writes of simple one pixel cameras injected into the bloodstream, then swarm together to form an eye which acts as a miniature video camera. Among other things, he also writes of how humans are swarms themselves, consisting of tiny little dumb cells that work together to form a supposedly intelligent life-form.

Pics? by karlwilson · 2008-02-05 13:31 · Score: 1

Pics or it didn't happen.

Quote by valentingalea · 2008-02-05 13:46 · Score: 0

"This looks like a clever concept even if I haven't seen any results"
That's what she said!

Slashdot Mirror

Particle Swarm Optimization for Picture Analysis

90 comments