New Algorithms Improve Image Search
bc90021 writes "Electrical engineers from UC San Diego are making progress on an image search engine that analyzes the images themselves. At the core of this Supervised Multiclass Labeling system is a set of simple yet powerful algorithms developed at UCSD. Once you train the system (the 'supervised' part), you can set it loose on a database of unlabeled images. The system calculates the probability that various objects it has been trained to recognize are present, and labels the images accordingly. After labeling, images can be retrieved via keyword searches. Accuracy of the UCSD system has outpaced that of other content-based image labeling and retrieval systems in the literature. One of the co-authors works at Google, where the researchers have access to image collections at the largest of scales."
I remember when we had to go to a gas station and *buy* porn. Now you have computers out there finding porn for you. You kids today have it too easy!
Snarkiness aside, this is pretty cool stuff. I hope to see usable OSS code in a few years. Imagine how cool it would be to query "show me all pics with my daughter and her rabbits" and have it week through the 1000's of digital family photos.
Method of processing duck feet
change the way I search for Natalie Portman p0rn?
Microsoft: "You've got questions. We've got dancing paperclips."
How well does it work for Porn, hopefully it will be able to differentiate between a she-male and female.
The probability is either zero or one, because whether or not the feature being sought is present is a state of nature. It would be more helpful to call this number the confidence that the feature is present.
... was similarly trained to recognise tanks in landscapes. I was doing really well - getting a great score on the fresh images it was presented with.
Then they introduced it to a new batch of images and it fell apart.
Turns out that the initial set of images had all the tanks shot on a sunny day and all the tankless images shot on a cloudy day (or vice versa). It had learned to tell a sunny day from a cloudy day.
Ha ha.
I wish the article would mention more about why it is better than similar techniques that have been proposed in the past. (For example, http://luthuli.cs.uiuc.edu/~daf/papers/WAP-fin.pdf seems similar) For instance, where do they get their labels for the training data? A lot of people have tried using contextual words drawn from surrounding web text to limited success due to noise. It's also questionable how well their techniques can do if they need to pre-build a separate classification for each keyword. Finally, there are words that it seems impossible that they could ever distinguish. For example, 'man' vs. 'woman,' would be incredibly complicated for anything but a human. Where are the details? Oh yeah, it's a news story! Here's a link to the paper http://www.svcl.ucsd.edu/publications/journal/2007 /pami/pami07-semantics.pdf
...or is it something like this?
Yes, personal searches were bound to be the first thing to be mentioned, but what about when others (ISPs, bosses, co-workers) are performing these searches on computers you use? I'm sure most people are smart enough not to do such things at work, but what about pop-ups (you couldn't help getting those kinds of popups while searching for a 'fix' to an app), false matches (boss doesn't view, only flags you if the keyword search comes back positive), etc?
Cool! I can now have my image archive automatically sorted into my relevant categories:
Blondes
Brunettes
Redheads
Anal
Animals
Interracial
Bukake
But what if an image falls under more than category?
I wonder if these students are using this software library?
Nothing for 6-digit uids?
The problem is we all know what's gonna be the first result when searching "Caves on uranus"!!!
--
Great hosting 200GB Storage, 2_TB_ bandwidth, php, mysql, ssh, $7.95
Run this story again when the system can tell the difference between D, DD, and DDD. Bonus points if it can handle "higher" criteria.
long live autopr0n.
Or will this be one of those absurd "hey there's skintones in this picture so we know its teh porno!" algorithms?
I want to delete my account but Slashdot doesn't allow it.
"One of the co-authors works at Google, where the researchers have access to image collections at the largest of scales."
They just don't know the average slashdot user's porn collection yet.
An old AI joke was to call a limited domain version of this a "cat box". The idea a camera with a light on it that comes on whenever it's pointing at a cat.
is The White House.
I hope this help the R.I.C.O. case.
Patriotically,
K. Trout, C.P.A.
Is this really the first real success for that kind of "AI"? I'd rather thought that image classifiers based on neural networks and various other types of classifying techniques had been around for quite some time, and even used in realtime applications like self-driving cars that responded to road signs.
Aren't electrical engineers supposed to be designing hardware? Why are they fooling around with image search and algorithms? Leave that to the computer science department and get back to work.
It would be nice if we could humor these engineers to put as much effort into image spam filters :-P
Sarcasm is the recourse of a weak mind...
--
There is absolutely nothing newsworthy about this. On the contrary, you'll find tons of similar works - mostly as senior year student projects in CS/AI.
a robot challenge that will test robots' vision and language understanding.
the robots/sobots must be able to recognise objects automatically and perform tasks like: get the "star trek" poster or get the blue dry erase marker. the final event will be held at the twenty-second AAAI conference on artificial intelligence in vancouver, canada july 22-26 '07 [taken from ofpblog]
Since a huge % (perhaps most) image searches are for porn, it is probably a worthwhile thing for a search server to quickly classify likely porn as a way to reduce search server loading.
Engineering is the art of compromise.
Self driving cars? Yes, along with time-traveling Delorians and floating skateboards.
Self driving cars have to be, (at least in recent years) an absolute con, just to get grant money. Would you trust technology as stupid as what we have?
I keep waiting for a real image search to be created without the intermediarry step of tagging it with text.
I'll be happy when I can tell the search page "find images like this" and give it an existing picture or a sketch. Tagging is too reliant on the consistant metadata to be useful in a general way. Humans can easily find all pictures of, say, fluffy the cat in a pile of photos from all different sources. Can we teach a computer how to do that without having to wait for it to re-tag images from different sources before it can search?
Still, the better methods we find for tagging, the closer we get to that I guess...
AB HOC POSSUM VIDERE DOMUM TUUM
There is a considerable difference between technology demonstrators and movie props.
Neither tiltrotor transport aircraft nor warp drives are commercially available or in mass production, but they are in widely different categories. Self-driving cars are in the category with tiltrotors, while time-traveling Delorians (other than fixed-rate unidirectional travel, of course) are in the category with warp drives.
I don't trust humans as stupid as what we have on the road, for the most part.
I recall seeing an image search a little while back where people tagged images manually, building up a weighted list of tags. It might be a good idea to use a system like that, to train a system like this. Like the spam filters we all know and love.
No kidding!!! What do you say at this point?
One complaint about this work is that it requires tagging an initial set of images that are needed to train the system. Vasconcelos' work uses the academic standard "Corel" dataset of labeled images but also uses tagged images from Flickr to train the system. Using human computation games like the ESP game for images and ListenGame www.listengame.orgfor audio, collecting data is not as tough as it once was...
a picture's worth a thousand words, where are we going to store all the words for the useless myspace photo's that get archived?
You know, I have one simple request. And that is to have sharks with frickin' laser beams attached to their heads!
It's a little more plausible now that broadband is readily available but this has been portrayed on TV for years. Can you imagine some podunk field office connecting to an FBI database through a dialup and downloading high resolution images until they found just the right one? Then again, that would make for some good entertainment. Detective walks in..."I've got good news and bad news. The good news is we found the killer. The bad news is, he died of old age."
...because my master's thesis, back in 1996, was using neural nets with a fuzzy logic component to identify surface features on Landsat satellite imagery. The algorithm I came up with was even scale invariant.
Guess I should have published and patented...damn...there goes any feelings of validation...
I believe she's out there, somewhere.
they could just upload the "source" picture or data and wait for the remote machine to match it. Then download just the set of matching pics.
Nobody's mentioned Haar wavelets yet? Weird.
Look them up - they're part of OpenCV, and I'm pretty sure it's the same basic principles in action.
you linux faggots will be looking for all that dick sucking porn. how about you fags just die off?
This would be great for deviantArt, as one of the problems is mis-categorising their submissions. If a computer was able to help with that would make finding art of specific subjects/styles much easier.
Please bring it on!
See my art -> http://herbevore.deviantart.com
Self driving cars have to be, (at least in recent years) an absolute con
w s/news.html?in_article_id=393401&in_page_id=1770
VW is doing a pretty good job.
http://www.dailymail.co.uk/pages/live/articles/ne
Slashdot quality declines as the number of hot grits posts decreases. - Provolt's Law, Apr-09-2005
I missed the link, where can I torrent this?
Now I can search for porn stars that look like that girl in my Englsh class!
We're all going to die. i intend to deserve it.
> The system calculates the probability that various objects it has been
> trained to recognize are present, and labels the images accordingly
"Ok, Joe. Let 'er rip on this new test database."
Cock
Cock
Cock
Vagina
Cock
Cock
Hairy armpit
"Oh, cool! The upgrade works and can distinguish it!"
"Nah, wait until you see this!"
Cock
Cock
Cock
Midget with banana split in hairy ass crack with guy eating the banana split without using his hands on the Howard Stern show
Cock
Vagina
etc.
(-1: Post disagrees with my already-settled worldview) is not a valid mod option.
I once read about Google's image labeler, and decided to create a similar program, which would offer the same functionality, with additional features that are not available in Google's toy.
The project does not have a name, it is described on my site - advanced image labeling tool. What makes it different is that besides collecting tags for an image, it also gathers other data about the tagger - age, sex, education, etc. My initial idea was to use it for various studies and establish connections between one's social status and the image labels they provide.
Anyway, my point is that harvesting information about images can be fun, and it can have an impact on fields other than image processing or search engines.
The saddest poem
So many boobs; so little time.
Wow, it uses radar and "lazer" sensor instead of seeing. And it can evade road cones at high speed. That's the definition of safety isn't it?
You should know what a useless toy that is.
When they can trust their car to drive around schools, playgrounds, through ghettos, and New York city streets full of cars stopped in the middle and people behind them expecting them to break the law and go around (in traffic) call me.