Slashdot Mirror


Automatic Image Tagging

bignickel writes "Researchers at Penn State have applied for a patent on software that automatically recognizes objects in photos and tags them accordingly. The 'Automatic Linguistic Indexing of Pictures Real-Time' software (catchy name) trained a database using tens of thousands of images, and new images have 15 tags suggested based on comparisons with objects or concepts in the database. Not sure how you identify a 'concept,' and they're only talking about having one correct tag in the top 15, but still cool."

123 comments

  1. Not shockingly... by goldmeer · · Score: 4, Funny

    The vast majority of the images on the internets including The Google include "Pornography" in it's top 15 tags suggested. The accuracy rate is surprisingly high.

    1. Re:Not shockingly... by benplaut · · Score: 0, Redundant

      lot of people and boobs-> porn site advert :)

    2. Re:Not shockingly... by ObsessiveMathsFreak · · Score: 1

      I don't know how to define pornography, but I know how to tag an image with it when I see it.

      --
      May the Maths Be with you!
  2. That Sucks by JerkyBoy · · Score: 2, Funny

    Researchers at a publicly funded institution are using their research results for personal (financial gain). Pennsylvania's tax dollars at work? How is this legal?

    --


    Always do right. This will gratify some people and astonish the rest. -- Mark Twain
    1. Re:That Sucks by Skater · · Score: 0

      RTFA...it's not very long. "The university has applied for a patent on the invention." Any other conclusions you'd like to jump to?

    2. Re:That Sucks by Anonymous Coward · · Score: 0

      Universities shouldn't be _able_ to get patents. It's double taxation, plain and simple.

    3. Re:That Sucks by Nybarius · · Score: 2, Informative

      Contrary to what you might believe, there is nothing unethical about making money. The government even gives out grants for entrepreneurs, and lets them keep all the profits; it's good for the economy, overall. The profit motive is a much more powerful incentive to positive social change than the goodness that lies in the hearts of men,

      -Nyb

    4. Re:That Sucks by Anonymous Coward · · Score: 0

      How is that double? They still charge tuition, and unless they *enforce* the patent you haven't paid a dime.

    5. Re:That Sucks by Gemini_25_RB · · Score: 1

      Double taxation in what sense? Okay, maybe some public money goes into the research fund (moreso at public universities than private), but how much? I was under the impression that collegiate level research was difficult because the professors are required to obtain their own funding through corporate sponsorships, grants, and private donations. Now the case _for_ patents. I believe that it is important that universities be able to patent their developments, otherwise someone else will. It simply becomes a free-for-all among the corporate world; the first company to discover the piece of research gets the prize!

    6. Re:That Sucks by Manchot · · Score: 1

      Just because the institution is partially publicly funded doesn't mean that the research is. As a matter of fact, at many public universities, the big research groups have the "opposite" of public funding. As an example, use the University of Illinois. In theory, they're a public university. In practice, they get 20% of their budget from the state. This means that the big research groups in the College of Engineering, some of whom bring in millions of dollars a year, can end up pay up to 50% in taxes to subsidize the other departments. If you didn't allow professors to get patents, that's equivalent to telling them that they can't use their research results funded by their money. The result would be that at least some of the best ones would stay in the private sector.

    7. Re:That Sucks by megaditto · · Score: 1
      Penn State is a private school (privately chartered by the Commonwealth) despite what the name implies. It also receives less than 5% of it's funding from the state.

      from http://www.psu.edu/ur/about/character.html
      Today Penn State is one of four 'state-related' universities (along with the University of Pittsburgh, Temple University, and Lincoln University), institutions that are not state-owned and -operated but that have the character of public universities and receive substantial state appropriations.


      Interestingly, Penn State offers all residents in-state tuition at a loss, without being adequately compenstated by PA.
      --
      Obama likes poor people so much, he wants to make more of them.
    8. Re:That Sucks by sprocketbox · · Score: 1

      I think how well Penn State is compensated by the state can be calculated in lots of different ways. I'm sure Penn State benefits from things like state maintained roads, sewer systems and other general public infrastructure. As an educational institution there is some chance that they don't pay property taxes which means that they don't pay in to the funds that pay for that same infrastructure.

      All that said, I think software patents suck, no matter who is doing the patenting.

    9. Re:That Sucks by tomstdenis · · Score: 1

      Hint: They do enforce patents, that's why they get them.

      There is a way to document prior art. It's called a journal.

      These are not defensive patents. They're designed solely to make the uni money. The problem? They made those patents with your tax dollars [at least the public unis]. So you have a public institution telling you that you can't use an algorithm or idea because they took YOUR money and patented it.

      Tom

      --
      Someday, I'll have a real sig.
    10. Re:That Sucks by megaditto · · Score: 1

      Agreed. I was merely addressing GPs point about Penn State being a public school.

      Regarding the compensation levels, Penn State receives less than the other 'state-affiliated' private universities in PA (though my info is somewhat dated).

      And I agree with your statement about the software patents, though I think .edu's generally allow the public their free not-for-profit use. Think about FreeBSD's origin at the University of California at Berkeley.

      And EULAs suck, but linux has one!

      --
      Obama likes poor people so much, he wants to make more of them.
    11. Re:That Sucks by Anonymous Coward · · Score: 0

      Only when the profit motive is in-line with positive social change; but often it is not, which is why (say) organized crime, rape, etc. exist. Perfectly rational acts in a world where what's good isn't necessarily rational.

    12. Re:That Sucks by Anonymous Coward · · Score: 0

      What you just said, by definition, makes it a public school.

    13. Re:That Sucks by Anonymous Coward · · Score: 0

      The fallacy of your argument is that there is a differential - a group of a few people get control over the product of research that was publically funded. It is arguable that the economy would be better off if the entire public got a shot at using the fruits of their tax dollars to try to create some sort of successful business venture from it rather than a few elites.

      Regardless, however, we funded that research, therefore, there is no reason why a chosen few should only be allowed to use it.

    14. Re:That Sucks by Anonymous Coward · · Score: 0

      UK's Oxford is public too then.

    15. Re:That Sucks by grimsweep · · Score: 1
      Actually, this type of research is often primarily funded by an external source. It's not unusual to have contracts (implied or otherwise) with private sources that stipulate what can and cannot be done with the results. The most your tax dollars have done is to provide them with a means of carrying it out in a well-equipped environment. Technically, they're still taxed on it and the majority of the profit goes to the school, which indirectly earns money for the state.

      Arguably, this would be like taking issue with publicly funded roads and highways being used by private businesses to make money (i.e. the freight business). We're all paying for those roads, but without them they would be unable to make a profit.

      I gather this wouldn't be such a point of contention if it a patent wasn't involved. This said, I believe this study agrees with your statement.

    16. Re:That Sucks by KillerDeathRobot · · Score: 1

      Um, rape is practically never a crime committed in pursuit of financial profit.

      --
      Thinkin' Lincoln - a web comic of presidential proportions
    17. Re:That Sucks by tomstdenis · · Score: 1

      I have no problem with a uni offering consulting or products [they can produce] for sale. I have a problem with patents.

      The POINT of a uni is to train/educate and THEN PROMOTE FURTHER DEVELOPMENT. That's the point of your PhD is to add something new to the field.

      If I'm paying you to develop science for society [e.g. a good thing] and then you rob society of the use of your idea then what's the point? Oh in 20 years society will benefit of it ... big whoop.

      I wouldn't have a problem if a gang of grad students got together and offered consulting [for example] services to raise funds for their studies. At least they're providing something of value to society.

      Think about it, it's like having one of your own employees patent something from under your feet that they invented while you paid them to work in your lab.

      --
      Someday, I'll have a real sig.
    18. Re:That Sucks by Anonymous Coward · · Score: 0

      Researchers at a publicly funded institution are using their research results for personal (financial gain). Pennsylvania's tax dollars at work? How is this legal?

      You can thank Pennsylvania's governor, "Fast" Eddie Rendell.

    19. Re:That Sucks by pantalanaga11 · · Score: 1

      As an alum still paying off out of state tution, PSU is NOT a public institution. The University gets next to nothing from the state of PA.

  3. Re:Tag for /. by Anonymous Coward · · Score: 0

    i prefer the 'yes', 'no', 'amirite', 'itsatrap' tags to the slashdot stories.

    clearly the way that they were meant to be used!

  4. Just how broad is a concept? by Anonymous Coward · · Score: 0

    I think I could categorise most things using less than 15 (admittedly very broad) tags. Animal, person, plant, machine, sports, vehicle, furniture, book, etc.

    1. Re:Just how broad is a concept? by James+McGuigan · · Score: 1
      I think I could categorise most things using less than 15 (admittedly very broad) tags. Animal, person, plant, machine, sports, vehicle, furniture, book, etc.
      So which of these categories does a mushroom fit into?
    2. Re:Just how broad is a concept? by zomper514 · · Score: 0

      In my country categories would be:

      God
      Man
      Horse
      Dog
      Woman
      Rat
      Crustilly (like a beatle).

  5. The other 50% is the problem by Heir+Of+The+Mess · · Score: 2, Informative

    I've seen lots of systems like this. The problem is in the 50% of the images that don't work, so basically you have to manually tag 50% of your images.

    I saw an interesting one about 10 years ago. It took an X-Ray image, did an edge detection, converted all the edges to a slope vs distance 2D plot, and conerted edge curves to a radius and distance plot, then used a kind of statistical correlation algorithm to pick which part of the body the image was from. I could imagine that you could apply something similar to the luminance of an image to pick out objects, and then maybe do some color transforms and stuff to improve results. The article says they do it in 1.4 seconds per image though, which is impressive.

    --
    Australian running a company that does C# / C++ / Java / SQL / Python / Mathematica
    1. Re:The other 50% is the problem by cloudmaster · · Score: 2, Insightful

      Since you don't know *which* 50% it'll get right, though, you end up having to look at 100% to determine if the system got it right or not. At that point, it's only saving you a few seconds of typing / picking from a drop-down list. :)

    2. Re:The other 50% is the problem by llauren · · Score: 1

      If they did it with any kind of sense, they would not tag pictures they weren't confident enough of. But of course, i didn't read the fine article. This is /. after all :)

    3. Re:The other 50% is the problem by pe1chl · · Score: 1

      It is quite common to start building a system like this (image recognition, speech recognition, automatic translation, etc etc) and publish a press release stating that "the initial results are promising".
      That is because the coarse approach to the problem is relatively uncomplicated, and after building some framework and inputting some reference data it is easy to make the system do some things right. Like guessing keywords correct for 50% of the input.

      What is hard is to get it correct for close to 100% of the input. That is why you usually never hear again from such projects. The initial ramp-up in "promising results" quickly flattens and no noticable further progress is made unless a completely novel approach is taken.

    4. Re:The other 50% is the problem by Peter+Cooper · · Score: 1

      It depends what you're using the image tagging for. If it's just for search, a la Google, then it doesn't matter if you get some false positives (though it had better be way under 50%!). I mean, if you search for something on Google you sometimes come up with totally irrelevant stuff to the query, and people aren't that worried about it.

  6. Prior art? by Sirch · · Score: 1

    Not RTFA to be honest, but can I claim prior art?

    http://www.relle.co.uk/papers/2003-Content_Based_I mage_Retrieval__A_Words_and_Pictures_Approach.pdf

    We didn't have enough time to train the system properly, but itstarted off well...

    1. Re:Prior art? by Anonymous Coward · · Score: 0

      here's an earlier system that builds on public domain research

      http://www.beholdsearch.com/

      the authors may in fact patent a particular algorithmic aspect of their system, rather than their approach as a whole

    2. Re:Prior art? by SausageOfDoom · · Score: 1

      I think lots of people have done this before - I did something very similar four years ago, and it's actually pretty simple (assuming you understand degree-level vision processing), especially if they're only aiming to get 1 tag right in 15. I think that this is just another example of the US patent system is getting out of control.

    3. Re:Prior art? by nietpiet · · Score: 1

      I'm sorry, but that pdf link you posted, is not a scientific article, it seems like a requirements specification made by business students.
      The science in your 'prior art' is mostly segmentation, and has very limited validation.

      If you say Kobus Barnard's work has prior art, that is true, because his work is very related to the ALIP system.
      see http://kobus.ca/research/talks/INRIA-02/index.html

    4. Re:Prior art? by Anonymous Coward · · Score: 0

      The technique described in the article falls into the broad category of automatic image annotation. This wikipedia article may be helpful for looking at previous academic work in this area: http://en.wikipedia.org/wiki/Automatic_image_annot ation

  7. LIPS by ch-chuck · · Score: 1

    You've got to hand it to those cunning linguists at Penn State.

    --
    try { do() || do_not(); } catch (JediException err) { yoda(err); }
    1. Re:LIPS by mattwarden · · Score: 1

      cunning linguists

      That's disgusting.

    2. Re:LIPS by Original+Replica · · Score: 2, Funny

      "cunning linguists
      That's disgusting."

      Actually, if done well, it's quite pleasant for all involved.

      --
      We are all just people.
    3. Re:LIPS by shashark · · Score: 2, Funny

      They might be cunning linguists - but you sure are a master debator.

  8. Google, others doing similar research? by EMIce · · Score: 1

    I'm sure a lot of research is being done in this area, in fact there is lots of interest implement this sort of thing in DSP for robot vision. How much of what this patent covers overlaps with what the others are working on? Is this something completely out from left field or does it fit the trend of where this area research was headed anyway?

    1. Re:Google, others doing similar research? by MikeFM · · Score: 1

      I did this as a project a few years ago when I was building something similar to, the yet unheard of, Flickr. Except I didn't limit myself to 15 tags. I just went through thousands of pics and tagged them with keywords I thought of as I did them and used that information to train a system that'd then go through and tag other pics. It wasn't perfect but did work pretty well and I think it could have been awesome. I tried pitching it to Google as a mashup of Google Images and Slashdot with powerful indexing and search features but I never got any response to any of my emails. Guess you have to have a PhD to get them to care about your ideas.

      Anyway - it's not to hard to train a program to pull out things such as colors, shading, patterns, and shapes from pics and then from those it can learn what combinations are good indicators of certain keywords. For example, it's pretty easy to recognize most nudity as you'll get skin tones and usually get certain shapes (breasts are usually pretty similar in shape) so that with enough training it becomes pretty easy to pick those out of random images. Likewise it's pretty easy to pick out the shape of an airplane in a photo.

      You can really do a lot with neural nets if you feel like bothering. The trick is to feed them the right information so that they can do something useful.

      --
      At what price learning? At what cost wisdom? The price is a man's peace of mind, and the cost is his life.
  9. A video an the subject by damgx · · Score: 2, Informative

    Luis Van Ahn did something almost the same, his idea though is to use humans aswell.

    View the video on Human Computation

    --
    I only read slash. for the articles...
    1. Re:A video an the subject by thebes · · Score: 1

      I thought this was going to be a bunch of chinese people in a basement calculating numbers and stuff.

    2. Re:A video an the subject by bunratty · · Score: 1

      Or better yet, just go ahead and play The ESP Game and Peekaboom.

      --
      What a fool believes, he sees, no wise man has the power to reason away.
    3. Re:A video an the subject by Anonymous Coward · · Score: 0

      Luis Van Ahn did something almost the same

      No, he did the exact opposite. That people want to tag images is pretty obvious, the interesting part is how it's done. This research uses a machine learning algorithm, Ahn's work is focused precisely on avoiding the use of such algorithms, just as these people's research is focused on avoiding the need for humans to have a part in the process.

      Besides, this approach isn't even new. There's a ton of similar work. What's impressive is the speed they do it with.

      Sigh.

  10. retrievr by iceph03nix · · Score: 1

    For those who missed retrievr...http://labs.systemone.at/retrievr/ While this is good for hours of entertainment, i hope what theyre promoting is better.

    --
    Never attribute to malice that which can be adequately explained by stupidity
    1. Re:retrievr by ExFCER · · Score: 1

      The closest I got to my image was a phalanx symbol... http://www.flickr.com/photos/13915774@N00/24319591

  11. Nobody is impressed by this by indigest · · Score: 1
    FTFA:
    The analysis takes about 1.4 seconds per image and in 98 per cent of tests suggests at least one correct tag in the top 15.
    I suspect you could generate a list of 15 sufficiently vague words that would cover 98% of all images. Here's a start: people, sport, animal, trees...
    1. Re:Nobody is impressed by this by Anonymous Coward · · Score: 0

      Let me round out the list for yah:

      people, sport, animal, trees, porn, porno, pornography, pr0n, boobies, b00bies, tits, ass, pussy, sex, girl

    2. Re:Nobody is impressed by this by Scottitude · · Score: 1
      I "tested" it with an image of Gord Downie holding a microphone in the wash of a blue spotlight. The tags this "groundbreaking" program suggested were:

      building city modern historical architecture people ocean_animal fish sub_sea water space cyber art ocean sport

      Only one "suggested" tag is relevant. Too bad so much time and effort went into such a waste of time and effort.

    3. Re:Nobody is impressed by this by linj · · Score: 1
      I suspect you could generate a list of 15 sufficiently vague words that would cover 98% of all images. Here's a start: people, sport, animal, trees...


      No kidding... How about "pixels"?
  12. I swear I saw a butterfly by packetmon · · Score: 1

    at least that's what I told the psychologist. Then on the second look, it looked like splotched ink on a paper that was then folded in half... I hope this software doesn't think like me cause at the end of it all, I saw a segfaulted X server on fvwm

  13. API by dampjam · · Score: 1

    I currently work for the group doing this - a very cool new feature will be launched in the next week that I am writing (stay tuned). Yes - this project has been done many times before by many people (to lesser degrees of success than this), but the thing to keep in mind is that this is realtime. It takes less than a second for the tags to be generated. All previous systems required a much larger amount of processing time. Check out www.alipr.com to try it yourself!

    1. Re:API by OzPhIsH · · Score: 1

      Just wondering what type of classifier you're using. Is it just an extension of one of the classical classification approaches, or a weird hybrid approach? Is there an actual research/conference paper those of us in the field can read instead of this fluff article? The real time nature of classifications makes me think you're handling the incoming images as stream data. Maybe somehow you can extend it to video, which is after all, nothing more than an image stream...

      --

      "To lead the people, you must walk behind them"

    2. Re:API by dampjam · · Score: 1

      I will ask them to upload it to alipr.com tomorrow... there might be a problem because it's published and now belongs to the journal. It was just presented at the ACM Multimedia Conference at UCSB, if you get those proceedings you can find it.

      Make sure to check back tomorrow to be able to search based on the tags that the computer suggests, people verify, and ones that people enter manually. I just got all the cron jobs working together.

  14. 1 out of 15 ? impressive by Gothmolly · · Score: 1

    How do they get less than a 50% average that you'd get by just guessing?

    (yes, assuming a normal distribution of 'concepts' in the pictures, etc)

    --
    I want to delete my account but Slashdot doesn't allow it.
    1. Re:1 out of 15 ? impressive by CrabbMan · · Score: 1

      How do they get less than a 50% average that you'd get by just guessing? (yes, assuming a normal distribution of 'concepts' in the pictures, etc)

      I hate to be a hater. . . but firstly, I doubt any sort of Gaussian or even mixture of Gaussians will work well to describe the distribution of picture labels. And secondly, you get a 50% average by making guesses about an even Bernoulli distribution, like a coin flip.

    2. Re:1 out of 15 ? impressive by Anonymous Coward · · Score: 0

      Here's my algorithm:

      def guess_tags(image):
                import random, time
                random.seed(time.time())
                words = file("/usr/share/dict/web2").readlines()
                tags = ["nature", "man-made"]
                for i in range(1, 13):
                          random_word = words[random.randint(1, len(words))]
                          tags += [random_word]
                return tags

    3. Re:1 out of 15 ? impressive by wayward_bruce · · Score: 2, Insightful
      How do they get less than a 50% average that you'd get by just guessing?
      How do you get that 50% is average on guessing? Their tag pool contains 332 "concepts", which means that randomly picking 15 would give you about 1/22 chance of getting a correct tag for a picture that is tagged with one word. For a two-tag image, you get 1/11. To get up to 50% you'd have to work with images tagged with four or five words. Did I miss something here? Besides, the claim is that "in 98 per cent of tests suggests at least one correct tag in the top 15", the keywords here being "98%" and "at least". We don't know how the number of correctly identified tags is distributed, so we can't say much about that anyway. This reminds me of Pres Eckhart and John Mauchly inviting a group of female "computers" to show them their first two blocks of tubes perform a computation of 5*1000. One of these ladies later commented that they had a whole lot of equipment for such a simple computation.
  15. w00t!!! by rts008 · · Score: 2, Funny

    Now almost 7% of my pr0n will get tagged correctly!
    That's cool, the rest of it will be like opening xmas presents!

    *file: 123456.jpeg>open>Aghh! Goatse!*

    Hmmm...This may be neat when it gets a LITTLE more accurate, but a cool start none the less.
    Kudus to the gang for getting a grip on a hard problem...erm..nevermind.

    --
    Down With Slashdot BETA!!! I've been around the corner and seen the oliphant; you can only abuse me from your perspecti
    1. Re:w00t!!! by Anonymous Coward · · Score: 0

      Of course, the question in everyone's mind is: Why would you have Goatse saved in your pr0n? :)

    2. Re:w00t!!! by CCFreak2K · · Score: 2, Funny

      You named your penis "problem?"

      --
      "Beware of he who would deny you access to information, for in his heart he dreams himself your master."
    3. Re:w00t!!! by rts008 · · Score: 1

      A lot of the time it is. Especially it's sense of timing.

      --
      Down With Slashdot BETA!!! I've been around the corner and seen the oliphant; you can only abuse me from your perspecti
    4. Re:w00t!!! by nsillik · · Score: 1

      I ran their software on the mental image after reading your comment:
      Tags: Disgusting, small, horrifying, triceratops, Grenada

      Three out of five ain't bad.

    5. Re:w00t!!! by rts008 · · Score: 1

      "Tags: Disgusting, small, horrifying, triceratops, Grenada"

      Okay, I get the first three, but tricerotops aand Grenada? (disclaimer: I trained Spl Forces teams to go into Grenada- and yes, it WAS as fscked up as you might have heard!- so I may have a whole different point of view/perspective about Grenada than you may have :-) )

      I'm intrigued, especially about the tricerotops, and would appreciate an explaination if you would be so kind.
      (no sarcasm intended or implied- I'm really curious!)

      --
      Down With Slashdot BETA!!! I've been around the corner and seen the oliphant; you can only abuse me from your perspecti
    6. Re:w00t!!! by nsillik · · Score: 1

      It was an example of how poor the software (could) be. I'm not really judging the software, but most implementations I've seen give way-off responses like that.

  16. The obvious by Anonymous Coward · · Score: 0

    This enables a word search of images. Suppose that I'm doing research on the advertising for Cat's Paw Boot Products. This technology would enable me to find such images which are incidental parts of other images. In other words, it could turn up the fact that Fred's Boot Shop in downtown Moose Jaw had such a sign in its window in 1935. The photograph might be labeled 'Downtown 1935' and there would be no mention of Cat's Paw anywhere. It might even turn up the fact that Al Capone's old limo was parked just down the street. This is actually exciting news for researchers.

  17. In future news... by Parallax+Blue · · Score: 1

    Image recognition software is making it even easier for your kids to find porn! More at 6...

    1. Re:In future news... by mrchaotica · · Score: 1

      If the algorithm can recognize it in order to find it, it can do so in order to block it too.

      --

      "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

  18. As the software said ... by Anonymous Coward · · Score: 0
    "Ahh, but I don't see a goat in this picture?"


    Snap ... it's down.

  19. IBM was working on this years ago... by rthille · · Score: 1


    Not sure how far they got, but remember reading that IBM was working on this and had some reasonable success at object recognition in images. I'd love to be able to classify the 10k digital images I've got around. Especially if it can recognize individuals (not that it would know their names initially, but would be trainable).

    --
    Awesome furniture, accessories and cabinetry in Santa Rosa, CA: http://humanity-home.com/
    1. Re:IBM was working on this years ago... by Anonymous Coward · · Score: 0

      Especially if it can recognize individuals (not that it would know their names initially, but would be trainable).

      You should be able to do that with off-the-shelf facial recognition software.

    2. Re:IBM was working on this years ago... by griffjon · · Score: 1

      I'm sure it can figure out "Jenna Jameson" quick enough.

      --
      Returned Peace Corps IT Volunteer
    3. Re:IBM was working on this years ago... by Concerned+Onlooker · · Score: 1

      Yep, it is called QBIC--query by image content. The web site points to another web site or two that use QBIC for retrieving images from a collection.

      Facial recognition is one thing, but if you just want to try to categorize your current collection you might try imgSeek, which is a pretty cool program. Keep in mind that no one has really yet hit upon a great general purpose algorithm for finding matches to images or query by content. There is a large subjective component in categorizing images. If an image is mostly monochromatic blue and it's a picture of a boat, does it get classified as a boat or does it get lumped in with other predominantly blue images? How do you decide whether something is more blue than it is a boat? I suppose at that point a human has to step in to say "I'm really interested in the shape of the object more than I am the color right now."

      --
      http://www.rootstrikers.org/
    4. Re:IBM was working on this years ago... by MicrosoftRepresentit · · Score: 0

      Um...I don't think you understand the concept of 'keywords' (hint: note the plural)

    5. Re:IBM was working on this years ago... by Concerned+Onlooker · · Score: 1

      I'm not sure what your comment means. I was responding to a comment about IBM's QBIC technology as well as a comment about classification. Keywords may be a part of the original story but they weren't pertinent to what I was responding to.

      I think I've had it with Slashdot. Too many people like you.

      --
      http://www.rootstrikers.org/
  20. Reportedly by stunt_penguin · · Score: 2, Funny

    Reportedly the researchers showed the system a picture of a Death Star, and it correctly tagged the image with 'thatsnomoon'.

    The system has clearly been let crawl the web for far too long.

    --
    When the posters fear their moderators, there is tyranny; when the moderators fears the posters, there is liberty.
  21. Re:Tag for /. by Amouth · · Score: 1

    "itsatrap" is the best... but use sparingly

    --
    '...if only "Jumping to a Conclusion" was an event in the Olympics.'
  22. balls by Anonymous Coward · · Score: 0

    anyone who has balls please repliy 2 this message!

    1. Re:balls by ExFCER · · Score: 1

      Please see my other post in this thread. THX.

  23. Google will buy it... by Duggeek · · Score: 1

    Unless Jupiter Media gets to it first.

    Someone like myself would understand the hours of data-entry and database development that goes into indexing imagery. I research photo copyrights for a living.

    The fact that there is a feasible, automated system that can do the work will certainly cut down the man-hours for that sort of work; at least by half.

    Pity, though. I heard that Google and others had a telecommuting thing that paid people to recognize what's in a photo. Sorry to hear they'll be out of a job soon.

    --
    This post © Copyrite Duggeek, all rights reversed.
  24. Workarounds....... by Anachragnome · · Score: 1

    .........SexSurfer logs in to begin his daily search of the web to find more images to rip in an effort to increase his database of porn images, utilizing this technology, only to find that most of the images consist of naked women with political statements printed on their asses......

    Seriously now, I am sure their are people out there that have already got ideas rolling around in their heads about how they can use this technology to hijack images to their advantage. Once somebody understands how the technology works it is only a matter of time before it is used for nefarious purposes, by means of "tricking" the technology. And in the process, invalidating any possible means by which the developers can realize a return on their investments.

    Personally, I'd love to use such a technology(if it actually works) to sift through the plethora of "crap" images I have to search through on the web. It can be really frustrating to do a search only to find that a vast amount of the results are TOTALLY out of context simply because of the title tag attached.

    1. Re:Workarounds....... by AmberBlackCat · · Score: 1

      Couldn't you just search for +ass -iraq -china?

  25. Won't this... by Anonymous Coward · · Score: 0

    Won't this take away many of the pr0n-sorting jobs and slow down the pr0n economy?

    I for one will not stand for this!

  26. One tag in 15 correct by theeddie55 · · Score: 1

    well that really had to happen, i've just tried it and you can't really go wrong if one of the top 15 tags is 'photo' and another of the tags is 'thing'.

  27. image spam filtering by grapeape · · Score: 1

    Now all they need to do is come up with a way to recognize spam words in image text without the overhead of OCR and they can make a fortune on that alone.

    1. Re:image spam filtering by Anonymous Coward · · Score: 0

      well true current OCR are cpu intensive per image but they are used anyway... and guess what spammers realised that and they adopted new technique... they send now gif which paints "slowly" different part of image + they add noise to avoid text detection.. like with bayesian... they get antispam product and they find out where the crack is... and the loop continues... so antispam is good if you got antispam provider which updates techniques in the same pace as spammers..

  28. Now all we need... by Anonymous Coward · · Score: 0

    Automated dupe tagging.

  29. Uni assignment by wayneo13 · · Score: 1

    This reminds me of a uni assignment that i did where we matched images based on colour.

  30. Penn State and Patents by Anonymous Coward · · Score: 0

    If anyone cares, Penn State has a strict policy with patents, detailed here among other places. It all comes from the fact that most of the University's reseach is paid for by grants and industry cooperation. I didn't RTFA yet, but I'd bet that this is going to be immediately licensed to either the federal government or other such body, whomever funded the research. Otherwise, it could very well become public domain.

  31. I'm not... by Morphine007 · · Score: 1

    ... usually a pedant... but you don't train a database. It was likely a neural net, but TFA is rather thin on details. Anyone got a link to their paper?

  32. Neural networks by Anonymous Coward · · Score: 0

    Couldnt something like this be easily accomplished using neural networks.. I mean u will need a good dictinary and some base patterns to teach the network and then split up the image to compare for patterns. I am not an expert of image processing or neural networks but I have dabbled in some matlab based coding for both and it seems like a doable program the only problem I see is how to split up the image and at what point, edge detection maybe

  33. Re:Tag for /. by Anonymous Coward · · Score: 0

    "ponies" "lasers" "sharks" and "fud" are my favourites, "fud" being the most abused tag on the site..

  34. Lots of implications for Surveillance, however. by Anonymous Coward · · Score: 1, Interesting

    Makes it easier to process all that data generated by all those security cams.

    Is there a "Big Brother" category on Slashdot, yet?

  35. So how long before... by Anonymous Coward · · Score: 0

    How long before Google buys them out?

    ---
    Yes, +5 Insightful me!

    1. Re:So how long before... by Impy+the+Impiuos+Imp · · Score: 1
      > software that automatically recognizes objects in photos and tags them accordingly
      Scanning computer BD38230X372HDAC...
       
      Picture 1...
      Banner ad...face...chair...dog...end analysis
       
      Picture 2...
      Full-lipped female face...pretty eyes...face recognition: Portman
       
      Picture 3...
      Wood paneled room...tit...tit...tit...large diameter areola
      --
      (-1: Post disagrees with my already-settled worldview) is not a valid mod option.
  36. XRay much easier though by SuperKendall · · Score: 2, Informative

    The human body is pretty much the same between people, and XRays are generally shot from similar directions person to person - so the kind of check you are describing seems like it would yield high matches for pretty much any part of the body.

    In the real world we have an object you might take a picture of from any angle, using a myriad of focal lengths, with variable levels of distorition depending on the lens and camera used. Really nasty for generic object recognition. I think the best we can hope for in terms of accuracy is perhaps some kind of facial recognition autmatically recognizing and tagging people in images.

    --
    "There is more worth loving than we have strength to love." - Brian Jay Stanley
    1. Re:XRay much easier though by mrchaotica · · Score: 1
      In the real world we have an object you might take a picture of from any angle, using a myriad of focal lengths, with variable levels of distorition depending on the lens and camera used. Really nasty for generic object recognition.

      Don't forget occlusion!

      --

      "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

  37. Wrong approach? by havardi · · Score: 1

    This would be cool and all, but why not focus more on letting humans do the hard work-- like if I could take a picture of a tree and then press a button and say aloud; "redwood tree", and have that tag the file.

  38. Well I sure hope they invented it before early 80s by presidenteloco · · Score: 1

    When I was studying textbooks on how to do this in undergrad comp sci.

    --

    Where are we going and why are we in a handbasket?
  39. So they say... by Ninwa · · Score: 1

    That's all fine and great that they can tell us, but why the heck couldn't they make a web-interface for it so I could try it out?

  40. Bullshit Patents by OzPhIsH · · Score: 1

    This just your standard data mining classification system, simply applied to image data as input, with the tags being possible classifications. This is an obvious application to ANYONE in the field. Software patents suck.

    --

    "To lead the people, you must walk behind them"

    1. Re:Bullshit Patents by heinousjay · · Score: 1

      If it's so obvious, how did they do it?

      --
      Slashdot - where whining about luck is the new way to make the world you want.
    2. Re:Bullshit Patents by OzPhIsH · · Score: 2, Insightful

      The application is obvious, although, I'll admit, their EXACT method isn't. But at it's core, it is basic supervised learning. Feed your classifier a training a set of images that are already tagged. Extract the features of the image and use those features to predict the tags. When the predicted classifications don't match the actual tags, adjust the model, rinse and repeat. Just pick up a data mining book. Like I said, lots of people are working on image classification, and this is an obvious application, at least to those in data mining/machine learning related fields. That doesn't make it an EASY thing to do successfully. If it were easy, there wouldn't be so much research going on. In that sense this group gets my respect for doing a pretty successful job. My concern is the patent. People already look at images and classify them based on content. That's what tagging IS. When computer software is written to automatically do something that every normal person does anyway, should that be patentable? How is this different than people giving Amazon tons of shit for their patents on their product recommender system?

      --

      "To lead the people, you must walk behind them"

  41. confucius says... by recharged95 · · Score: 1
    Considering the adage that "a picture is worth a thousand words", they're going to have a lot more words to index--where the words may not follow a specific taxonomy.

    And that's one of the problems: does an image define the taxonomy or taxonomy defines the image [type]?

  42. When it's going to be on Flickr? by pigreco314 · · Score: 1

    I quickly run out of fantasy when it comes to assign tags to my pictures: an automated mass tag finder will save hours of my precious time while uploading photos to Flickr.

    --
    "linux" is a very common word and was not included in your search.
  43. Neural Nets by gekoscan · · Score: 2, Insightful

    How can you take a neural network and train it, then patent that?
    That's like patenting training a dog to fetch a stick, it's completely rediculous.

    You take software capable of generalizing a neural network algorithm by feeding it pictures and associating each picture with certain tags. It then creates a generalized algorithm model based on what you fed it initially. So that when you give new input it is capable of outputting tags most similar to what you initially trained it.

    So yes this software can recognize boxes, shapes, other objects, maybe scenes etc and associate them with tags... but ask them how the algorithm works under the hood =) They have no idea... a neural network is like a black box after it has been trained. You feed it input and it gives you output based on it's initial training. The inner workings are chaotic spaghetti values set on each neuron weighting and can't be deciphered.

    How can you patent software that is a black box inside?

    "Yes hello patent office? I have this box that manufactures microprocessors. I feed it all the materials and it outputs a shiny new processor. I am not sure of the manufacturing process internally but the output works great. I would like to patent this manufacturing process.

    "Okay your patent number is 247286-"BLACK BOX"-9

    The whole point of a neural network is it generalizes what you train it and can future predict any input based on that.

    It's like having the invention of the first mirror and everytime someone put something different infront of it, that person called up the art gallery because they had a new painting that they wanted in their name (because depending what was in front of it you get a different reflection).

  44. unimpressing by Eivind · · Score: 1
    If it worked, it'd be very useful. However, getting the top 1 tag correct 50% of the time (which is the only quantifiable claim in the article) is pretty straigthforward. For most peoples photo-albums that can be done by the following AI-program: "print 'people'"

    There's a few subjects that are so common that it's more or less a given they'll be in a large fraction of the photos. Outputting "people, buildings, nature, animals, plants, city" would probably give atleast 1-2 "correct" tags for 90% of whats in peoples photoalbums.

    I had a class on neural networks and their (weak) sort of "ai", one task was to build a program to separate male from female names. The best programs could manage 80% or so, which is sorta decent. Until you realize that checking against static lists of the top 100 male/female names, if it's not in the list guess female if it ends in 'a', otherwise guess randomly will get you aproximately 95%. Furthermore, the latter program runs an order of magnitude faster, is more easily debuggable, can be understood by anyone, and can trivially be "extended" to reach 99% or more, simply by extending the lists of known male/female names.

  45. link? by Tom · · Score: 1

    So where's the download link? How can software matter if I can't get it? ;-)

    --
    Assorted stuff I do sometimes: Lemuria.org
  46. Pictionary! by f0rtytw0 · · Score: 1

    I now have someone to play pictionary with.

    --
    this is the most important sig ever! In your face 446154!
  47. Big deal by Oligonicella · · Score: 1

    So what were they doing, throwing a dart at a damn board? That success rate is no better then randomly applying vague words.
     
    Move along to real research.

  48. wonder how this compares by Corporate+Gadfly · · Score: 1

    I wonder how this compares with Riya. At some point, there were plenty of rumours of a possible Google purchase of Riya. Then again they were rumours.

    I haven't RTFA and I don't have any experience with Riya either, so consider the above posting a waste of time (if you must).

    --
    Corporate Gadfly
    Jonathan Archer: the most beaten up Enterprise captain in Star Trek history
  49. Grammar whore right here... by Anonymous Coward · · Score: 0

    The vast majority of the images on the internets including The Google include "Pornography" in it's top 15 tags suggested. The accuracy rate is surprisingly high.

    What the hell kind of sentence is that?

  50. MOD PARENT FUNNY! Re:Pictionary! by corychristison · · Score: 1

    I burst into laughter when I came across this comment.

    I wish I had mod points.

  51. Very important... by Not-a-Neg · · Score: 1

    Does it detect breast size?

    --
    -==- Buy a Mac and leave me alone!
  52. Wouldn't it be easier... by GWBasic · · Score: 1

    Wouldn't it be easier to pay starving children in [???] $0.01 / hour to tag images?

  53. Great point! by SuperKendall · · Score: 1

    Don't forget occlusion!

    Yes, the real world has poles and fences.

    --
    "There is more worth loving than we have strength to love." - Brian Jay Stanley