Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Slashdot Log In

Log In

[ Create a new account ]

New Algorithms Improve Image Search

Posted by kdawson on Tue Apr 03, 2007 03:25 PM
from the where's-waldo dept.
bc90021 writes "Electrical engineers from UC San Diego are making progress on an image search engine that analyzes the images themselves. At the core of this Supervised Multiclass Labeling system is a set of simple yet powerful algorithms developed at UCSD. Once you train the system (the 'supervised' part), you can set it loose on a database of unlabeled images. The system calculates the probability that various objects it has been trained to recognize are present, and labels the images accordingly. After labeling, images can be retrieved via keyword searches. Accuracy of the UCSD system has outpaced that of other content-based image labeling and retrieval systems in the literature. One of the co-authors works at Google, where the researchers have access to image collections at the largest of scales."
This discussion has been archived. No new comments can be posted.
Display Options Threshold:
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
  • when I was your age (Score:5, Funny)

    by Anonymous Coward on Tuesday April 03 2007, @03:28PM (#18593677)
    I remember when we had to go to a gas station and *buy* porn. Now you have computers out there finding porn for you. You kids today have it too easy!
  • Cool! (Score:4, Interesting)

    by Deagol (323173) on Tuesday April 03 2007, @03:32PM (#18593761)
    (http://slashdot.org/)
    If this doesn't revolutionize the searching of online porn galleries, I don't know what will. :)

    Snarkiness aside, this is pretty cool stuff. I hope to see usable OSS code in a few years. Imagine how cool it would be to query "show me all pics with my daughter and her rabbits" and have it week through the 1000's of digital family photos.

    • Re:Cool! (Score:5, Funny)

      by Cheapy (809643) on Tuesday April 03 2007, @03:44PM (#18593995)
      I find it disturbing that you combine porn, your daughter, and rabbits all in your post.

      You have issues.
      [ Parent ]
      • Re:Cool! by smaddox (Score:1) Tuesday April 03 2007, @08:16PM
    • Re:Cool! by UbuntuDupe (Score:2) Tuesday April 03 2007, @03:46PM
      • Re:Cool! by sdfad1 (Score:1) Wednesday April 04 2007, @01:20AM
      • 1 reply beneath your current threshold.
    • Re:Cool! (Score:5, Funny)

      by Tackhead (54550) on Tuesday April 03 2007, @03:52PM (#18594159)
      > If this doesn't revolutionize the searching of online porn galleries, I don't know what will. :)
      >
      > Snarkiness aside, this is pretty cool stuff. I hope to see usable OSS code in a few years. Imagine how cool it would be to query "show me all pics with my daughter and her rabbits" and have it week through the 1000's of digital family photos.

      ...the coolness of which is directly proportional to hotness of your daughter, the hotness of whom must then be further weighted by multiplying her hotness by some function of her age. The age-multiplier curve features an abrupt discontinuity that jumps 0.00 to 1.00 at age 18, and some sort of exponential backoff function that starts decreasing the multiplier at around age 35-45.

      But apart from the fact that it's almost Easter, what's with the rabbits? *clickity clic*-hey, I didn't know you could do that with Cadbury easter creme eggs!

      (Rule #34: There is porn of it. No exceptions.)

      [ Parent ]
      • Re:Cool! by andphi (Score:2) Tuesday April 03 2007, @04:19PM
        • Re:Cool! (Score:5, Funny)

          by Anonymous Coward on Tuesday April 03 2007, @04:36PM (#18595111)
          > But, sadly this is Slashdot, so images that contain girls but aren't pr0n are apparently incomprehensible.

          Fortunately, this is Slashdot, so discussions of pr0n that don't feature square-waves, multipliers, and exponential backoff functions are apparently incomprehensible too!

          (What are these "girls" of which you speak? I only remember Millie Amp... she was imaginary, skinny as a wire, but when her insulation got stripped, she stopped resisting, got really hot, and started to moan "ohm, ohm, ohm"?)

          [ Parent ]
          • Re:Cool! (Score:4, Funny)

            by andphi (899406) <phillipsam@gma[ ]com ['il.' in gap]> on Tuesday April 03 2007, @05:04PM (#18595617)
            (Last Journal: Thursday November 08, @11:55AM)
            By 'girls', I mean the limiting reagent in human reproduction. As a class of compounds, 'girls' are extremely common but somewhat volatile, so creating bonds with them is sometimes difficult. They are attracted to other similarly elusive compounds. Examples of these attracting compounds include 'Time', 'emotional vulnerability', and 'financial stability'.
            [ Parent ]
        • 1 reply beneath your current threshold.
      • Re:Cool! by chris_eineke (Score:2) Tuesday April 03 2007, @04:29PM
        • 1 reply beneath your current threshold.
      • Re:Cool! by Maxo-Texas (Score:2) Tuesday April 03 2007, @04:59PM
      • Re:Cool! by hoggoth (Score:3) Tuesday April 03 2007, @05:26PM
        • Re:Cool! by crawly (Score:2) Tuesday April 03 2007, @10:00PM
      • 1 reply beneath your current threshold.
  • so how does this ... (Score:5, Funny)

    change the way I search for Natalie Portman p0rn?
  • Probability (Score:4, Interesting)

    by DoofusOfDeath (636671) on Tuesday April 03 2007, @03:34PM (#18593805)

    The system calculates the probability that various objects it has been trained to recognize are present,

    The probability is either zero or one, because whether or not the feature being sought is present is a state of nature. It would be more helpful to call this number the confidence that the feature is present.

    • Re:Probability by AnonymousCactus (Score:2) Tuesday April 03 2007, @03:45PM
    • Re:Probability by august sun (Score:2) Tuesday April 03 2007, @03:47PM
    • Re:Probability (Score:4, Insightful)

      by Anonymous Coward on Tuesday April 03 2007, @03:47PM (#18594055)
      Not if it is a Bayesian probability [wikipedia.org].
      [ Parent ]
      • Re:Probability by DoofusOfDeath (Score:1) Tuesday April 03 2007, @03:56PM
      • Re:Probability (Score:5, Interesting)

        by timeOday (582209) on Tuesday April 03 2007, @04:47PM (#18595313)
        Or a fuzzy set, as (virtually) all set in the real world are.

        For instance, the set of pictures for which the statement "is this a picture of a chair" is true. There is no objective criteria for this. So imagine you have a bunch of pictures and show each one to a thousand people. Sometimes you might get 0 or 1000 "yes" responses, but often you'll get some number in between (because there are chairs, but barely visible, the picture includes a kids booster seat, or a rock big enough to sit on). This could be interpreted as a probability that somebody will consider a picture to be of a chair.

        [ Parent ]
        • Re:Probability by beav007 (Score:1) Wednesday April 04 2007, @01:40AM
        • Re:Probability by CrimsonScythe (Score:1) Wednesday April 04 2007, @11:03AM
      • 1 reply beneath your current threshold.
    • Re:Probability by aztec rain god (Score:1) Tuesday April 03 2007, @03:48PM
    • Re:Probability by guycouch (Score:1) Tuesday April 03 2007, @03:52PM
    • Re:Probability by zCyl (Score:2) Tuesday April 03 2007, @03:54PM
    • 1 reply beneath your current threshold.
  • by Anonymous Coward on Tuesday April 03 2007, @03:36PM (#18593827)
    ... was similarly trained to recognise tanks in landscapes. I was doing really well - getting a great score on the fresh images it was presented with.

    Then they introduced it to a new batch of images and it fell apart.

    Turns out that the initial set of images had all the tanks shot on a sunny day and all the tankless images shot on a cloudy day (or vice versa). It had learned to tell a sunny day from a cloudy day.

    Ha ha.
  • Why is it better? (Score:2, Informative)

    by AnonymousCactus (810364) on Tuesday April 03 2007, @03:37PM (#18593867)
    I wish the article would mention more about why it is better than similar techniques that have been proposed in the past. (For example, http://luthuli.cs.uiuc.edu/~daf/papers/WAP-fin.pdf [uiuc.edu] seems similar) For instance, where do they get their labels for the training data? A lot of people have tried using contextual words drawn from surrounding web text to limited success due to noise. It's also questionable how well their techniques can do if they need to pre-build a separate classification for each keyword. Finally, there are words that it seems impossible that they could ever distinguish. For example, 'man' vs. 'woman,' would be incredibly complicated for anything but a human. Where are the details? Oh yeah, it's a news story! Here's a link to the paper http://www.svcl.ucsd.edu/publications/journal/2007 /pami/pami07-semantics.pdf [ucsd.edu]
  • Does it accually work... (Score:1, Funny)

    by BurningPi (1032288) on Tuesday April 03 2007, @03:38PM (#18593881)
    ...or is it something like this [slashdot.org]?
  • pr0n (Score:1)

    by SKiRgE (411560) on Tuesday April 03 2007, @03:39PM (#18593899)
    Yes, personal searches were bound to be the first thing to be mentioned, but what about when others (ISPs, bosses, co-workers) are performing these searches on computers you use? I'm sure most people are smart enough not to do such things at work, but what about pop-ups (you couldn't help getting those kinds of popups while searching for a 'fix' to an app), false matches (boss doesn't view, only flags you if the keyword search comes back positive), etc?
    • Re:pr0n by biscon (Score:1) Tuesday April 03 2007, @03:55PM
    • Re:pr0n by LighterShadeOfBlack (Score:2) Tuesday April 03 2007, @04:02PM
      • 1 reply beneath your current threshold.
  • I wonder if these students are using this [slashdot.org] software library?
  • The problem is... (Score:5, Funny)

    by Life700MB (930032) on Tuesday April 03 2007, @03:47PM (#18594053)

    The problem is we all know what's gonna be the first result when searching "Caves on uranus"!!!

    --
    Great hosting [dreamhost.com] 200GB Storage, 2_TB_ bandwidth, php, mysql, ssh, $7.95
    • 1 reply beneath your current threshold.
  • by The Orange Mage (1057436) on Tuesday April 03 2007, @03:49PM (#18594113)
    (http://www.maegworks.com/)
    Run this story again when the system can tell the difference between D, DD, and DDD. Bonus points if it can handle "higher" criteria.
  • by Cafe Alpha (891670) on Tuesday April 03 2007, @03:55PM (#18594241)
    An old AI joke was to call a limited domain version of this a "cat box". The idea a camera with a light on it that comes on whenever it's pointing at a cat.
  • Image SPAM (Score:1)

    by reh187 (182368) on Tuesday April 03 2007, @04:05PM (#18594411)
    (http://420am.org/)
    It would be nice if we could humor these engineers to put as much effort into image spam filters :-P
  • Not exactly new (Score:1, Flamebait)

    by denoir (960304) on Tuesday April 03 2007, @04:10PM (#18594525)
    Their work seems to be based on Gaussian Mixture Models which have been around for two decades or so. It's not a very advanced method either and there are a bunch of better adaptive systems for image recognition, wavelet neural networks being an obvious example (and they've been around more than a decade).

    There is absolutely nothing newsworthy about this. On the contrary, you'll find tons of similar works - mostly as senior year student projects in CS/AI.

  • by OtherFarm (1010907) on Tuesday April 03 2007, @04:12PM (#18594571)
    a robot challenge that will test robots' vision and language understanding [cmu.edu].
    the robots/sobots must be able to recognise objects automatically and perform tasks like: get the "star trek" poster or get the blue dry erase marker. the final event will be held at the twenty-second AAAI conference on artificial intelligence [aaai.org] in vancouver, canada july 22-26 '07 [taken from ofpblog [skynet.be]]
  • by fikx (704101) on Tuesday April 03 2007, @04:32PM (#18595041)
    (Last Journal: Monday November 21 2005, @02:44PM)
    I keep waiting for a real image search to be created without the intermediarry step of tagging it with text.
    I'll be happy when I can tell the search page "find images like this" and give it an existing picture or a sketch. Tagging is too reliant on the consistant metadata to be useful in a general way. Humans can easily find all pictures of, say, fluffy the cat in a pile of photos from all different sources. Can we teach a computer how to do that without having to wait for it to re-tag images from different sources before it can search?
    Still, the better methods we find for tagging, the closer we get to that I guess...
  • I recall seeing an image search a little while back where people tagged images manually, building up a weighted list of tags. It might be a good idea to use a system like that, to train a system like this. Like the spam filters we all know and love.
  • Using games to get lots of tags (Score:2, Insightful)

    by lukeinusa (1083759) on Tuesday April 03 2007, @05:02PM (#18595579)
    One complaint about this work is that it requires tagging an initial set of images that are needed to train the system. Vasconcelos' work uses the academic standard "Corel" dataset of labeled images but also uses tagged images from Flickr to train the system. Using human computation games like the ESP game for images and ListenGame www.listengame.org [listengame.org]for audio, collecting data is not as tough as it once was...
  • if... (Score:1)

    by RedElf (249078) on Tuesday April 03 2007, @05:25PM (#18595983)
    (http://slashdot.org/~RedElf)
    a picture's worth a thousand words, where are we going to store all the words for the useless myspace photo's that get archived?
  • by wallywam1 (715057) on Tuesday April 03 2007, @06:00PM (#18596481)
    It alwasy cracks me up on TV how they show a fingerprint or face recognition system searching for matches. The camera cuts to a computer that's looping away through a bunch of pictures until a match is found. Admittedly, I don't know all of the details, but obviously current systems must have some sort of indexed data points that are entered in a database and then you run queries against the database for potential matches.

    It's a little more plausible now that broadband is readily available but this has been portrayed on TV for years. Can you imagine some podunk field office connecting to an FBI database through a dialup and downloading high resolution images until they found just the right one? Then again, that would make for some good entertainment. Detective walks in..."I've got good news and bad news. The good news is we found the killer. The bad news is, he died of old age."

    • 1 reply beneath your current threshold.
  • by pongo000 (97357) on Tuesday April 03 2007, @06:13PM (#18596645)
    ...because my master's thesis, back in 1996, was using neural nets with a fuzzy logic component to identify surface features on Landsat satellite imagery. The algorithm I came up with was even scale invariant.

    Guess I should have published and patented...damn...there goes any feelings of validation...
  • Haar wavelets (Score:1)

    by neminem (561346) <neminem&gmail,com> on Tuesday April 03 2007, @09:52PM (#18598753)
    (http://www3.hmc.edu/~afield)
    Nobody's mentioned Haar wavelets yet? Weird.

    Look them up - they're part of OpenCV, and I'm pretty sure it's the same basic principles in action.
  • Useful application (Score:1)

    by kn0tw0rk (773805) on Wednesday April 04 2007, @01:19AM (#18600155)
    (Last Journal: Monday March 07 2005, @09:21PM)
    This would be great for deviantArt, as one of the problems is mis-categorising their submissions. If a computer was able to help with that would make finding art of specific subjects/styles much easier.

    Please bring it on!
  • by Ichimaru_Taishou (997267) on Wednesday April 04 2007, @08:51AM (#18604033)
    I missed the link, where can I torrent this?
  • Finally! (Score:1)

    by Cheezymadman (1083175) on Wednesday April 04 2007, @10:22AM (#18605819)
    Now I can search for porn stars that look like that girl in my Englsh class!
  • > The system calculates the probability that various objects it has been
    > trained to recognize are present, and labels the images accordingly

    "Ok, Joe. Let 'er rip on this new test database."

    Cock
    Cock
    Cock
    Vagina
    Cock
    Cock
    Hairy armpit

    "Oh, cool! The upgrade works and can distinguish it!"
    "Nah, wait until you see this!"

    Cock
    Cock
    Cock
    Midget with banana split in hairy ass crack with guy eating the banana split without using his hands on the Howard Stern show
    Cock
    Vagina
    etc.
  • by peter303 (12292) on Wednesday April 04 2007, @10:46AM (#18606359)
    So many boobs; so little time.
  • by Feyr (449684) on Tuesday April 03 2007, @03:47PM (#18594063)
    (Last Journal: Friday January 03 2003, @03:39PM)
    tags
    [ Parent ]
  • by djupedal (584558) on Tuesday April 03 2007, @04:24PM (#18594879)
    C'mon...get with the times.

    The keyword here is 'footprint', and since the new everything is now software, electrical engineers, as an example, would have a need to reduce the heat and energy 'footprint' of a given piece of equipment used to run the software that accomplishes the hit.

    If a hardware engineer can come up with a better search method via software that works quicker, reduces errors and further searching, uses a smaller processor, less heat, etc. then he/she has just reduced the energy footprint, regardless if he/she had to make the software that drove the smaller footprint or not.
    [ Parent ]
  • 10 replies beneath your current threshold.