Software

New Algorithms Improve Image Search 111

bc90021 writes "Electrical engineers from UC San Diego are making progress on an image search engine that analyzes the images themselves. At the core of this Supervised Multiclass Labeling system is a set of simple yet powerful algorithms developed at UCSD. Once you train the system (the 'supervised' part), you can set it loose on a database of unlabeled images. The system calculates the probability that various objects it has been trained to recognize are present, and labels the images accordingly. After labeling, images can be retrieved via keyword searches. Accuracy of the UCSD system has outpaced that of other content-based image labeling and retrieval systems in the literature. One of the co-authors works at Google, where the researchers have access to image collections at the largest of scales."
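The label-then-search flow in the summary can be sketched roughly as follows. This is only an illustration, not the UCSD algorithm: the per-label probabilities, the file names, the 0.5 threshold, and the helper names `label_images` and `build_index` are all made up for the example.

```python
from collections import defaultdict

def label_images(probabilities, threshold=0.5):
    """Keep, for each image, the labels whose estimated probability meets the threshold."""
    return {
        image: sorted(label for label, p in scores.items() if p >= threshold)
        for image, scores in probabilities.items()
    }

def build_index(labels_by_image):
    """Invert image -> labels into label -> images so keyword search is a lookup."""
    index = defaultdict(list)
    for image, labels in labels_by_image.items():
        for label in labels:
            index[label].append(image)
    return index

# Hypothetical output of a trained classifier (values invented for illustration).
probabilities = {
    "img1.jpg": {"tiger": 0.91, "grass": 0.72, "water": 0.05},
    "img2.jpg": {"tiger": 0.10, "grass": 0.15, "water": 0.88},
    "img3.jpg": {"tiger": 0.64, "grass": 0.30, "water": 0.55},
}

labels = label_images(probabilities)
index = build_index(labels)
print(index["tiger"])  # images whose "tiger" probability cleared the threshold
```

The expensive part (estimating the probabilities) happens once at labeling time; the keyword search afterwards is just a dictionary lookup.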
This discussion has been archived. No new comments can be posted.

  • Cool! (Score:4, Interesting)

    by Deagol ( 323173 ) on Tuesday April 03, 2007 @04:32PM (#18593761) Homepage
    If this doesn't revolutionize the searching of online porn galleries, I don't know what will. :)

    Snarkiness aside, this is pretty cool stuff. I hope to see usable OSS code in a few years. Imagine how cool it would be to query "show me all pics with my daughter and her rabbits" and have it weed through the thousands of digital family photos.

  • Probability (Score:4, Interesting)

    by DoofusOfDeath ( 636671 ) on Tuesday April 03, 2007 @04:34PM (#18593805)

    The system calculates the probability that various objects it has been trained to recognize are present,

    The probability is either zero or one, because whether or not the feature being sought is present is a state of nature. It would be more helpful to call this number the confidence that the feature is present.

  • by Anonymous Coward on Tuesday April 03, 2007 @04:36PM (#18593827)
    ... was similarly trained to recognise tanks in landscapes. It was doing really well - getting a great score on the fresh images it was presented with.

    Then they introduced it to a new batch of images and it fell apart.

    Turns out that the initial set of images had all the tanks shot on a sunny day and all the tankless images shot on a cloudy day (or vice versa). It had learned to tell a sunny day from a cloudy day.

    Ha ha.
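A toy version of that failure, as a sketch only: each "image" is reduced to a single made-up mean-brightness number, and the "classifier" is just a brightness cut-off. All values and the function names are invented for illustration; the real system in the anecdote was presumably far more complex.

```python
def fit_brightness_threshold(samples):
    """Learn a cut-off halfway between the mean brightness of each class."""
    tank = [b for b, has_tank in samples if has_tank]
    no_tank = [b for b, has_tank in samples if not has_tank]
    tank_mean = sum(tank) / len(tank)
    no_tank_mean = sum(no_tank) / len(no_tank)
    threshold = (tank_mean + no_tank_mean) / 2
    return threshold, tank_mean > no_tank_mean

def predict(model, brightness):
    """Predict 'tank' purely from which side of the cut-off the brightness falls."""
    threshold, tank_is_brighter = model
    return (brightness > threshold) == tank_is_brighter

def accuracy(model, samples):
    hits = sum(predict(model, b) == has_tank for b, has_tank in samples)
    return hits / len(samples)

# (mean brightness, tank present?) -- in the training set every tank photo is sunny.
train = [(0.9, True), (0.8, True), (0.85, True),
         (0.3, False), (0.2, False), (0.25, False)]
# In the new batch the weather no longer lines up with the tanks.
new_batch = [(0.9, False), (0.2, True), (0.8, True), (0.3, False)]

model = fit_brightness_threshold(train)
print(accuracy(model, train))      # 1.0 on the confounded training set
print(accuracy(model, new_batch))  # 0.5, no better than guessing
```

The model never sees a tank; it only sees the weather, which happened to be a perfect proxy in the first batch.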
  • Re:Cool! (Score:2, Interesting)

    by UbuntuDupe ( 970646 ) * on Tuesday April 03, 2007 @04:46PM (#18594029) Journal
    Correct me if I'm wrong, and I'd like to be wrong, but isn't this (just) another application of Bayesian logic, as is done for spam filtering? They have some way of quantifying the image as a number of variables and then use training to match certain variable values to a search term.

    (Even if it is, I don't want to trivialize the road from theory to practice, I just want to know what's different.)

    I did something a little while ago where I had a program search through text and, for every n-character string appearing in it (where you choose n), record how often each character follows that string. Then you'd give it an n-character string and it would use those frequencies to generate a new body of text. It was cool, even if it generated complete garbage except for large n.

    I hope to see usable OSS code in a few years.

    You mean for this algorithm, or at all?
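The n-character-string text generator described in this comment is essentially a character-level Markov chain. A minimal sketch, assuming the approach as described (the poster's actual program isn't shown; `build_model` and `generate` are names made up here):

```python
import random
from collections import Counter, defaultdict

def build_model(text, n):
    """For every n-character string in text, count which character follows it."""
    model = defaultdict(Counter)
    for i in range(len(text) - n):
        model[text[i:i + n]][text[i + n]] += 1
    return model

def generate(model, seed, length, rng=None):
    """Extend seed (an n-character string, n >= 1) by up to length characters."""
    rng = rng or random.Random()
    n = len(seed)
    out = seed
    for _ in range(length):
        followers = model.get(out[-n:])
        if not followers:  # this string never appeared in the training text
            break
        chars = list(followers)
        weights = list(followers.values())
        # Pick the next character in proportion to how often it followed.
        out += rng.choices(chars, weights=weights)[0]
    return out

text = "the cat sat on the mat and the rat sat on the cat"
model = build_model(text, 3)
print(generate(model, "the", 40, random.Random(0)))
```

With small n the context is too short to constrain the next character much, which is why the output reads as garbage; with large n it mostly replays stretches of the source text.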
  • Re:Probability (Score:2, Interesting)

    by august sun ( 799030 ) on Tuesday April 03, 2007 @04:47PM (#18594043)
    I thought quantum mechanics allows for (mandates?) parallel universes for these variable states...

    So in Schroedinger's cat, in one universe the cat is alive and in one it is dead, and by observing the cat you only find out which universe you are in?

    Couldn't we therefore just say the probability is 1 that the object exists in some universe?

  • by Prysorra ( 1040518 ) on Tuesday April 03, 2007 @05:05PM (#18594431)
    That's one of the famous uses of image analysis - finding the presence of human skin in digital pictures.

    Skin detection.....5.5 million hits on Google.

    Once you can do this accurately, companies like McAfee and Norton can scan the internet and database pr0n sites for the whole web. Keep in mind that there's a subscription service that allows a Norton database to filter websites for them.

    Parents...

  • by EmbeddedJanitor ( 597831 ) on Tuesday April 03, 2007 @05:22PM (#18594829)
    Since a huge % (perhaps most) of image searches are for porn, it is probably a worthwhile thing for a search server to quickly classify likely porn as a way to reduce search server loading.
  • Re:Probability (Score:5, Interesting)

    by timeOday ( 582209 ) on Tuesday April 03, 2007 @05:47PM (#18595313)
    Or a fuzzy set, as (virtually) all sets in the real world are.

    For instance, the set of pictures for which the statement "is this a picture of a chair" is true. There are no objective criteria for this. So imagine you have a bunch of pictures and show each one to a thousand people. Sometimes you might get 0 or 1000 "yes" responses, but often you'll get some number in between (because there are chairs, but barely visible, or the picture includes a kid's booster seat, or a rock big enough to sit on). This could be interpreted as a probability that somebody will consider a picture to be of a chair.
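The thousand-viewer thought experiment can be written down directly. This is a sketch with invented vote counts and file names; `membership` is a made-up helper name:

```python
def membership(yes_votes, total_votes):
    """Fraction of viewers who said 'yes, this is a picture of a chair'."""
    if total_votes <= 0:
        raise ValueError("need at least one vote")
    return yes_votes / total_votes

# Hypothetical "yes" counts out of 1000 viewers per picture.
yes_counts = {
    "office_chair.jpg": 1000,
    "booster_seat.jpg": 640,
    "flat_rock.jpg": 120,
    "tiger.jpg": 0,
}

degrees = {image: membership(yes, 1000) for image, yes in yes_counts.items()}

# Forcing a crisp yes/no answer means picking a cut-off, and the answer moves with it.
for cutoff in (0.1, 0.5, 0.9):
    chairs = [image for image, d in degrees.items() if d >= cutoff]
    print(cutoff, chairs)
```

Only the pictures at 0 and 1000 votes get the same answer at every cut-off; everything in between is a judgment call, which is the point about fuzzy sets.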

  • Re:Cool! (Score:3, Interesting)

    by hoggoth ( 414195 ) on Tuesday April 03, 2007 @06:26PM (#18596007) Journal
    > ...the coolness of which is directly proportional to hotness of your daughter, the hotness of whom must then be further weighted by multiplying her hotness by some function of her age. The age-multiplier curve features an abrupt discontinuity that jumps 0.00 to 1.00 at age 18, and some sort of exponential backoff function that starts decreasing the multiplier at around age 35-45.

    Hotness = BeautyFactor * SexyFactor * AgeHotnessAdjustment
    AgeHotnessAdjustment = cos(2*(Age-18)/3.14159)

    Gives you maximum hotness at 18, falling slowly in the 20's, dropping rapidly after that.
    Also, some hotness under 18 (lets be realistic!) , but not too far under 18

  • Re:Cool! (Score:2, Interesting)

    by crawly ( 890914 ) on Tuesday April 03, 2007 @11:00PM (#18598813)
    Umm no, that isn't what you want at all, that would give you a pretty horrible periodic function.

    Try something like this
    if Age<18: AgeHotnessAdjustment = 0
    else: AgeHotnessAdjustment = 1/exp((Age-18)/20)
