Slashdot Log In
Searching by Image Instead of Keywords
Posted by
samzenpus
on Wed May 04, 2005 08:00 PM
from the find-me-something-square-and-green dept.
from the find-me-something-square-and-green dept.
Content based image retrieval (CBIR), the technique to search for images not by keywords, but by comparing features of the images themselves has been the focus of much research ever since the web emerged. Consider for instance adding CBIR to Google Images, where you would be able to search for images similar to a query image instead of using keywords. A research project at Penn State University has recently been applied to the biggest aviation photo database in the world with close to 800,000 images. You can search for images similar to a photo already in their database (click "View similar photos") or submit your own query image. Some queries generate better results than others but CBIR is certainly here to stay and will be standard in many image applications of the future.
This discussion has been archived.
No new comments can be posted.
Searching by Image Instead of Keywords
|
Log In/Create an Account
| Top
| 184 comments
| Search Discussion
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
Think of the greatness to society! (Score:5, Funny)
Arm jokes... (Score:2, Funny)
Location? (Score:5, Funny)
Wow (Score:5, Interesting)
(http://allenu.sdf1.org/hazuki/ProjectHazuki.html | Last Journal: Friday September 03 2004, @04:53PM)
It will be interesting if we ever get to a stage where we can just search for a random object (or person) in a database of photos. Then you could take pictures of everything with an always-on camera and if you need to find where you put your car keys, just do a search.
Re:Wow (Score:4, Informative)
If you are only interested in searching for images on your own computer, have a look at imgSeek. http://imgseek.python-hosting.com/ [python-hosting.com]
It's been around for some time now. You can not only use an existing image to search, but also do a rough sketch. Check the screenshots: [sourceforge.net]
Nice complement to what has been presented in this article.
Brilliant! (Score:1)
(http://scott-klein.com/)
They would need major manpower to maintain this db (Score:3, Funny)
They would need a team of outsource Indian workers to go through each picture one by one!
I am not Indian but...can I apply for the image filtering job?
I said this first, I should get the job
Top Search (Score:3, Funny)
(http://daishi.aeoth.net/)
security issues (Score:1, Offtopic)
(http://slashdot.org/)
The ever popular 'Breast' option... (Score:1)
+"34b"
-"puffy"
Similar images (Score:2, Funny)
1. Airliners.net
A site with almost 1,000,000 aviation images.
Wow !!! I tested their Sample search [airliners.net] and all the results were aeroplane photos !!! Ok, ok the site only has airplanes but still
On a more serious note the alogorithms seem to look for similatity in the colors and lighting rather than the subjects (for example it shows the interior of a cabin in photos similar to a whole plane in the sky. To really see its effectiveness we need to test in in the real world (google images) . The 'artisticly revealing' photo you always liked
The Human Brain (Score:1)
(http://wfpearson.blogspot.com/ | Last Journal: Monday May 01 2006, @08:29PM)
IANABP, I am not a bio-physicist but it seems very much like artificial intelligence to me.
And for 'lynx' users... (Score:5, Funny)
Some relevant research papers (Score:5, Informative)
(http://edgeofvision.com/ | Last Journal: Wednesday June 20, @08:07PM)
* Finding Naked People [hmc.edu] (Fleck et al, 1996)
* Video google: A text retrieval approach to object matching in videos [ieee.org] (Sivic & Zisserman, 2003): web page demo here [ox.ac.uk]
* Names and Faces in the News [columbia.edu] (Berg et al, 2004)
* FACERET: An Interactive Face Retrieval System Based on Self-Organizing Maps [springerlink.com] (Ruiz-del-Solar et al, 2002)
* Costume: A New Feature for Automatic Video Content Indexing [www.irit.fr] (Jaffre 2005)
One more: automatic film character retrieval (Score:4, Informative)
(http://edgeofvision.com/ | Last Journal: Wednesday June 20, @08:07PM)
Automatic Face Recognition for Film Character Retrieval in Feature-Length Films [cam.ac.uk] (Arandjelovic & Zisserman, 2005)
The objective of this work is to recognize all the frontal faces of a character in the closed world of a movie or situation comedy, given a small number of query faces. This is challenging because faces in a feature-length film are relatively uncontrolled with a wide variability of scale, pose, illumination, and expressions, and also may be partially occluded. We develop a recognition method based on a cascade of processing steps that normalize for the effects of the changing imaging environment. In particular there are three areas of novelty: (i) we suppress the background surrounding the face, enabling the maximum area of the face to be retained for recognition rather than a subset; (ii) we include a pose refinement step to optimize the registration between the test image and face exemplar; and (iii) we use robust distance to a sub-space to allow for partial occlusion and expression change. The method is applied and evaluated on several feature length films. It is demonstrated that high recall rates (over 92%) can be achieved whilst maintaining good precision (over 93%).
and then we have reverse "Googling" for images.. (Score:5, Interesting)
(http://dotpavan.googlepages.com/home)
One has to guess the search word which generated a given set of 20 images in google's image search [robinson.name]
When things are moving forward, we have soomthing to talk about "those good ole days" but frankly the game is interesting initially but later gets boring due to the frequent repetitions..
Is it just colour? (Score:5, Interesting)
(http://www.illuminatingscience.org/)
What I got was an awful lot of red planes - some of which were actually Qantas planes, but I think more by coincidence (i.e., they're red) than design. Many images had nothing to do with Qantas, or even a red plane - they simply had a lot of red in the image.
This is impressive in some ways, but in others it seems like it's simply looking for similar patches of colour. I haven't done enough testing to see what happens if,say, I gave it a half red half green image.
Interesting, but not ready for public consumption just yet. A bit like A.L.I.C.E. the artifial intelligence system actually - neat, but not practical. Yet!
Great! (Score:5, Funny)
(http://www.ministry-of-fun.com/)
eVision? (Score:1)
(http://polarweasel.org/)
I guess eVision [evisionglobal.com] were just too early to market with their visual search engine. Here's a demo or three [evisionglobal.com] of eVe in action.
It sure was cool, just too far ahead of its time...
Several image viewers do this already (Score:1)
It's not quite "put in an image and find me all the similar ones" but the underlying technology is the same, usually creating some kind of "signature" of each image and then comparing the signatures to find others visually similar.
Great for de-duplicating your por^M^M^MPhoto Collection.
IP Enforcement Nightmare (Score:2, Interesting)
The big problem to me is specifying input. I know the "look" of the Mona Lisa's smile, but even with the best pen input methods I'd never be able to mimic DaVinci's subtle emotion of the smile; my hands just aren't capable of doing so. Using photos of the painting could simplify this, but this almost assumes that I'm only looking for the parody's and commercial exploiters of the image rather than the image itself (after all, I have the image to start with). And it raises the further issue that many photographic reproductions of the Mona Lisa that I can get my hands on are still under copyright and I'd be doing something legally questionable with an image long in the public domain.
Add to this the "infinite number of monkeys" issue where legally litigious companies will use technologies like this to scan the internet for litigation targets. Imagine Disney using a cell of Rafiki from the Lion King to find legally similar images that were created after the Lion King was released even if they were only superficially similar. Now do this for all movies back to Snow White or Steamboat WIlly and you could get to be a real visual mob boss with ownership (or at least threat of litigation) over huge libraries of works that weren't even created to intentionally violate Disney "Intellectual Property".
My need for this technology is small considering the input problems I'd have with my artistic abilities, while the litigation nightmare from large databases of "similar" visual data would seem to be more bothersome than helpful. I rather hope these visual search and categorizing methods don't catch on.
_Mind at Light Speed_ by David Nolte (Score:1)
(http://www.dustindecker.com/)
However, he put forth the concept of replacing the bit as the common unit of data with actual images - best described as holographic images of light manipulated by light. A picture really _would_ be worth a thousand words in such a system!
He was my professor, it sucked. (Score:1)
This reminds me of Gibson's Pattern Recognition (Score:3, Interesting)
(https://dawgchain.at/ | Last Journal: Friday January 26 2007, @01:14PM)
How does it do that? (Score:3, Funny)
(Last Journal: Monday January 06 2003, @10:36PM)
more sophisticated than colour matching (Score:1)
There is a GNU project related to this GIFT (Score:5, Interesting)
The GIFT (the GNU Image-Finding Tool) is a Content Based Image Retrieval System (CBIRS). It enables you to do Query By Example on images, giving you the opportunity to improve query results by relevance feedback. For processing your queries the program relies entirely on the content of the images, freeing you from the need to annotate all images before querying the collection.
GIFT [gnu.org] It worked pretty well for me in the demos they linked too. I have been waiting for this type of application to gain momentum.
Sorry, someone else is currently using the system. (Score:1)
(http://www.ninwa.net/ | Last Journal: Thursday July 27 2006, @06:55PM)
Would it work for animated .gifs? (Score:2, Interesting)
(http://del.icio.us/tzarius | Last Journal: Thursday December 29 2005, @12:50AM)
It says "multi lock on" and a date, but all Google reports is other forum posts looking for the creator of the image. Apparently, there's a high-res version of it too.
Apps Already do this. (Score:1)
(http://www.rslittle.com/)
It's db of airplane pictures, they are all similar (Score:1)
I guess anyone can get a research grant these days.
Pretty controllable test (Score:2)
Now, do this for something like Google Images or PBase or collections spanning infinite numbers of subjects and image sizes, then I'll get excited.
No, I've never had a rejection from A.net, I've never submitted there. Two minutes in their forum will tell you how anal their 'screeners' are, for whatever reason. It's just freakin' pictures of airplanes, for chrissakes.
You can tell pretty quickly how they match (Score:1)
Music (Score:1)
(http://home20.inet.tele.dk/sn0wflake/ | Last Journal: Monday August 16 2004, @05:13AM)
Spotlight (Score:1)
Is this a joke? (Score:3, Interesting)
(http://www.kentuckyfriedorphan.com/ | Last Journal: Sunday March 23 2003, @12:46AM)
Next experiment: I took a picture of a highly distinctive plane, a harrier, climbing at a steep angle and viewed in profile. I got, in return, a list of passenger jets, and even a helicopter. Hardly surprisingly, all of the result pictures had the same bluish white sky as my original image. That was literally the only similarity.
According to the introduction on the search page the heuristics used compares colors, contrast and shapes in the images themselves. I saw no correlation whatsoever between shapes, and any correlation in contrast seems to be to be the result of the search engine simply looking for images that contain the same colors in a similar ratio to the original. In short, nothing to see here, move along.
On the other hand, one of the projects listed under the Penn State University link looks fairly fascinating. The Riemann a-LIP project [psu.edu] (automatic linguistic indexing of pictures) doesn't allow user input of images, unfortunately, but it does show some fairly fascinating attempts at verbally qualifying image data. For example, it describes a blue and orange mandelbrot as pattern agate shimer abstract scene, and a sunset over a lake as Berlin Devon Namibia landscape lake scene. Okay, it may still need some work, but it sure beats the hell out of the "find the same color airplane engine".
Oh, you mean like imgseek? (Score:4, Informative)
(http://sogeeky.net/)
Requirements (Score:1)
(http://www.solidz.com/)
But what I don't understand is... (Score:1)
Oracle interMEDIA does this for a while now? (Score:1)
This document was written in 2002 (and that version is as old too)
I can remember some sales guy saying "You can look for a couch that's like this one. But blue."
So that means it should do a bit more than just color patches..
and then there is Google images..
This is similar (Score:1)
ummm... (Score:1)
Purdue University's 3D Shape Search (Score:2, Informative)
altavista (Score:2, Informative)
Professor Wang (Score:1)
Uh-oh. (Score:2)
(http://danbirchall.multiply.com/)
Of course, given the usual course of things, it will instead be deployed at JFK's formerly-TWA terminal, assigned facial recognition tasks, and immediately declare everyone to be among the 10-most-wanted terrorists. I can't wait.
How about just searching for an identical image? (Score:1)
(http://home.happyface.net/)
This would be useful to me as a photographer to see if anyone out there is using my photos.
Worked on something similar (Score:3, Informative)
(http://www.pureinnovation.com/)
I've not RTFA (not had the time), but our approach was to split the images into segments (based on colour and texture) which were assumed to be objects. The segments would then be analyzed for various feature vectors, such as shape, texture, colour etc. These vectors would then be added into a database of numbers, and finally the segments grouped, giving a collection of classified sections which (hopefully) represent similar objects.
From related metadata such as keywords, you could then hope to build up an idea of what keyword matches which section. You could also come up with a relevance between two images, and thus search for similar images.
We didn't have enough time to make it bulletproof by any means, but our limited results were very promising.
Sorry I can't find the paper, but we've got some screenshots of the application here [soton.ac.uk] and here [soton.ac.uk] (you can see false colouring applied to the original image to display the segments)
Reee diii cuuuu looooooooous ! (Score:2)
Image search will kinda work for airplanes in this database,as there are a very limited set of airplane model numbers, which are going to be attached to each photo.
But if the database didnt have these text clues the image search is going to be unlikely to see the similarity between an 747 in the air, as seen from the ground, with a head-on view of a 747, or one at the gate, or one in a hangar, or one in twilight, or one of a different color.
Maybe it could be done by outsourcing the task to India?
Too Literal (Score:2)
(Last Journal: Tuesday August 28 2001, @07:17AM)
However, I think it would be better if it were able to realize what the 'background' was and filter it out. (Though I couldn't begin to guess how you'd do this.)
For example, I searched for this image [airliners.net]. Many of the results [airliners.net] are of something completely different, such as a white jet. Which is nothing like a camo helicopter. But the sky and the ground are pretty similar, and I think that's how it's matching.
It's incredible that we got this far, but I think there's still a long ways to go before it's at the stage where you put in an image, and are awed at how quickly it works.
Also note that I'd tend to think this would exponentially more difficult than searching HTML files, so it might be much more expensive to implement large-scale.
CBIR is the same problem as AI (Score:2)
Even in the restricted context of aeroplanes this is not a trivial problem. Someone in the list of replies submitted an image of a warthog (A-10) and got nonsensical results. Somehow the CBIR system would need to be able to infer a model of the A-10 from a given random 2D projection, and match it to the other 2D projections of the same A-10 model that they do have in the DB. This doesn't sound impossible but it is hard, and I suspect the Penn State people didn't do that. Instead they are probably matching on colour, texture, general appearance, etc.
This is not to say that CBIR is not a nice problem to apply new image processing/image analysis algorithms to, which are developed all the time.
Video loops etc (Score:1)
(http://jamestravels.com/)
Re:wtf? (Score:5, Interesting)
(http://www.cursor.org/)
Re:wtf? (Score:1, Funny)
Re:wtf? (Score:3, Interesting)