Could IBM Shake up the Search Engine World?

Follow Slashdot stories on Twitter

Could IBM Shake up the Search Engine World? 193

Posted by timothy on Monday August 08, 2005 @07:43PM from the slow-approach-then-wham-wham-wham dept.

overshoot writes "IBM has just tossed a bucket of chum into the whole search showdown, which Microsoft thought was between them and Google. Apparently, IBM Research has developed a 'key facts' search technology (as distinct from 'key words') over the last several years. Now they're going public with it -- by putting it on SourceForge under an OSS license!" (According to the article, it's expected to show up on SourceForge by the end of this year, not immediately.)

This discussion has been archived. No new comments can be posted.

Could IBM Shake up the Search Engine World?

Load All Comments

Search 193 Comments Log In/Create an Account

Comments Filter:

Slow down IBM ... (Score:2, Funny)

by Anonymous Coward writes:

The search bar on your site barely works as it is.
- Re:Slow down IBM ... (Score:1)
  
  by scotty1024 ( 584849 ) writes:
  
  I agree, I try their search tool at www.ibm.com every so often and I still to this day have to use Google to find anything on their web site. My money is on them donating it to FOSS so someone can fix it for them.
SourceForge proposal... (Score:2, Funny)

by RoadkillBunny ( 662203 ) writes:

It will be funny if sf.net denies them. But then, I guess they got a deal with them already.
- Re:SourceForge proposal... (Score:1)
  
  by GoldAnt ( 899329 ) writes:
  
  I've always thought google adequately searched for me. AFAIK there isn't anyone else with the same amount of resources dedicated to searching...?
  - Re:SourceForge proposal... (Score:2)
    
    by B3ryllium ( 571199 ) writes:
    
    I think you missed the story where Yahoo outran Google ...
  - Re:SourceForge proposal... (Score:2)
    
    by maxwell demon ( 590494 ) writes:
    
    If I understand the linked article correctly, the new thing is that they don't just look for the occurence of certain words, but try to get some (probably very basic) sort of meaning of the thing. Which I think could give a big advantage esp. for exclusion searches (e.g. Einstein -physicist).
ok but (Score:5, Funny)

by Anonymous Coward writes: on Monday August 08, 2005 @07:46PM (#13274709)

I'll stick to letting Google know every single detail of my life thanks.

Share
twitter facebook
Yay. (Score:4, Funny)

by Sinryc ( 834433 ) writes: on Monday August 08, 2005 @07:46PM (#13274713)

Yay, now EVERYONE can make their own Search Engine and say how they are SO much better then everyone elses!

Share
twitter facebook
- Re:Yay. (Score:2, Funny)
  
  by TheOtherAgentM ( 700696 ) writes:
  
  I plan to make mine far inferior, but drive people to use my search engine with spyware.
  - Re:Yay. (Score:2)
    
    by Karzz1 ( 306015 ) writes:
    
    Bill? Is that you?
    
    Seriously though, isnt that how msn search gets the skewed usage statistics that it does (ok, I digress, IE is *technically* not spyware..... yet).
- Re:Yay. (Score:3, Insightful)
  
  by b0r1s ( 170449 ) writes:
  
  Size of index, speed (requiring hardware, content nodes, etc), tuning (algorithms may be alike, but small tuning makes all the difference with the SEO spam going around), and anti-abuse (worms searching for phpBB urls are bad, m-kay) will keep this from being a 'free perfect search for everyone' tool.
- Re:Yay. (Score:5, Interesting)
  
  by gstoddart ( 321705 ) writes: on Monday August 08, 2005 @08:36PM (#13275013) Homepage
  
  Yay, now EVERYONE can make their own Search Engine and say how they are SO much better then everyone elses!
  
  Well, let's just hope it becomes one big, honkin' FOSS project.
  
  Search technology is huge. Having it available which apparently can index conceptual links as opposed to literal links is astounding.
  
  I say smart move on IBM's side. Get all the publicity of opening up really cool tech to the open-source community, then proceed to make a gazillion dollars in professional services gigs, and get the added benefit of everyone making your tech better because it's useful.
  
  Provided this isn't steamingly fresh technology (unlikely from IBM realy) they should see some interest in this.
  
  I for one, can imagine a nice bunch of associative content, and am wondering how much resources this might require to run on a machine and I'm going to go RTFA. =)
  
  Parent Share
  twitter facebook
  - Re:Yay. (Score:1)
    
    by sentanta ( 619440 ) writes:
    
    If this is licensed under an Open Source license wouldn't Google, Yahoo, etc take whatever is worthwhile and incorporate it into their existing search algorithms?
    - Re:Yay. (Score:2)
      
      by Basje ( 26968 ) writes:
      
      Not if IBM patents it.
      
      That way either
      1. Google, MS, Yahoo etc can use the open source implementation (which is a licence to use the code including the patented stuff), possibly requiring opening their own codebase or
      2. they licence the patents from IBM
      
      Remember IBM still has the largest patent portfolio.
- Re:Yay. (Score:2)
  
  by coop0030 ( 263345 ) * writes:
  
  I for one, am not excited about the fact that any Joe Shmoe could send out robots to index my pages. If there are thousands of robots indexing my pages every day I am going to have a pretty large bandwidth bill to pay.
  
  Let's hope it is complicated enough that not everyone will be able to set up their own search engine easily.
  
  I would be excited though if it was a single large open source entity that works on a competing search engine. That would be neat!
  - Re:Yay. (Score:2)
    
    by jurt1235 ( 834677 ) writes:
    
    There are already thousands of bots and spiders busy on the web. Some really ridiculous ones, so this one more will not really matter.
- Re:Yay. (Score:2)
  
  by shokk ( 187512 ) writes:
  
  Yay! Now web sites can be hit by 100x the irrelevant search engine traffic instead of a few like Yahoo and Google that actually matter. This is a DoD in the making. I'm sure there will be more than a few that decide to ignore robots.txt.
Long hard road. (Score:1, Redundant)

by UlfGabe ( 846629 ) writes:

I applaude IBM for taking this stance and entering the hotly contested search engine world.

More competition is better. I would enjoy more innovation. They do have a hard long road to follow however, and they may find it difficult.

Check out my journal if interested in a difficult problem.
- Re:Long hard road. (Score:2)
  
  by Carnildo ( 712617 ) writes:
  
  The search algorithm is just a minor part of running a search engine. The key part, which Google has down pat, is getting the results from a metric buttload of web pages, doing it fast, and doing it for a very large number of people at once.
  - Just for Reference (Score:5, Funny)
    
    by AoT ( 107216 ) writes: on Monday August 08, 2005 @10:49PM (#13275702) Homepage Journal
    
    10 tads = 1 few
    
    10 fews = 1 some
    
    10 somes = 1 alot
    
    10 alots = 1 load
    
    10 loads = 1 buttload
    
    10 buttloads = 1 assload
    
    10 assloads = 1 shitload
    
    10 shitloads = 1 fuckload
    
    I do not have the book here or I would give the non-metric chart, you know how hard it is to remeber how many hogsheads are in an imperial buttload?
    
    Parent Share
    twitter facebook
- Re:Long hard road. (Score:1)
  
  by drawdevm2000 ( 906099 ) writes:
  
  Yeah but I doubt IBM made like a web search engine like Google, I bet you its just a single site search engine, purely for one site only. But then again you neevr know.
  - Re:Long hard road. (Score:1)
    
    by carl0ski ( 838038 ) writes:
    
    after reading the blurb there was not even a mention of web based. OSS community is itching for an effect Desktop search algorythm maybe this is it. Software to index anything
http://almaden.ibm.com/cs/crawler (Score:5, Informative)

by Urgo ( 28400 ) writes: on Monday August 08, 2005 @07:52PM (#13274765) Homepage

wfp2.almaden.ibm.com - - [08/Aug/2005:15:48:34 -0400] "GET /robots.txt HTTP/1.0" 200 69 "-" "http://www.almaden.ibm.com/cs/crawler [fc7]"
wfp2.almaden.ibm.com - - [08/Aug/2005:15:48:38 -0400] "GET / HTTP/1.0" 200 41317 "-" "http://www.almaden.ibm.com/cs/crawler [fc7]"

I've been getting once a day connections on my server from ibm for quite some time now (a year or so). Doesn't surprise me in the least. :)

Share
twitter facebook
- Re:http://almaden.ibm.com/cs/crawler (Score:2)
  
  by anagama ( 611277 ) writes:
  
  Riuniti on ice - so nice.
- Re:http://almaden.ibm.com/cs/crawler (Score:1)
  
  by muzza ( 64255 ) writes:
  
  almaden.ibm.com
  
  I had to look at that twice because the first time I read laden.bin.com... they really are declaring search engine jihad!
  - Re:http://almaden.ibm.com/cs/crawler (Score:2, Funny)
    
    by johnnytv ( 899977 ) writes:
    
    /usr/local/bin/laden
- Re:http://almaden.ibm.com/cs/crawler (Score:2)
  
  by kinema ( 630983 ) writes:
  
  "GET / HTTP/1.0"
  Why HTTP v1.0 and not v1.1?
not a web search engine (Score:5, Insightful)

by sled ( 10079 ) writes: on Monday August 08, 2005 @07:54PM (#13274776) Homepage

From TFA: "While simple but powerful keyword searches have revolutionized how Internet users locate and retrieve information, IBM is looking to transform how office workers sift through the piles of data stored inside organizations."

The posting implies that IBM is entering into competition with MS and Google. I saw no indication that IBM intends to launch a web search engine.

Share
twitter facebook
- Re:not a web search engine (Score:4, Informative)
  
  by b0r1s ( 170449 ) writes: on Monday August 08, 2005 @08:11PM (#13274878) Homepage
  
  The Google appliance is marketed (if not in the online docs, at least in person) as an enterprise tool for organizations to search their internal data. While this ceratinly isn't their primary revenue stream, this tool would in fact compete with that aspect of Google's business.
  
  Parent Share
  twitter facebook
- Chum (Score:2)
  
  by overshoot ( 39700 ) writes:
  
  The posting implies that IBM is entering into competition with MS and Google.
  No, the posting (at least tried to) implies that IBM is changing the rules on the search game.
  Chum are the bait that you throw to sharks to get them fighting each other.
- Re:not a web search engine (Score:1)
  
  by SlashEdsDoYourJobs ( 905360 ) writes:
  
  One of Google's products is an intranet appliance for "sifting through the piles of data stored inside organisations". This would put IBM in direct competition with them in that market. Public search isn't the only thing that Google does, you know.
- - Both? (Score:2)
    
    by A nonymous Coward ( 7548 ) * writes:
    
    Both "both" are not both needed. :-) for you idiot moderators on crack.
Finally some competition (Score:2, Insightful)

by Device666 ( 901563 ) writes:

Now I think Microsoft has a big problem... Now they really should start becoming innovative... And google finally could have a nice open source competitor. This will increase innovation in giant leaps and ofcourse would make it hard for microsoft ever to beat Google.. This will be a worthy test of the power of open source!!!
- Re:Finally some competition (Score:2, Insightful)
  
  by Donny Smith ( 567043 ) writes:
  
  > Now I think Microsoft has a big problem...
  
  How's that?
  This software has 0% market share (and that was with all the IBM's sales, support and development efforts).
  They couldn't make a dent in the market (why do you think they're releasing it to open source if it's so good)?
  
  >And google finally could have a nice open source competitor.
  
  I don't think so. Those search engine guys are mean mother fuckers - thousands and thousands of full-time engineers working on solely one task - imporoving their search pro
  - Re:Finally some competition (Score:2)
    
    by mforbes ( 575538 ) writes:
    
    I don't think so. Those search engine guys are mean mother fuckers - thousands and thousands of full-time engineers working on solely one task - imporoving their search products/services.
    <snip>
    Google's engineers will be on collective vacation, taking it easy while allowing this open source search engine to get its shit together.
    Make up your mind. Are they on vacation or are they working solely on improving their search engine? (leaving out any comments about your use of such colorful language)
IBM has so much unpublished advanced research (Score:2, Interesting)

by snotclot ( 836055 ) writes:

IBM is pretty crazy when it comes to advanced research in any of its fields.

I have heard of stories from researchers there that IBM has its own terminology for alot of technical EE/CS stuff, as they discovered it way before the world did but were so secretive they didn't publish any of it.

I'm not surprised if IBM has enough tech in search to seriously knock down Google!

This OSS thing comes as a surprise, as it contradicts their secretiveness about their research.
chum and guns (Score:5, Funny)

by Burz ( 138833 ) writes: on Monday August 08, 2005 @07:58PM (#13274800) Homepage Journal

a bucket of chum into the whole search showdown,

This is an awful mixed metaphor. How does Slashdot expect its readers to navigate the treacherous IT seas with such poorly-seasoned and half-baked information?

Share
twitter facebook
- Re:chum and guns (Score:1)
  
  by Soko ( 17987 ) writes:
  
  I think teh reference is from here [xnet.com]:
  
  Amy, I think you're going to earn a place as our Official ASR Sysadmin's Chum. In a secondary, particularly bloody-minded sense of the word.
  
  Steve VanDevender
  
  First thing that I though of.
  
  Soko
- Re:chum and guns (Score:5, Funny)
  
  by overshoot ( 39700 ) writes: on Monday August 08, 2005 @08:19PM (#13274918)
  
  This is an awful mixed metaphor. How does Slashdot expect its readers to navigate the treacherous IT seas with such poorly-seasoned and half-baked information?
  It's easy when you're three sheets to the wind, even if you pepper your reply with editorial condiments. Anyway, the goose is sufficiently sauced to be worth a gander.
  
  Parent Share
  twitter facebook
  - Re:chum and guns (Score:2)
    
    by Hektor_Troy ( 262592 ) writes:
    
    And remember, that a penny saved is worth two in the bushes. Oh, and don't cross the road, if you can't get out of the kitchen.
- Re:chum and guns (Score:5, Funny)
  
  by CaptainCarrot ( 84625 ) writes: on Monday August 08, 2005 @08:22PM (#13274937)
  
  I know! It throws a monkey wrench into that entire kettle of fish! There's no foothold you could sink your teeth into! It blows your mind from the ground up!
  ...and so forth.
  
  Parent Share
  twitter facebook
- Re:chum and guns (Score:2)
  
  by Tsu Dho Nimh ( 663417 ) writes:
  
  Hey, give the guy a break! He was writing that blurb BEFORE HIS MORNING CUP OF COFFEE!!!!
  I wuz there.
what about yahoo!? (Score:5, Insightful)

by dezmund ( 903218 ) writes: on Monday August 08, 2005 @07:59PM (#13274807)

MSN thought it was between them and google?
http://news.yahoo.com/news?tmpl=story&u=/cmp/20050 722/tc_cmp/166401634 [yahoo.com]
sorry bill, but if anything its between yahoo (22% share of all searches) and google (47%).

Not to mention most of those MSN searches (12%) are from IE users who don't know how to change their browser's start page.

Share
twitter facebook
- Re:what about yahoo!? (Score:1)
  
  by Exitar ( 809068 ) writes:
  
  A friend of mine for example.
  I expect this post to be modded informative of course...
- Re:what about yahoo!? (Score:4, Funny)
  
  by Punboy ( 737239 ) writes: on Monday August 08, 2005 @09:40PM (#13275336) Homepage
  
  Plus, those who think that the address bar are for system commands (and are thus afraid of it) and the search-bar is where you type in the website address o.O
  
  My grandparents are weird.
  
  Parent Share
  twitter facebook
- Re:what about yahoo!? (Score:2)
  
  by ciroknight ( 601098 ) writes:
  
  Uh where do you get your numbers and do you work for Yahoo/Google/MSN/etc.?
  
  Last I heard it's pretty darned impossible to tell just how many searches are processed by which search engine unless you are actually within those companies and have access to that company's numbers.
  
  It's possible to get averages from websites by referal beacons, but some engines list sites higher than others, some are enhanced by paid ads, etc. etc. IT's just not scientific at all to post percentages of what you don't know.
  
  S
- Re:what about yahoo!? (Score:2)
  
  by bhtooefr ( 649901 ) writes:
  
  I know someone who KNOWS how to change her browser start page (although at our school, she couldn't unless she used another browser), yet STILL uses MSN. She told me that she doesn't use Google because she doesn't trust it. Fair enough, but MSN instead? WTF is THAT?
  
  And it's not MS love, either - I even had her using Opera for a while, and it wasn't even b/c of security. Of course, this was all at school, and they took down the public share, so Opera was harder to use...
Get it now (Score:4, Informative)

by QuantumG ( 50515 ) writes: <qg@biodome.org> on Monday August 08, 2005 @07:59PM (#13274810) Homepage Journal

Unstructured Information Management Architecture SDK [ibm.com]. The UIMA SDK (Software Development Kit), is an all-JavaTM implementation of the UIMA framework, and it supports the implementation, description, composition, and deployment of UIMA components and applications. It also supports the developer with an Eclipse -based development environment that includes a set of tools and utilities for using UIMA.

Go you crazy Java dudes, go.

Share
twitter facebook
I, for one, ... (Score:2, Funny)

by kaan ( 88626 ) writes:

I, for one, welcome our new chum-tossing search-engine overlords...
This means K-... (Score:1, Offtopic)

by TransEurope ( 889206 ) writes:

KDeskserach?

KDeskfinder?

Koogle?

Kahoo?

...in the next KDE :D
What is still missing... (Score:2)

by sploxx ( 622853 ) writes:

is a P2P layer on top of this complete with efficient, distributed and secure search. A good P2P search engine is still missing and (IMHO) one of the more important things needed, last but not least for political reasons (privacy, censorship etc.).

That would make it possible to give back control of every aspect of the 'web experience' to the user.

Ok, I'm dreaming :-)
- Re:What is still missing... (Score:3, Informative)
  
  by nostriluu ( 138310 ) writes:
  
  You could start with this: http://www.yacy.net/yacy/ [yacy.net]
  - Re:What is still missing... (Score:2)
    
    by sploxx ( 622853 ) writes:
    
    Looks interesting, thanks for the link! :)
Just ignore the link in the slashdot item (Score:5, Informative)

by hackwrench ( 573697 ) writes: <hackwrench@hotmail.com> on Monday August 08, 2005 @08:08PM (#13274860) Homepage Journal

The important information is simply the url http://www.alphaworks.ibm.com/tech/uima/ [ibm.com]

Share
twitter facebook
- Re:Just ignore the link in the slashdot item (Score:5, Informative)
  
  by SnprBoB86 ( 576143 ) writes: on Monday August 08, 2005 @10:34PM (#13275620) Homepage
  
  Definitely read/skim the SDK User's Guide http://dl.alphaworks.ibm.com/technologies/uima/UIM A_SDK_Users_Guide_Reference.pdf [ibm.com]
  
  The annotator premise is almost too simple; it's brilliant.
  
  Parent Share
  twitter facebook
The "Don't Be Evil" Contest... (Score:2)

by ScentCone ( 795499 ) writes:

...will sure light up. There will be so many people trying out-do the not-doing-evil of all of the other search engines that they'll have to resort to being evil just to prove how not evil they are.
Just a thought - distrubuted search (Score:1)

by Eightyford ( 893696 ) writes:

I'm not sure if this is feasable as it would be hard to ward off spammers, but is there any chance that we could see an OSS distributed search system that works like SETI@HOME?

Maybe I'll patent it, before Epicrealm does...
- We might start to see the limits of OSS (Score:2)
  
  by tentimestwenty ( 693290 ) writes:
  
  I'm in agreement here. If anyone can see the algorithms, then it's going to be pretty easy to manipulate the results and ruing the efficiency. Perhaps this will be the first example of the limits of OSS due to the necessity for secrecy.
huh? (Score:2)

by pokka ( 557695 ) writes:

which Microsoft thought was between them and Google.

Where did this come from? It certainly wasn't part of the article. With BAIDU's IPO [fool.com], and Yahoo expanding its index count [yahoo.com] to 20B pages (almost 4x Google's count), I seriously doubt that anyone in the search engine business thinks they can predict who will dominate in a few years - it's possible that the next "pagerank killer" is written by some CS grad students or by a search engine company that hardly anyone has heard of (yet).
- Re:huh? (Score:1)
  
  by a gash ( 891166 ) writes:
  
  Pagerank is beautiful for it's simplicity, but it is a specific implementation of the search layer. What IBM is touching on here is not a search layer, it's parseing layer on top of the search engine. That parseing layer will be where the search companies fight it out over the next few years. The concept is called Natural Language Search and I'm sure all the big boys have been working on it for some time now. IBM hasn't hit it here, but they defintely just took a step ahead of google et al.
  
  Ask Jeeves tr
Wait... (Score:2)

by nmb3000 ( 741169 ) writes:

...which Microsoft thought was between them and Google.

I think it still is pretty much between them (and perhaps Yahoo) as IBM is obviously not actively persuing this market. From first glance it appears that they wanted to give search engines a swing, and in the end decided not to go after it. However being IBM, instead of burying their research they released it into the public so others can benefit from it.

While this is good, but Microsoft and Google really have nothing to worry about. It's not like Bi
- Re:Wait... (Score:2)
  
  by MichaelSmith ( 789609 ) writes:
  
  However being IBM, instead of burying their research they released it into the public so others can benefit from it.
  IBM may yet live to benefit from this project. A new google-like startup will need their own software to start their business, so they won't use it. IBM own the copyright, and have their own capital so they could start their own search engine with the OSS software at a later date.
Big Blue Marbles (Score:4, Insightful)

by Doc Ruby ( 173196 ) writes: on Monday August 08, 2005 @08:20PM (#13274927) Homepage Journal

So Google and MS will incorporate the "key facts" code into their products. That won't exactly shake up the search engine world. It will (possibly) improve it for everyone, and maybe (if "key facts" works better than their proprietary "key words" functions) even let another engine compete in their category. The latter might shake something up. But, like every other mass human activity, this competition is fought over brand names. Google clevery established a terrific brand, through careful simplicity and consistency in graphic and info design. This IBM release would merely grant more substance to the existing brands, and some substance to any newly emerging one. Which new brand would have to establish its own competitive value, largely through style.

IBM's move does have the power to shake up the open/proprietary software jihad underway. If Microsoft used their open code, it would be hard for MS to claim that open source is inherently bad, or proprietary code is inherently superior. Google would demonstrate the same argument, but no one complains about Google's code remaining proprietary, because it mainly runs on their servers, which few people yet demand should be opened to outsiders. These are the kind of subtle strategic moves that let IBM continue to pull the strings of the entire industry. Success that generates more business and flexibility for IBM, in the mixed open/proprietary space it's carving for itself, will also demonstrate another powerful idea. American corporations can achieve market influence through strategic deployment of basic R&D. Not just through proprietary products, but also through manipulation of competitors who adopt open tech they create.

All in all, this looks like a smart move by IBM. Let's hope 1> this rumor is true; 2> the tech is really good; and 3> we're not already too far gone down the entrenched lines between our corporate jihadis to get the benefit of the mutual cooperation that this tech could enable, to great mutual benefit.

Share
twitter facebook
- - Re:Big Blue Marbles (Score:3, Interesting)
    
    by Doc Ruby ( 173196 ) writes:
    
    The evolution of GPL software into embedded apps that interop with other, non-GPL apps, shows that one basic premise of the FSF worldview is wrong: users and programmers actually have different values, not identical ones, at least where getting the source code is concerned. Practically no users, and even only few programmers, and , have expressed any desire (beyond mere whining) to get the source code for apps with which they only want to interop. So GPL requirements to release new code that hasn't actually
will it be good enough (Score:1)

by Fr05t ( 69968 ) writes:

to know I'm looking for amateur or anal when I search for 'a'?
Little to do with opponents... (Score:5, Informative)

by AutopsyReport ( 856852 ) writes: on Monday August 08, 2005 @08:30PM (#13274977)

From the article... "I don't see any of the major players moving into this area," Arthur Ciccolo, head of search technology at IBM Research, said of how major consumer Internet search companies such as Google, Yahoo Inc. and Microsoft have focused on the public Internet instead of private record data retrieval.
And from the Slashdot summary... IBM has just tossed a bucket of chum into the whole search showdown, which Microsoft thought was between them and Google.
No, IBM's technology has little to do with Google, Yahoo or Microsoft's search technology. This isn't a competition until either three introduce similar technology. Reading the article's third paragraph would clarify this, and would make the summary a little more accurate, too.

Share
twitter facebook
- Re:Little to do with opponents... (Score:2)
  
  by evilviper ( 135110 ) writes:
  
  No, IBM's technology has little to do with Google, Yahoo or Microsoft's search technology. This isn't a competition until either three introduce similar technology.
  
  Similar private-record search products? Like the google search appliance that has been around for years now?
  
  http://www.google.com/enterprise/ [google.com]
IBM DB2 extensions... (Score:2)

by farrellj ( 563 ) writes:

About 8 years ago, when I was writing software for OS/2, I ran across an interesting extension that IBM had for its DB2 software, called (I think) the Ultimedia extensions. These would allow you to search photos for a type of object that it understood. So you could tell it to search for all pictures that had a red ball and a tree...and it would return a list of all photos with those two objects. It was really interesting, but I have not heard anything about it since then...

ttyl
- Re:IBM DB2 extensions... (Score:2)
  
  by electrichamster ( 703053 ) writes:
  
  Looks interesting, there's a blurb about it here:
  http://www-306.ibm.com/software/data/umm/umm.html [ibm.com]
Why wait for SourceForge? (Score:4, Informative)

by r_jensen11 ( 598210 ) writes: on Monday August 08, 2005 @08:32PM (#13274990)

It's available now [ibm.com]. As the article says:

UIMA technology is expected to be made available through open-source software site SourceForge by the end of 2005. The UIMA framework can currently be downloaded free of charge from IBM AlphaWorks at http://www.alphaworks.ibm.com/tech/uima/ [ibm.com].

So, I ask, why wait for it to appear on SF if we can get it now?

Share
twitter facebook
- Re:Why wait for SourceForge? (Score:3, Informative)
  
  by Anonymous Coward writes:
  
  um, because it's closed source right now
IBM has so much unpublished advanced research (Score:1)

by jigglysnot ( 906155 ) writes:

IBM is pretty crazy when it comes to advanced research in any of its fields. I have heard of stories from researchers there that IBM has its own terminology for alot of technical EE/CS stuff, as they discovered it way before the world did but were so secretive they didn't publish any of it. I'm not surprised if IBM has enough tech in search to seriously knock down Google! This OSS thing comes as a surprise, as it contradicts their secretiveness about their research.
- Re:IBM has so much unpublished advanced research (Score:1)
  
  by xiaomonkey ( 872442 ) writes:
  
  Interesting....
  
  I thought IBM tried to patent everything and anything plausibly patentable that came across the desk of someone on their research team.
  
  If they patent everything, they can be pretty sure that they'll be able to extract some pretty hefty licensing fees from the industry at large. However, if they keep too many things under wraps, while they might gain a competitive advantage for a product that they're bringing to market relatively soon, they risk loosing the ability to file for all of the r
Open Source, but who will be able to run it? (Score:2)

by michaeldot ( 751590 ) writes:

The key to search engines, whatever their underlying ranking algorithm, is trawling through the couple of billion pages on the net to generate the data to be be searched.

Obviously most of us simply don't have the bandwidth or the computing power & storage to do that.

So are IBM treating the search engine source release as a hypothetical interest for people who can't actually make practical use of it, or are they going to give access to their own trawled data?

If the latter, then this is very significant.
- Re:Open Source, but who will be able to run it? (Score:2)
  
  by michaeldot ( 751590 ) writes:
  
  Scrub that, I assumed from the (arguably misleading) "Google vs Microsoft" in the intro that it was search in the web context. RTFA showed it's about corporate data searching, so my "net trawling" comment makes no sense. Sorry. Wishful thinking I guess. Gotta learn not to RTFIntros.
I would look forward to this (Score:2)

by Mal-2 ( 675116 ) writes:

There have been many times when I have known what something is or does (since I've seen it in action), but not what it is called. If I could search for information on the basis of known facts, rather than just guessing at search terms, I think I would have much quicker success at such searches. I can usually find whatever I needed to know, but it can take weeks if I don't know the words to search for. Sometimes it takes joining mailing lists or asking people personally. Yeah it works, and the current system
They keep giving stuff away! (Score:1)

by elgee ( 308600 ) writes:

No wonder my IBM stock is tanked.
Don't count on it being of any use. (Score:2)

by duffbeer703 ( 177751 ) writes:

If this radical new technology is anything like the new, improved, "Deep Blue" search backing IBMs support pages, its a real piece of junk, almost like Altavista circa 1998.
- Re:Don't count on it being of any use. (Score:2)
  
  by cant_get_a_good_nick ( 172131 ) writes:
  
  It isn't. It's new, called UIMA. We eval'ed this under an NDA. I's pretty cool, though what we saw was an SDK more than a package you can install. it is more of an infrastructure, so can be used to create new engines.
Not quite a new concept... (Score:2)

by Excelsior ( 164338 ) writes:

You almost make it sound as if this is the first OSS search engine out there. Apache Jakarta's Nutch [apache.org], a subproject of Lucene, has been around for over two years. I haven't done tons of research on the subject, so I'm betting Nutch isn't the only one.
Jaws reference (Score:2)

by SethJohnson ( 112166 ) writes:

....IBM tossed a bucket of chum into the whole search showdown...

To which Paul Allen responded from the deck of his yacht [yachtcrew-cv.com], "We're going to need a bigger boat!"

Seth
Proposal to start OSS search organisation (Score:2)

by jurt1235 ( 834677 ) writes:

Fellow /. viewers,

Why not do the following: Several of us have access to sufficient infrastructure (own/lease with diskspace to spare, plus a bandwidth surplus).
Why do we not combine that in a distributed search environment with mirrored nodes with this technique of IBM. The addition of the distributed technology to spider and index the web will be a significant challenge, but the concept is I think pretty appealing. I for one will be willing to "donate" the necessary domain and starting facilities.

Anyon
So, whats the real deal here? (Score:2)

by chefren ( 17219 ) writes:

After searching for a whooping 5 minutes and even googling (gasp!) I couldn't find any decent article about what this actually is, just lots of info on how to use it. It looks like there is a new query language so it might be interesting for query expansion. But how does it extract these key facts from the documents? Does it do real natural language analysis? Just guess by looking at the document terms like every other search technology? Or is it just a framework that doesn't really do anything by itself? I
A Bucket of Cold Water (Score:2)

by bobej1977 ( 580278 ) writes:

This is interesting, but somwhat deceptive. IBM has created a framework, not an actual search engine. The framework is effectively a data layout combined with a processing pipeline and query engine that gives emphasis to semantic processing of information, rather than strictly textual. See IBM's FAQ regarding Annotations [ibm.com].
You still have to buy the software that will plug into the framework in order to actually process the information, though some open source projects are certain to come along.
This is
Author says Google is largest computer company... (Score:2)

by mcguyver ( 589810 ) writes:

I like how the author points out Google as being the "worlds largest computer company" in the same article as IBM. Apparently having the company name International Business Machines, having $100B in assets and revenues of $100B a year will not trump a hyped up dot-com company with $3B in assets and revenues of $3B a year. Surely Google is the world leader in search but when did that become the only function of computers?
- Re:IBM? YOU SERIOUS? (Score:1)
  
  by Glooty-Us-Maximus ( 865500 ) writes:
  
  Is that why powerhouses such as Ebay use it as well as other IBM products such as Websphere?
  - Re:IBM? YOU SERIOUS? (Score:2, Interesting)
    
    by oopsdude ( 906146 ) writes:
    
    IBM has always been cozy with eBay; as I recall, eBay's logo said "powered by IBM" for quite a long time.
    - Re:IBM? YOU SERIOUS? (Score:1)
      
      by Glooty-Us-Maximus ( 865500 ) writes:
      
      And how does that detract from the quality of IBM's products? I'm sure IBM may have given them discounts for some free advertising in the form of that logo, but I don't feel that Ebay would have gone with an inferior product which would cost them money through downtime or the inability to handle a higher number of users.
  - eBay runs on Sun, not IBM. (Score:3, Informative)
    
    by NeoBeans ( 591740 ) * writes:
    
    Is that why powerhouses such as Ebay use it as well as other IBM products such as Websphere?
    That was a long time ago in a galaxy far, far, away. eBay now runs on Sun [ebay.com].
- Re:Information wants to be free... (Score:1)
  
  by WillAffleckUW ( 858324 ) writes:
  
  as in freedom, not free as in beer.
  
  isn't that supposed to be bheer?
  
  However, while I agree that Information wants to be free, I prefer cider myself.
- What information really wants (Score:2)
  
  by InfiniteWisdom ( 530090 ) writes:
  
  ...is for you to stop anthropomorphizing it/
- - Re:Beer wants to be free, too. (Score:1)
    
    by Kloog ( 883423 ) writes:
    
    Seems to work for everyone else. I wouldn't want to rock the boat while navigating these chum-filled waters.
- Re:The only thing IBM is going to do... (Score:1)
  
  by guaigean ( 867316 ) writes:
  
  Why exactly? Because they are gaining more supporters through community offerings?
- Re:Spotlight! (Score:1, Funny)
  
  by Anonymous Coward writes:
  
  Depends on watts and type of bulb.
- Re:Spotlight! (Score:3, Insightful)
  
  by FLAGGR ( 800770 ) writes:
  
  Damnit, are you talking about spotlight in Tiger? There's a huge goddamn difference between a desktop indexing search and an internet search engine. My god. The scale is like, so insanly different (and if the Apple PR has said anything about it being scallable to the likes of an internet search, then I'm selling my mac, NOW) How does this compare to spotlight? How does an apple compare to an orange? How does the color red compare to the number 7.623? How does 6 in the afternoon compare to the goatse man?
  - Re:Spotlight! (Score:2)
    
    by FLAGGR ( 800770 ) writes:
    
    Goddamn it, just read the article. The slashdot summary was slightly misleading. I still hold that they are vastly different technologies, but a little less so now that I RTFA. Sorry.
    - Re:Spotlight! (Score:2)
      
      by bradkittenbrink ( 608877 ) writes:
      
      Don't apologize, How does 6 in the afternoon compare to the goatse man? just became my new .sig.
  - Re:Spotlight! (Score:1)
    
    by seramar ( 655396 ) writes:
    
    "How does 6 in the afternoon compare to the goatse man"
    
    By about 180 degrees.
- Re:Google? (Score:3, Informative)
  
  by confusion ( 14388 ) writes:
  
  I'm guessing that IBM has a 50% higher market cap, 30X Google's revenues and $110B in assets doesn't come into play here?
  
  Jerry
- Re:LOL - Google is more than an algorithm (Score:2, Interesting)
  
  by pogson ( 856666 ) writes:
  
  Google is a search engine farm. It would take a while for anyone to catch up even with a better recipe. If IBM's stuff is FOSS, Google could use it.
  This is good news anyway. Keyword/phrase searching becomes less useful as the universe expands. I have 11000 texts fully indexed with swish-e and I get way too many hits unless I use phrases. If I knew what phrase was in the books I sought, I would not need the search engine.
  I love search engines because I cannot figure out how to organize a file cabinet or a ha

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

Slow down IBM ... (Score:2, Funny)

Re:Slow down IBM ... (Score:1)

SourceForge proposal... (Score:2, Funny)

Re:SourceForge proposal... (Score:1)

Re:SourceForge proposal... (Score:2)

Re:SourceForge proposal... (Score:2)

ok but (Score:5, Funny)

Yay. (Score:4, Funny)

Re:Yay. (Score:2, Funny)

Re:Yay. (Score:2)

Re:Yay. (Score:3, Insightful)

Re:Yay. (Score:5, Interesting)

Re:Yay. (Score:1)

Re:Yay. (Score:2)

Re:Yay. (Score:2)

Re:Yay. (Score:2)

Re:Yay. (Score:2)

Long hard road. (Score:1, Redundant)

Re:Long hard road. (Score:2)

Just for Reference (Score:5, Funny)

Re:Long hard road. (Score:1)

Re:Long hard road. (Score:1)

http://almaden.ibm.com/cs/crawler (Score:5, Informative)

Re:http://almaden.ibm.com/cs/crawler (Score:2)

Re:http://almaden.ibm.com/cs/crawler (Score:1)

Re:http://almaden.ibm.com/cs/crawler (Score:2, Funny)

Re:http://almaden.ibm.com/cs/crawler (Score:2)

not a web search engine (Score:5, Insightful)

Re:not a web search engine (Score:4, Informative)

Chum (Score:2)

Re:not a web search engine (Score:1)

Both? (Score:2)

Finally some competition (Score:2, Insightful)

Re:Finally some competition (Score:2, Insightful)

Re:Finally some competition (Score:2)

IBM has so much unpublished advanced research (Score:2, Interesting)

chum and guns (Score:5, Funny)

Re:chum and guns (Score:1)

Re:chum and guns (Score:5, Funny)

Re:chum and guns (Score:2)

Re:chum and guns (Score:5, Funny)

Re:chum and guns (Score:2)

what about yahoo!? (Score:5, Insightful)

Re:what about yahoo!? (Score:1)

Re:what about yahoo!? (Score:4, Funny)

Re:what about yahoo!? (Score:2)

Re:what about yahoo!? (Score:2)

Get it now (Score:4, Informative)

I, for one, ... (Score:2, Funny)

This means K-... (Score:1, Offtopic)

What is still missing... (Score:2)

Re:What is still missing... (Score:3, Informative)

Re:What is still missing... (Score:2)

Just ignore the link in the slashdot item (Score:5, Informative)

Re:Just ignore the link in the slashdot item (Score:5, Informative)

The "Don't Be Evil" Contest... (Score:2)

Just a thought - distrubuted search (Score:1)

We might start to see the limits of OSS (Score:2)

huh? (Score:2)

Re:huh? (Score:1)

Wait... (Score:2)

Re:Wait... (Score:2)

Big Blue Marbles (Score:4, Insightful)

Re:Big Blue Marbles (Score:3, Interesting)

will it be good enough (Score:1)

Little to do with opponents... (Score:5, Informative)

Re:Little to do with opponents... (Score:2)

IBM DB2 extensions... (Score:2)

Re:IBM DB2 extensions... (Score:2)

Why wait for SourceForge? (Score:4, Informative)

Re:Why wait for SourceForge? (Score:3, Informative)

IBM has so much unpublished advanced research (Score:1)

Re:IBM has so much unpublished advanced research (Score:1)

Open Source, but who will be able to run it? (Score:2)

Re:Open Source, but who will be able to run it? (Score:2)

I would look forward to this (Score:2)

They keep giving stuff away! (Score:1)

Don't count on it being of any use. (Score:2)

Re:Don't count on it being of any use. (Score:2)

Not quite a new concept... (Score:2)