New Clustering Search Engine to battle Google 189
Sophrosyne writes "The New York Times is reporting a new search engine [free if DNA on file with Homeland Security] named "Clusty" is going to try and take Google head-on. The new search engine was developed by three former CMU computer scientists who formed the company Vivisimo. The search engine uses Overture for it's results but offers new features such as an encyclopedia search, clustered results, and a gossip search."
Klutsy? (Score:5, Insightful)
More like New Clustering Search Engine goes Beta. Let's wait until it's production stable before talking about who it's going to take down in a fist fight reminiscent of the Spock/Kirk battle in Amok Time [ericweisstein.com].
Clusty by Vivisimo? Did I even spell that right? They need to consider naming things that people can:
A) pronounce
B) spell
C) are actual words or at least close to words that qualify for both A & B.
Clusty sounds like something you would call the fat cheerleader. It also will be often mispronounced as Klutsy, so it's a very bad name for a search engine (of all things).
The search engine uses Overture for it's results but offers new features such as an encyclopedia search, clustered results, and a gossip search.
This is a Microsoft tactic: add features to get market share, and it's an evil tactic because nothing new comes out of it, except bloat and bad karma. The fact this is based on Overature leads me to believe that it won't be able to take Google head-on at all. Clusty uses the Google interface but shows sponsored results first (evil), and displays 404 pages [clusty.com] in the results. (FYI dteam was the first 3d design guild that is no longer)
I don't think they really have a hope of competing with Google. If it ain't broke don't fix it, so most people will just continue to use Google.
Re:Klutsy? (Score:5, Informative)
and the exact point you make was mentioned back then.
heres the article (january)
http://slashdot.org/articles/04/01/05/1839233.s
I think clusty.com is better, but now makes me think of unclean prostitutes.
Klitsy? (Score:3, Funny)
And Google makes me think of clean prostitutes!
Re:Klutsy? (Score:2, Insightful)
Re:Klutsy? (Score:5, Funny)
A) pronounce
Well, Google has got everyone beat in this regard. "Google" is probably the first thing a baby says (and hence I'm sure it is hardwired into our brains). The only thing that could beat "Google" would be "dada" or "burp". Any takers?
Dada (Score:5, Insightful)
You joke, but a search engine named Dada would likely be well received for the name, and if it was a good system it could find a nice user base. I mean it has taken Google *years* to perfect its systems and they started with a good premise: do no evil. That was when all the search engines were cashing in on ads. A lot of people were turned off of the internet because of that, until Google came along. So it was purposeful, not evil, and light/easy to use.
My suggestion to anyone trying to take on Google is that they should do something else unless google becomes evil, and because power corrupts and absolute power corrupts absolutely -- it's just a matter of time before Google turns evil. Maybe not, though.
Re:Dada (Score:3, Funny)
Re:Dada (Score:2)
--
And because I'm only going to make one post and crawl into bed, the color scheme is horrid, the layout not very frienly, and the dialogue options for advanced searches so un-organised it's painful to use.
Fix the name, colors and layout, and it'd actually be a contender against google. I'd say borrow from everyone else other thank ask jeeves and go with a nice clean white background with easilly readable results, clearly layed out
Dada would be... (Score:5, Funny)
This could be very, very difficult. How would you implement such a thing, from a technical standpoint?
Re:Dada would be... (Score:2)
Re:Klutsy? (Score:1)
I totally agree on the naming issue: I was thinking "Klusty the Clown"??? And "Vivisimo" sounds more like a toy company to me.
The interface isn't gonna sell it either. It reminds me more of http://alltheweb.com/ [alltheweb.com] (at least the front page) though - but uglier.
Re:Klutsy? (Score:2, Informative)
Valdes-Perez said his company dumped the name Vivisimo for the search engine because it was ``an obstacle.''
``It's a name that is difficult to pronounce and type and spell. Other than that, it's a great name,'' he quipped.
But the new name may face similar challenges, Valdes-Perez acknowledged. Though it is easy to remember, for many people Clusty evokes the name Krusty the Clown, the not-so-kid-friendly character on ``
Re:Klutsy? (Score:2)
I wonder if they thought of "vivis"? Kind of sounds cool... most cultures can pronounce it, and it has no tricky syllable combinations. It's evolved from something related to project, and it's easy to remember and not mix up letters from memory.
Re:Klutsy? (Score:3, Insightful)
A) pronounce
B) spell
C) are actual words or at least close to words that qualify for both A & B.
The main reason why I used altavista for so long was actually because I didn't manage to spell google right. Honestly. I had to try all kind of combinations everytime I wanted to go there, like gogle, googel, gogel. I should also say that english is not my native language.
Re:Klutsy? (Score:3, Insightful)
Re:Klutsy? (Score:2)
Have you heard of bookmarks or autocomplete?
By the way, I never used Altavista because it reminded me too much of Asta La Vista, I had difficulties remembering which one was the right one.
Re:Klutsy? (Score:2)
However I agree with you totally in one point - Clusty is already totally bloated. If you look at google itself's example who won
Re:Klutsy, but... (Score:1)
They are using Linux with Apache
Results are clustered that's the only think I love right now. But I'm dam sure rest of competitor get this stuff soon
Re:Klutsy? (Score:2)
Re:Klutsy? (Score:3, Insightful)
Whether it's beta or not doesn't matter. Google picked up most of its steam by word of mouth while it was still in beta and was already on its way to becoming the dominant search engine by the time it took off the beta tag. Just look at Google's own Gmail beta. Hotmail and Yahoo! didn't have to "wait
Re:Klutsy? (Score:2)
Google has sponsored results too. At least they're separating them out from the main index, like G does.
Re:Klutsy? (Score:2)
Re:Klutsy? (Score:2)
Not according to your examples - every one is multisyllabic...
Of course, your main point is dead on. I suspect the problem with thematically-related site names is built-in. Take books - you'd remember "related name", then would try book, bookstore, library, text, novel, etc., rarely if ever going back to the right site.* The fact that "Amazon" wasn't one of the taken-for-granted synonyms for book meant that people had to make the unlikely menta
is going to try and take Google head-on. (Score:2, Insightful)
A cute name is a start.
Re:is going to try and take Google head-on. (Score:4, Funny)
Re:is going to try and take Google head-on. (Score:2)
Re:is going to try and take Google head-on. (Score:1, Funny)
And "Clusty" is not a cute name. It's kind of creepy.
Gossip filter (Score:3, Insightful)
Re:Gossip filter (Score:1)
Somebody did an "unclean search" for Google using normal search and "clean" search results the same way but I'm too lazy to google for it.
Since when is search a solved problem? (Score:3, Insightful)
All Clusty, A9 and the other more recent search engines seem to do is add more gimmicks to search results from yahoo and Google respectively. To some extent, this seems to be exactly what Google is doing recently as well: the searches are hardly getting beter, instead we can search news, search references (try define:), search printed text, do automatic conversions, etc etc.
But the truth is that not only are the searches at Google not getting better: they are getting worse. It seems like PageRank is more or less unused nowadays, and Google just uses easily manipulated things like searchterm in URL, searchterm in Title, how recently updated, to rank pages. I think anybody who uses Google to search for specific things must have observed that it works only a fraction of how well it did when it was new.
So what is going on here? Does everybody consider the basic searching a solved problem, and that we don't need to find pages better than google does? Or is a good search that cannot be manipulated really an intractable problem?
If I owned Google stock, I would really be wondering how many of all those thousands of PhD's at the Googleplex are working on this, and how many are writing gimmicks and elegant webmail applications. Or maybe one of them already proved that the problem can't be solved, and Google is just hoping to make as much money as possible before the secret comes out...
Re:Since when is search a solved problem? (Score:2)
But Clusty's clusters seem like a nice gimmick. Do a search for 'Debian', and it returns the usual, but with options to look at the clusters 'Debian GNU', 'Install', 'Package', 'Reviews', and so on.
Of course, these clusters don't work very well for every search. It's anglo-centric, and searches for things that are unknown to the English speaking world get some really
Re:Since when is search a solved problem? (Score:2)
Re:Since when is search a solved problem? (Score:1)
Re:Since when is search a solved problem? (Score:2, Interesting)
This is a battle that will always go on. Change your page rank system and people will just start gaming it again.
What Clusty/Vivisimo accomplishes is that by clustering data, it takes sequence out of play. Even if my preferred pages for "Debian's soci
Re:Since when is search a solved problem? (Score:2)
Why not just search for "Debian's social contract" if that's what you're looking for?
Re:Since when is search a solved problem? (Score:2)
Re:Since when is search a solved problem? (Score:2)
Google has been pretty stagnant for months now, but it was always easy enough to manipulate, and is generally considered harder to game now. Right now they are not giving good rank to most new sites for some reason, perhaps to fight "SE spam."
Google is probably in the process of massive overhauls too. It hasn't really updated its algorithm much in months, and many people believe we're in for a big shake-up.
Certainly there is no such thi
Re:Since when is search a solved problem? (Score:2)
That is pretty fresh coming from somebody spamming his every post with commercials for a two-bit pyramid scheme...
Not impressed; but more competition is good (Score:5, Insightful)
Under the heading "House" are the news items:
And under the heading "Record", are listed:
Having said this, I wish Vivisimo all the luck. Google needs more competition; it is what will give us the Next Great Search Engine(tm).
Ob: I, for one, would like to welcome our new clustering overlords.. ;-)
Re:Not impressed; but more competition is good (Score:1)
Re:Not impressed; but more competition is good (Score:1)
I gave 'Clusty' the same search and it really did reduce the amount of time it took to get the relevant information. Yes, most of the results were the same, and I didn't get any new answers for my problem, but I did get to the information f
Nothing (Score:1, Funny)
Re:Nothing (Score:1)
ooh a complete suite of search engines (Score:2, Funny)
not a very reliable porn search engine.
Encyclopedia? Bah! (Score:4, Insightful)
Now if they'd done some sort of deal with Britanica to gain search access to its online library, THAT would be a resource worth posting to
I will need... (Score:2)
Cb..
Clusters, Folders, yakkity-yak (Score:1)
I occasionally used Vivisimo's search engine years ago (I don't know if it had any association with Overture at the time).
It would take the search results and place them in categorized folders so that you could narrow down the search. Naturally, they picked the categories.
I suppose some filtering would be nice, but do I really need them to do something I should have done when I came up with my search parameters in the first place?
Slashdot Tab (Score:1, Informative)
Hrmm... (Score:3, Interesting)
I gave a9 [a9.com] a try, I like the interface and some of the new features like the search history and the multiple search panes. But shortly after I found myself using google again. Even though a9 uses google, and the results are almost identical, I didn't find anything compelling enough to make me switch.
Does anyone else feel they might be missing some results if they were to use another search engine?
What must a new search engine provide to "steal" users from google?
Free iPods? Sure! [freeipods.com]
With .sig: (Score:3, Funny)
Free iPods? Sure!
Well, I guess that's one way to do it...
Why do Clusty... (Score:1, Redundant)
Re:Why do Clusty... - publicity (Score:2)
Even on Slashdot, a lot of people didn't know about Vivisimo.
Now you do.
The money they put in the rebranding is FAR less then wath this amount of publicity would cost to buy.
A better mousetrap (Score:4, Interesting)
Re:A better mousetrap (Score:1)
Back in the days, when the search engine war begun, I had no problem typing in http://www.altavista.digital.com or whatever, it was simply my choice of search engine.
Today, google is my primary choice, but if they will ever bloat it with more than it is today (neat, simple and honest!) I will search for a better one, the one which suits my needs the most, or a page which simply parsers googles results and presents them nicely, in short terms, something nicer.
Google's powers is not in th
It's more impressive than Slashdotters realize (Score:5, Interesting)
The results include MSN and Gigablast and Lycos. Basically, that means Yahoo's crawling plus Gigablast. Yahoo has ramped up their crawling since March, and is on a par with Google. They've been slow about passing all of it to MSN in a timely fashion, but by now MSN has most of it. I think Lycos, which also uses Yahoo's Inktomi, is about the same as MSN.
The clustering is the best of any search engine, meta or otherwise. You don't have to have JavaScript enabled, which is a big plus over the Vivisimo interface I remember from a year ago.
Finally, I was delighted to see that Clusty.com does not set a cookie unless you customize. Even the cookie for customization looked like it lacked a unique ID. I emailed Clusty and they confirmed for me that they have no plans for a unique ID in their cookie.
Google tracks you with a unique ID across all of their services, and saves everything it knows about you. Google's cookie expires in 2038.
Now I ask you, why do Slashdotters feel the need to dump on Clusty?
Re:It's more impressive than Slashdotters realize (Score:1)
By the way, the Gator/Overture partnership has got [eweek.com]
Re:It's more impressive than Slashdotters realize (Score:2)
Google dumps a list on you. You have to refine the search yourself or evaluate these results by examination. I like the concept of clustering of these search results because *if* it works, then it's work I don't have to do. In practice, clusty appears to g
Mozilla search plugin from the actual company (Score:4, Informative)
Re:Mozilla search plugin from the actual company (Score:3, Interesting)
At any rate Firefox has a box in the corner that's directly linked to google
Re:Mozilla search plugin from the actual company (Score:2)
Nowadays, using autofill is just too much work for me.
Re:Mozilla search plugin from the actual company (Score:2)
the "plug in" is less than you think (Score:1)
oh , and a nice icon,
they're just trying to kiss up to mozilla users.
Re:Mozilla search plugin from the actual company (Score:1)
All these toolbars and crap exist for IE mainly because MS seems to have stopped actively developing it.
interesting but ... (Score:1)
plus the name, as stated in previous comments, sounds like something ikky, like a cockroach cluster or something, nifty idea and design though sponsered results suck being in the results and not on the side
I use vivissimo in preference to google quite ofte (Score:2)
Simply because for *many* kind of searches, for example looking for a supplier of aluminium extrusions for heatsinking, ALL google's top returns are for OTHER fucking indexing sites like fucking kellysearch and NOT fucking sites owneed and run by aluminium extruders.
This is a trend that has become ever more prevalent over time, and it makes google ever more irellevant.
this will of course get modded troll, see journal
It beats Google (Score:3, Informative)
It's not perfect, but it's a good start. I'm sure
One neat consequence for web marketers will be more targetted traffic. With Google, you have to hope searchers will be savvy enough to use 3-4 keywords to search for exactly what they want- if they can click on two more KWs that refine their search, we'll see the inventory of cheaper 3 KW terms go up significantly.
Search is a dialog, not a ranking (Score:4, Insightful)
The search of "Apple" illustrates this well. This search, like many is deeply ambiguous. It could refer to the computer company, to the fruit, to the record company, to New York City, to the singer (Fiona), or to Apple Valley (MN or CA). Even if the search engine knows that it refers to the computer company, it's still ambiguous. It could refer to the company (as an investment), the products (for purchase), or a question (as in technical support).
The point is that each of these ambiguous alternatives creates an independent cluster of hits. One cannot even rank hits within a cluster due to a hierarchy of ambiguity. Within the Apple computer cluster are distinct subclusters for computer purchase, investment evalaution, and technical support. Although one can create a ranking within each subsubsubsubcluster, it is impossible to construct a meanful rank for all hits across all clusters - the second hit for "purchasing an Apple computer laptop" is not comparable to the 2nd hit for "Apple Records".
Instead of a pagerank scheme that sorts the universe of hits the instant the user enters the search, search engines should be more interactive. The first page of hits would emphasize breadth -- displaying hits most representative of a broad range of alternative clusters. The UI would enable a "more like this"/"fewer like this" selection process that tells the search engine what the searcher is actually looking for. As the searcher selects hits, the subsequent pages might show popularity-ranked hits within the clusters that seem to interest the searcher.
Each hit and each page would serve a double-duty -- serving the searcher's need to get information from the internet, and answering the search engine's question about the needs of the searcher for that particular search. Until the search engine understands each searcher and each search, it cannot hope to rank the hits.
nice!!!! (Score:1)
nothing really big about the encyclopedia search, it just mirrors wikipedia which already has a good search engine for its articles....unless clusty adds something like natural language search or a close to search I see no benefit for using clusty over wikipedia.
'Clusty' is bad name though. (Score:2)
tabs (Score:2, Insightful)
Pretty darn good search engine! (Score:1, Redundant)
And these clusters are a great idea.
Definitely worth bookmarking!
Clustering Client for OS X (Score:2)
theConcept [mesadynamics.com] is a client-based clustering/thematic search engine that works with Google, Wiki, DMOZ and other search engines (it data mines result and analyzes most significant keywords from the source pages). If you have OS X, you can check it out for free.
</shameless plug>
Hooray! Hooray! Hooray! It puts Wikipedia first! (Score:4, Interesting)
Awesome - clustering seems to be very effective (Score:2)
That said, I think that Clusty will do very well. I use Firefox and Safari for my browsers; after breakfast I am going to configure one of them to use Clusty by default.
One thing that make Clusty so interesting to me is that I have been working (for the l
oblig simpsons ref (Score:1, Redundant)
Maybe this will give Clusty the crown.
clustering? already got that... (Score:1)
Vivisimo (Score:1)
"Hey Vivisimo guys, we're starting up a new company here and you have a very powerful results format. What would it take to license your engine to help organize and present our data to the public?"
Vivisimo's answer: Well, that depends on how much money you're going to make of course. Exactly how much money do you have and do you expect to make?
Google's answer: We're designing a hardw
Why clustering? (Score:2)
For example:
I search for 2600. I want information on the Atari 2600.
As soon as I realize that 2600 is a hacker magazine, I add the word "atari" to my search.
Why is this harder than clustering?
I do like the Open in Preview Pane feature, though.
--
Definition of irony (Score:1)
The slashdot search doesn't work... (Score:2)
But it doesn't work! A simple search for "Grits" [clusty.com] or
"Linux" [clusty.com] return 0 results.
Marketing Slip-Up... (Score:3, Funny)
- I refuse to use anything that sounds like children's slang for a bogey or some other lump of offensiveness. Whoever thought that name up needs to be drummed out of marketing forever. The layout of the main page is reminiscent of Ask Jeeves (which is a bad thing, it automatically makes me think 'bad searches') and search pages look cluttered and the vivid background against the soft shades of the foreground looks awful. This 'Clustered Searching' is a good idea, badly executed. Next please.
What's it's name... (Score:3, Funny)
Which is superior.. my brief take on it (Score:2)
I have no problem with this way Google works, I found backlinking to be tremendously useful when implementing a gigabyte-sized database on htdig, and Google "just works".
Clusty on the other hand works to reduce my information saturation, it will reduce the
Mozilla Search Extension (Score:2)
Re:Mozilla Search Extension (Score:2)
Also, it only takes about 15 minutes to write a search extension. Tons of them exist; just Google for one.
i just couldn't care less in such cases (Score:4, Insightful)
If some are so revolutionary, then why are they using someone else's engine by adding some stuff most people most probably never find out what to use for. Doesn't A9 ring a bell for anyone, or does it.
I have an idea. Let's make a totally new and ground breaking search engine which will use Google's results, but hey, the main idea: let's have a different logo and paint the site pink !
Geez, I sometimes just can't stop wondering about all the freaky things that money can be earned from these days.
real headline (Score:2)
Sponsered results at the top (Score:2)
Rule number one (Score:2)
Rule #2, innovate the actual searching, not just the organizing.
I'm impressed (Score:2)
This search engine *IS* better than Google (Score:2)
Only 4 days late (Score:2)
Googles newest competitor? Thursday September 30, @08:31AM Rejected
And I didn't even link to a NY Times article but rather an AP story.
You can see that I have this story listed in my journal at this link [slashdot.org].
Keep up the good work. This is the kind of nonsense we have come to expect from /.
P.S. I will usually post some of the stories which have been rejected so be sure to check my journal every so often. Af
Clue! (Score:5, Funny)
I think there's your first clue for why your story was rejected.
Re:Clue! (Score:2)
Re:Clue! (Score:4, Funny)
Re:Clue! (Score:2)
And where's the concluding link to Roland Piquepaille's Blog?
Re:Clue! (Score:2)
I also got "recieved" wrong also. "I before E except after C" was pounded into my head enough in grade school you'd think I would rememeber it.
Well, I guess that was why it got rejected! Cause I'm an idiot!
Oh well, at least someone got it posted on here, and that's all that matters really.
Re:The interface looks pretty "cruddy" (Score:1, Interesting)
Re:The interface looks pretty "cruddy" (Score:2, Informative)
Back when Teoma came out I remember thinking the same thing, but soon forgot about it.
And yes, I have seen Linux. I was until a year ago a perpetual new-distro-installing-slut to see if Linux was up to scratch. Sadly, I still have to tinker and fiddle with the thing to even ge
Re:Vivisimo... (Score:1)
If you only have a word or two to go by, then the clustering really pays off in spades, as it will offer you several routes to follow up in your search results. Very much like, say, kartoo is great in the way that it shows what unites various results.
The problem with kartoo is just th