Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Slashdot Log In

Log In

Create Account  |  Retrieve Password

The Top 100 Alternative Search Engines

Posted by Hemos on Mon Jan 29, 2007 09:12 AM
ReadWriteWeb writes "Search Engine Optimizer (SEO) Charles S. Knight has compiled a list of the top 100 alternative search engines. The list includes Artificial Intelligence systems, Clustering engines, Recommendation Search engines, Metasearch, and many more hidden gems of search. People use four main search engines for 99.99% of their searches: Google, Yahoo!, MSN, and Ask.com (in that order). But Knight has discovered, via his work as an SEO, that in the other .01% lies a vast multitude of the most innovative and creative search engines around."
+ -
story
This discussion has been archived. No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More
Loading... please wait.
  • Intranet Vs Internet (Score:5, Informative)

    by eldavojohn (898314) * <my/.username@@@gmail.com> on Monday January 29 2007, @09:14AM (#17798990) Homepage Journal
    As with most lists, I've got some nitpicking to do.

    All the large companies I've worked for don't care about world wide web search engines. Those engines used by the populace with the revenue coming from ads or 'paid search' or some indirect service business model. Now, a lot of companies are interested in Enterprise Search Engines and would pay a lot of money directly to a search engine company to come in and set up the technology to do intranet searches.

    The engine we currently use at my fortune 500 company sucks. I mean it is the worst. I would rather have a blindfold on with stumps for hands trying to type in an estimation of the internal IP address than use our search engine. That said, I have been told that we investigated using "Google Technology" although my superiors soon found that it wasn't at all better than what we already had. And so I've heard of a few others that have doubted Google's ability to dominate in a closed domain. They are clearly the winners in an open domain internet search but I haven't seen anyone take advantage of it as well internally ... yet.

    So while the external market may be broken down 99.99 to 0.01, the internal enterprise search side isn't that lopsided.

    Two engines that I've used and found to be novel ideas are BrightPlanet's Deep Query Manager [brightplanet.com] and Collexis [collexis.us] (NIH demo [collexis.net]). DQM is able to extract data from databases that are available through search on the local page but are not indexed by Google. DQM has you create jobs since they take so long to run. Collexis can process raw text and fingerprint it, then compare that fingerprint to documents that have been fingerprinted quickly. Two ideas that Google, MSN & Yahoo! don't really cover. I find it odd that a site like Yoople (what appears to be a slow German Google) made it on this list but not DQM.
    • The engine we currently use at my fortune 500 company sucks. I mean it is the worst.

      Could be worse, you could be using Slashdot's search function!
    • by JehCt (879940) * on Monday January 29 2007, @09:23AM (#17799110) Homepage Journal
      Could it be that your internal search sucks because your internal data is poorly organized, poorly written, and poorly tagged? Google isn't very good at finding stuff without the necessary clues. Search is only as good as the data stream. That's why SEOs always have lots of work - so many bad websites, so little time. If you know what you want, Google is great. If you don't know what you want (don't know the keywords), read Wikipedia to learn what to ask for.
      • This is so true, in so many aspects. So often, people underestimate the importance of good data. You can have the perfect system, but if you don't have the data to put into the system, then the system is useless. Garbage in == Garbage out.
    • by Anonymous Coward

      Enterprise search is always difficult, companies like Fast Search and Transfer [fastsearch.com] specialize in it. They have all sorts of document filtering pipelines and customized database connectors to hook all sorts of data into their search system. It also scales quite will across a cluster.

      On the specific topic of this supposed top 100 list, I notice it is mostly a list of what is 'neat' as most high ranking non-top 4 search engines are not listed. Neilsen and most other traffic rating groups would definitely not

    • by mosel-saar-ruwer (732341) on Monday January 29 2007, @10:41AM (#17800122)

      The engine we currently use at my fortune 500 company sucks. I mean it is the worst. I would rather have a blindfold on with stumps for hands trying to type in an estimation of the internal IP address than use our search engine. That said, I have been told that we investigated using "Google Technology" although my superiors soon found that it wasn't at all better than what we already had. And so I've heard of a few others that have doubted Google's ability to dominate in a closed domain. They are clearly the winners in an open domain internet search but I haven't seen anyone take advantage of it as well internally ... yet.

      Closed domains have this thing called "Access Rights" - typically governed by either Novell Directory Services, or Microsoft Active Directory.

      By and large, most enterprises don't want the janitor to be able to get on a kiosk terminal and surf the local search engine until he arrives at the document entitled FISCAL_YEAR_BRIBES_PAID_TO_MEMBERS_OF_THE_LOCAL_ZO NING_BOARD.DOC - that's the kind of thing that only the most elite of the grand poo-bahs are allowed to access.

      So a "closed domain" document spider is gonna have to be granted Administrator/Supervisory rights to the authentication infrastructure [which is a HUGE security risk in and of itself], and then it's gonna have to keep track of the pertinent access control lists before deciding whether or not individual users have the right to view search results.

      And if, as is typical, you've got four or five different authentication infrastructures in an enterprise [Novell Directory Services, Microsoft Active Directory, Sun iPlanet Directory Services, Oracle Internet Directory Services, etc etc etc], and if they aren't all tied together in some kinduva coherent LDAP framework, then that's just a massively complex project to even think about attempting to undertake.

    • your comment doesnt make it quite clear, have you evaluated the GoogleSearchAppliance [google.com]?
  • I was expecting to see specialized search engines, rather than generalized ones that happen to use unusual algorithms. Things like Baidu Mp3 search or Astalavista; the ones that allow special-purpose searches that feeding them into google would just produce crap.
    • Slifter.com [slifter.com], which is on the list, is pretty specialized. It's a mobile local product search. Its data comes from retailer inventory (not on the web) and it's meant to be run from a mobile device.

      (Yes, I work for them.)

      • Neat. It actually helped me find a potential source for something I've been trying to find local because I want to touch it before I buy it. (Intuos3 6x11)

        But it also returns results from websites... Buy.com is not 'local' for me. It needs an option to choose 'brick and mortar' only or it's a pretty major hassle to use.
    • by Odiumjunkie (926074) on Monday January 29 2007, @09:31AM (#17799218)
      I agree, I was expecting better - in my opinion there are a host of features that people have been talking about for years that haven't appeared in any search engine I've used.

      Things like:

      * searching by md5 hash to find where a random file on your hard drive came from

      * allowing the specifying of precise image size or dimensions to find a specific image - e.g., google indexes an image, you see the thumbnail, want to find it, but the original site is down - why can't google show me other images that match the original size and dimensions of that cached image, to help me find a mirror?

      * A search engine that rec0gn!s3s 4|_|_ 5p3c!4L cH4rAters

      * filtering search results by IP range

      * incorporating WHOIS details in search results (e.g foo +bar -foobar inurl:baz author:"J. Random Hacker")

      and so on, ideas that I hear mentioned occasionally but that never seem to go anywhere. Most of them would be fairly trivial to implement - perhaps file hashing would be too CPU intensive, but it could be limited to smaller files, or less acpu intensive algorithm could be used.

      Anyways, most of these I'd only use if they were added to Google - when it comes right down to it, database size is king with search engines - I'm happy to leave the meta/interactive/social/tagging side of things to the social bookmarking sites.
  • by gambit3 (463693) on Monday January 29 2007, @09:17AM (#17799026) Homepage Journal
    .. I want "accurate"
  • zombo.com [zombo.com]

    when i go there, i simply forget what i was looking for. problem solved. because you can do anything at zombo.com
  • by Browzer (17971) on Monday January 29 2007, @09:22AM (#17799086)
    a search engine?

    http://www.digg.com/about [digg.com]
    • Something like Digg, or Stumble Upon, is how many web users surf. It's the latest in Channel Hopping, and I think it's here to stay.

      The latest craze on the web is a 2000 Bloggers meme [abandonedstuff.com], where the photos of the blogmasters link to their website, giving people a random way to surf the blogosphere based on the looks of the author.
        • It's about random discovery (not content quality). Not quite like the Google I'm Feeling Lucky button, but close. And you know the blog isn't spam as it's been human added.
  • by 140Mandak262Jamuna (970587) on Monday January 29 2007, @09:22AM (#17799090) Journal
    Does slashdot or Sourceforge have a decent search engine to look for source code, sample code for particular APIs ?
  • Do the wiki tiki. (Score:5, Interesting)

    by AlexanderDitto (972695) on Monday January 29 2007, @09:24AM (#17799138)
    Ha, that's OK. I just use Google to find pages on Wikipedia. It's got all the information I'LL ever need from the internet.

    In all seriousness. There must be a reason why Google's floated to the top of the search engine love list, and I highly doubt it can be their (nonexistant) effective advertising campaign or their (also nonexistant) entertaining flashed-based website, because we all know people love those. No, I have to say that Google's got to have come up on top because they've been giving fairly accurate results. I know that if my search results were completely off, there would be almost nothing keeping me from switching to a new search; and, ironically, a search on Google for search engine brings up quite a few possibilities.

    I see no problem here... I'll just move right along.
    • by Not_Wiggins (686627) on Monday January 29 2007, @09:50AM (#17799478) Journal
      No, I have to say that Google's got to have come up on top because they've been giving fairly accurate results.

      "Accurate" had nothing to do with it. In a time when having keywords was the way to get to the top of the popular search engines, Google implemented a social ranking system; it really wasn't about being accurate so much as it was about "if everyone thinks you're link worthy, then you're probably an authority on some subject."

      There's plenty of talk about community-supported information and social networks, but Google was the first serious attempt at utilizing that information way before such discussions were popular.
    • There must be a reason why Google's floated to the top of the search engine love list

      Being popular doesn't mean anything. Look at Windows.
    • Re: (Score:2, Interesting)

      I don't know about anyone else, I just find google really easy to type in my browser. The sad thing is, I open up a browser intending to go somewhere else, I will start typing www.goo right away... just habit? I really don't know.

      Looking at some of the other search engines in the link, I can see some of them being quite hideous to type. For instance, www.boxxet.com. I know, doesn't look hard but I have a hard time with x sometimes.

      Disclaimer: Some people would just say to use favorites.... I can
  • Excite? Altavista? (Score:3, Interesting)

    by xxxJonBoyxxx (565205) on Monday January 29 2007, @09:26AM (#17799166)
    Just gotta ask: what happened to Excite and Altavista? Both of these guys still look up...
    • they're now owned by the big guys. using them is pretty much pointless ever since google came out with a better algorithm. the big four (google/yahoo/msn/ask) use comparable quality algorithms, everyone else just leeches off them or uses much lower quality algorithms
  • Music map is interesting, but I needed to enable scripts in FireFox, I don't see reason why it uses scripts and all this animation instead simple results list.
    http://www.music-map.com/ [music-map.com]
  • I want a search engine like Google was before they bought into blogging and brought that bias into their search results. Search engines that bias results in favor of pages that are heavily linked to end up supporting the status quo over newer-but-better ideas/products/pages, the corporate bullies vs Hertz. This could be a great small business incubator. Does such an engine exist today?
  • by Guppy06 (410832) on Monday January 29 2007, @09:43AM (#17799368) Journal
    are typosquatters?

    Find everything there is to know about "salshdot!"
  • They are entirely right, many of the most revolutionary search engines out there are not mainstream. Did you know that the big four comprise less than half of the top 10 most innovative search engines?
  • "Search engine Spammer". There's no such thing as SEO.
  • "in the other .01% lies a vast multitude of the most innovative and creative search engines around" -- all looking to get bought by one of the top four.
  • Suspect source (Score:3, Insightful)

    by hcdejong (561314) <acme AT xmsnet DOT nl> on Monday January 29 2007, @10:22AM (#17799882)
    The article was written by an SEO. Who says he doesn't rank them by how easy they are for him to manipulate?
    • The article was written by an SEO. Who says he doesn't rank them by how easy they are for him to manipulate?

      If he's like any other SEO I've ever dealt with, he really only cares about manipulating page rank on Google, Yahoo and maybe MSN or Ask. Remember, those four engines are the 99%; The rest are toys to him. They are on his radar but not enough so he needs to care about them yet. (Again, remember why the SEO put out this PR piece; the point is to get you, the buyer of SEO services, to think that hi

  • Yikes. If you have a graph with N nodes, it's N-dimensional. There are more than three websites. The difficulty is embedding the graph in two or three dimensional space. There are lots of algorithms for doing this, choosing the distance between nodes, and so on. Each necessarily discards some information, but choosing the right one could help. And maybe such a look at the web would be interesting, but it doesn't seem like it's always the most useful. You're probably interested in the big clusters, th
  • by PFI_Optix (936301) on Monday January 29 2007, @10:34AM (#17800026) Journal
    Liveplasma links Transsiberian Orchestra with Iron Maiden. Yeah, I see so much in common there.
  • "...lies a vast multitude of the most innovative and creative search engines..."
    They're innovative and creative, but nearly useless as well.
  • by cylcyl (144755) on Monday January 29 2007, @10:41AM (#17800124)
    On the term "alternative search engine" and post the first 100 results as an article?
  • The list is bad (Score:5, Informative)

    by Qbertino (265505) on Monday January 29 2007, @10:52AM (#17800280)
    Vivisimo not among the top 100? This is silly. Vivisimo is the first I turn to when Google fails to deliver. And they cover Googles shortcomings very well.
    In my book (and that of many others) Vivisimo is SE #2. And for good reasons too.

    www.vivisimo.com
  • by Pigeon451 (958201) on Monday January 29 2007, @10:53AM (#17800294)
    This list is junk.

    A9 uses MSN or Google, SimplyGoogle uses Google, Soople uses Google, etc, etc, etc. They just provide a new or innovate method of presenting the results, the result order is still the same.

    Also, Digg is listed? Del.icio.us? AOL???

    I'd rather have a top-10 list of REAL alternative search engines, not "portals" and such written by a SEO of a search engine optimization company.

    • This seems odd, but I was looking up some css, and javascript information. I couldn't find it on google, but I figured I'd search through digg. It worked. I've done this a couple times.

      Like the old yahoo and about.com, sometimes user selected content is better. Its different though because you depend on the description given to the article by the person submitting it.
  • Part of this guy's research involved Googling "alternative search engines."
      • I'm afraid I don't have one :-) I just thought it was funny, and perhaps mildly ironic. Although that's such a loaded term these days that who knows, really.
  • I typed Dreadful Snakes into the music search engine No results, Funny I have 2 of their Bluegrass albums

  • Usage stats for some of those can't be too good.

    Or maybe I'm wrong. But it seems like the big 2 or 3 would command 99.99% of all search traffic,
    leaving very little for these guys.

  • I don't see that any of these "alternative" search engines offer fixes to Google's current shortcomings (or at least the ones I run into). Personally, I can't wait til there's the Wayback Machine's [archive.org] archives are searchable by text rather than just domain. Hell, I'd even be appreciative if you could search for parts of domains. I can't overstate how often I'm driven crazy by remembering something from a site I saw long ago and not being able to refresh my memory because the site either went down or fell out o