Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Wikia Search Launches Alpha, Not Ready Yet

Posted by ScuttleMonkey on Monday January 07, @03:56PM
from the release-early-and-often dept.
babooo404 writes "Jimmy Wales' latest project, Search Wikia has launched into alpha this morning. Most reviews have been negative. The system is a 'social search' and uses the Nutch search algorithm. You can friend people along with creating profiles, and the system uses a Wikipedia-style format for 'mini articles.'"

Related Stories

[+] Wikia Search Engine to be Launched on January 7th 189 comments
cagnol writes "The Washington Post reports that Jimmy Wales, the founder of online encyclopedia Wikipedia, has announced the launch of a new open-source search engine, Wikia Search, on January 7th, 2008. The project will allow the community to help rank search results, in a model close to Wikipedia. However the company is a for-profit organization. This new search is supposed to challenge Google and Yahoo."
Display Options Threshold:
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
  • Ok, let me see if I understand this. Wikipedia is an encyclopedia that can't have proofs or in depth reference materials, because more detail is out of scope for really no reason. But, they can somehow try and turn wiki into another google or a facebook.

    Interesting!

    Me thinks wiki should focus on its content.
  • no go (Score:5, Insightful)

    by jacquesm (154384) on Monday January 07, @04:00PM (#21946500) Homepage
    I tried it on a bunch of fairly simple queries and got nothing but extremely lousy results.

    On the web first impressions really matter and I think wikia fails horribly in that respect.

    Please Jimmy Wales go and fix wikipedia, it needs urgent attention, especially protection from editors running wild, and please, google go work on getting rid of that spam and fixing the rankings...

    • Re:no go (Score:5, Informative)

      by garcia (6573) on Monday January 07, @04:21PM (#21946770) Homepage
      You need to read some of Jimmy's comments on one of the blogs linked in the summary, especially the one I have copy/pasted below... The most important part is the second paragraph and while I am no Wikipedia fan and certainly agree with your comments that protections need to occur from what I assume you mean by "editors running wild," I think what he says below is very important for this new project!

      From here [techcrunch.com]:

      January 6th, 2008 at 10:50 pm

      Release early, release often.

      It's a project to *build* a search engine, not a search engine. We've been telling everyone that constantly. I'm sorry Michael's disappointed, but having said that, we didn't build it for him, but for people who think that openness, transparency, and participation are more important than slick releases.

      When I launched Wikipedia, I wrote at the top of the first page "Wikipedia, the free encyclopedia". On that day, anyone reviewing it would have laughed. What's this? There's nothing here! This is not an encyclopedia, it is an empty website with some funny editing syntax!

      So the comparison to Google on day one is just mistaken. Google didn't launch a project to build a human-powered search engine, they launched an algorithmic search engine with a clever new idea. So they didn't have to wait for the humans to come in and start building it.

      We aren't even running with a real index yet, just a placeholder index. Yeah, the search sucks today. But that's not the point. The point is that we are building something different.
      • Re:no go by evanbd (Score:1) Monday January 07, @04:54PM
        • 1 reply beneath your current threshold.
      • Re:no go by ZombieWomble (Score:3) Tuesday January 08, @04:15AM
      • 1 reply beneath your current threshold.
    • I'll stick with Google by filbranden (Score:2) Monday January 07, @04:44PM
    • Re:no go by antic (Score:2) Monday January 07, @05:10PM
    • first impressions matter by epine (Score:2) Monday January 07, @05:48PM
    • Re:no go by patro (Score:1) Tuesday January 08, @02:46AM
  • citation needed (Score:3, Funny)

    by snarfies (115214) on Monday January 07, @04:04PM (#21946550) Homepage
    Sorry, these reviews are not from reliable peer-reviewed sources and all references to them should be deleted. In fact, this whole article should be speedy deleted as non-notable.
    • 1 reply beneath your current threshold.
  • Work on wikipedia's search first (Score:5, Insightful)

    by mcsqueak (1043736) on Monday January 07, @04:05PM (#21946574)

    Maybe this will be rolled into Wikipedia once it's done, but it seems me to that their search algorithm needs *plenty* of work. Thanks to the glories of SpellChecker, I can't spell worth a damn... when I misspell something in Wikipedia, it rarely finds it in the results, whereas Google always know what I meant to type AND OFFERS ME A CORRECTION. On Wikipedia, I have to go look how to spell whatever I'm searching for correctly, then put it back into Wikipedia's search just to find what I'm looking for.

    Very frustrating...

  • The search results aren't great because there isn't really an index yet. I'm not sure why they led people to expect a working search engine.

    But at least Wikia Search is hosted in a cool underground nuke-proof data bunker [datacenterknowledge.com] in the middle of Iowa.

    • 1 reply beneath your current threshold.
  • very bad (Score:2)

    by wwmedia (950346) on Monday January 07, @04:08PM (#21946608)
    results are terribly irrelevant

    this will make people appreciate how much work goes into google/live/yahoo search engines once they use this

    and no unicode support? wtf! i tried searching for Moscow in russian (lol wtf! no unicode on slashdot either!)

    ugly 404 page too
  • Awesome (Score:5, Funny)

    by Anonymous Coward on Monday January 07, @04:10PM (#21946636)

    Highlighted article when I search for "sex":

    Mini Article About "sex"

    Sex is a term which is very often searched in the internet. Thus, a mini-article about pages with free pictures / videos without spam would be important.

    First result for "George Bush"

    George Bush Is A Crackwhore!
    ... handjobs for cash. George Bush is addicted to smack ... some blow.. yah know... like George Bush ...
    http://www.george.bush.isacrackwhore.com/ [isacrackwhore.com] - Cached - 1.26

    This is genius. I think I know what I'll search site I'll use next time I need some entertainment.

  • Which part of ALPHA... (Score:5, Interesting)

    by KingSkippus (799657) * on Monday January 07, @04:10PM (#21946642) Homepage Journal

    Which part of Alpha did these guys not understand? It is, by definition, "Not Ready Yet"!

    Jimmy has pointed out that they're not even running against a real index yet, just a placeholder index. He even went so far as to say, "the search sucks today." The idea wasn't to launch a finished product that's ready for primetime. It wasn't even to launch a particularly working application. The point was to put something out there to demonstrate some rudimentary functionality while they continue to work towards something that does work.

    You know, like a Beta.

    I think it's kind of sad that Jimmy put something out and said, "Here's what it kinda will look like, and sorta how it will work," and people's first reaction is, "It's not a fully-functional working product? What a piece of crap."

    I think I'll wait a little longer before judging. If you don't like the concept, fine, don't like the concept. But to bust its chops because it's not fully functional is a bit premature and silly at this point.

    • Re:Which part of ALPHA... by PhxBlue (Score:2) Monday January 07, @04:15PM
    • Re:Which part of ALPHA... by kebes (Score:3) Monday January 07, @04:16PM
    • Re:Which part of ALPHA... by IamTheRealMike (Score:2) Monday January 07, @04:17PM
    • Re:Which part of ALPHA... (Score:5, Insightful)

      by anthony_dipierro (543308) on Monday January 07, @07:00PM (#21948392) Journal

      Which part of Alpha did these guys not understand? It is, by definition, "Not Ready Yet"!

      I always took "Alpha" to imply feature-complete.

      I think it's kind of sad that Jimmy put something out and said, "Here's what it kinda will look like, and sorta how it will work," and people's first reaction is, "It's not a fully-functional working product? What a piece of crap."

      Personally I think it's crap not because it doesn't work, but because there aren't any original good ideas to it. Mini articles are cool, but not at all original, and the idea that they're going to populate them solely from user contributions rather than taking them from a free content source or buying them from somewhere is dumb. Sure, rating results doesn't work, but again, not at all original, and probably not that useful unless and until there are millions of people using the thing. Then there's the whole Myspace/Facebook/whatever stuff. Not original, not well integrated into the rest of the site, not interesting to me, and not a good idea in the first place (to integrate the two).

      Wales says "It's a project to *build* a search engine, not a search engine." Fine, but how does Jimmy expect to get people to build a search engine for his for-profit business? There are answers to that question, but I don't see where Jimmy has hit on any of them. The Alpha that launched today doesn't seem geared to developers. Sure, when Wikipedia was launched it sucked. But at least I could edit it and make it not suck! And anything I added could be used by anywhere in the world, not just Jimmy Wales or Bomis. What can I do with Search Wikia? Add to the mini-articles? Lame.

    • 2 replies beneath your current threshold.
  • Since when ... (Score:1)

    by jabberwock (10206) on Monday January 07, @04:15PM (#21946696) Homepage
    ... is this the community for whipsawing a website that goes alpha | "proof of concept" with "very little money?" I read the article, I read the review. I know who Jimmy Wales is, for chrissake, and I know he's controversial [wikipedia.org]. Does that mean he's not allowed to take the wraps off software and a web site that hopes to rely heavily on user input to make it worthwhile and better? We all start ... and restart ... *somewhere.*
  • 404 page quick fix (Score:2)

    by wsanders (114993) on Monday January 07, @04:22PM (#21946774)
    " is the name of a garage band from Omaha"

    Well, this is Wiki, if the content isn't there, do something!
  • Wikipedia's Blacklist In Use? (Score:2, Interesting)

    by _bug_ (112702) on Monday January 07, @04:34PM (#21946898) Journal
    I can't find anything in terms of documentation on Wikia, but it appears Wikia search is blocking sites on Wikipedia's blacklist [wikimedia.org] from being listed in the search engine. I've pulled a few examples from the blacklist and searched for them and have yet to receive any results on any of those searches.

    Can anyone confirm or refute this? Maybe it's just because the Wikia is in alpha it hasn't indexed much yet?

    If this is the case I'd probably steer clear of Wikia; I'm not sure I ant my search results to be filtered like that.
  • Be careful... (Score:5, Informative)

    by christopherfinke (608750) <cfinke@gmail.com> on Monday January 07, @04:34PM (#21946904) Homepage Journal
    Not only are the reviews bad, but using it could get you banned from Facebook. [chrisfinke.com]
  • More experienced developers? (Score:2, Offtopic)

    by sphix42 (144155) on Monday January 07, @04:36PM (#21946918) Homepage
    I always get a kick out of reading new sites.
    From the footer:

        <div id="ftcnt">
            <div id="ftlinks">
                <div id="ftcloud">
  • by MarkWatson (189759) on Monday January 07, @04:40PM (#21946958) Homepage
    ... before blasting the effort like the top level story poster.

    BTW, last night I looked at their technical information site: http://search.wikia.com/wiki/Search_Wikia [wikia.com]

    Some interesting stuff that I did not know about in their "Semantic lab".

    Anyway, it is at least an interesting idea - time will tell how it works out for users, and as a business.
  • Not that bad (Score:2)

    by AlXtreme (223728) on Monday January 07, @04:51PM (#21947102) Homepage Journal
    Not having heard much about Wikia search or the hype that surrounds it (I like it under my rock, thank you very much) I gave it a shot.

    The results aren't that bad (tried 2 dozen queries, albeit only moderately difficult. all gave satisfactory results in the first few hits), and the integration of a wiki article and people-profiles are interesting concepts. The interface is nice and clean. I guess they could work on their integration with wikipedia; it's one of the strong points of Clusty. All in all not a bad start.

    Then again, maybe I'm just not as critical about the results. I haven't used Google Search as my main search engine for a couple of years, but with time I could see wikia search become one of my regulars. All I can say is keep it up!
  • by pongo000 (97357) on Monday January 07, @04:57PM (#21947176)
    You can friend people along with creating profiles

    Huh? You mean I have to RTFA to figure out what this means?

    Where's a grammar Nazi when you need them most...
  • Wikipedia.org Search Sucks (Score:2, Interesting)

    by WebmasterNeal (1163683) on Monday January 07, @05:29PM (#21947528) Homepage
    Has anybody used Wikipedia's normal search on their site. I can never find anything, It's horrible. Yahoo & Google do a much superior job with searches than they do on their own site. With that in mind, it doesn't make much sense for them to development a brand new search engine type thing when their own isn't that good.
  • Typo (Score:1)

    by bruciemoose (319876) on Monday January 07, @05:32PM (#21947574) Homepage
  • by dannydawg5 (910769) on Monday January 07, @05:37PM (#21947624)
    Please don't mark me off-topic or peg me as astroturfing. I am saying explicitly that I wrote this site. This is my own vision of social searching, and I am putting it here because it's the topic of the article:

    http://jumphunt.com/ [jumphunt.com]

    Essentially, you type "g" to search google, "y" to search yahoo, etc. You can add and take away from the defaults. You can share, grab from other users, discuss sites, etc (that's the social aspect). You also get your own no-login-needed homepage to jump to sites. You can then add your own search box to your Firefox search box (the site performs suggestions too).

    Other aspects:
    -There are minimal ads.. a grand total of 2 in the entire site... and not on your custom home page.

    -Absolutely no email address is needed and never will be.

    It is still in alpha too (site is about 5 days old), but what do you think? I'm pretty proud of it. I'm using mod-rewrite, AJAX, JSON, XML, etc. I learned a lot from making it. I am not making a dime off this site. My real business is hosting.

    Hope you enjoy,
    -Dan

  • Chicken and egg (Score:2)

    by D H NG (779318) on Monday January 07, @07:09PM (#21948480)
    Nobody's going to bother using it until its quality is "good enough".
    Its quality isn't going to improve if nobody uses it.
  • lame (Score:1)

    by larry bagina (561269) on Monday January 07, @08:18PM (#21948968) Journal
    so, I searched for "penis" -- not that I'm searching for penis, it's the first word that popped into my head. Ugh, anyhow, the first page consists solely of sites for penis enlargement pills.
  • by Animats (122034) on Monday January 07, @08:58PM (#21949198) Homepage

    Wales was quoted recently complaining about Google's results for "Tampa hotels" [searchengineland.com], and talking about how Wikia was going to be better. So I searched Wikia for "Tampa hotels".

    The first three results from Wikia search are all from the domain "visit-tampa-bay.com". That's one of those bottom-feeder ad link sites. The site is supposed to redirect traffic to Orbitz, but doesn't even do that right. Very disappointing result. Could they have been spammed already?

    Trying "Tampa hotels" in Google gets us "travel.yahoo.com" for the top two results, which indicates that Google isn't biasing their search against their biggest competitor. Next is "traveladvisor.com". Those are OK results; you'd be able to get a hotel room that way.

    Trying "Tampa hotels" in Yahoo search gets us a page from one of Yahoo's special cases. Yahoo knows about "hotels", so we get a list of hotels and prices from Yahoo, and three sponsored results. The top organic result is "tripadvisor.com", which is at least a big-name travel site, followed by "visittampabay.com" (not to be confused with "visit-tampa-bay.com"), the site for the local Convention and Visitor's Bureau. Yahoo certainly tries hard for hotel searches, and seems to be doing OK.

    Trying "Tampa hotels" in MSN search gets results that look much like Yahoo's, but with lower result quality. MSN understands hotels as a special case. There are three sponsored results, and addresses and phone numbers for three real hotels. The first three organic search results are Yahoo Travel, "tampa-hotels.net" (an ad-laden landing page), and "tampa-hotels-discounts.net" (a bottom-feeder generic landing page that isn't even on topic.) Poor results.

    Trying our own SiteTruth [sitetruth.com] the top result is "all-hotels.com", which has a list of hotels with pictures and a reservation interface. The second result is Yahoo Travel, and the third is Expedia. We're sorting Yahoo results on business legitimacy, so that's not surprising. OK here.

    So there's where Wikia is today, on their recommended demo search.

  • Woah (Score:2)

    by the_kanzure (1100087) on Monday January 07, @10:43PM (#21949750) Homepage
    Wikia Search is open source, it's based off of Grub [grub.org] (which we have already [slashdot.org] talked about [slashdot.org] before [slashdot.org]). Here's the source code to the grub Windows client [grub.org], and there's a dev site [grub.org] too. The current scoring algorithm is over here [nutch.org]. If you want to talk with Jimbo and the developers, hop on to mailing list [wikia.com] and let's talk.

    Anyway, it looks like there's the opportunity here to *improve* this search engine -- programmers, I know you are reading, and at least check out the code. There's been talk about running some competitions for improving the search results (the scoring algorithms), how many of us would like to form a team? Maybe I'll do one. Who's with me?

    (Btw, these guys need help. I just found all of this after the recent news articles.) Screw my mod points.
  • Works great for me (Score:1)

    by Gregory Rider (923948) on Tuesday January 08, @02:28AM (#21950808) Homepage
    For example, I get nothing but the most insightful results when searching for the likes of "George Bush" [wikia.com] on Wikia Search.
  • There's a reason Google has a proprietary algorithm. It's not so it can control the world and generate cash, it's so the results are better.


    Back when google actually used pagerank, the results were OK, but they soon sucked as everyone started to game the system. Knowing the algorithm means sites designed specifically for high standings rather than for best content. They continually avoid this problem by changing the algorithm in secret.

    With an open source search algorithm, every result will not be the best site, but the one where the designer is willing to torture the content in a particular way. Then everyone will torture their content, and the results will be completely irrelevant.

    This is one of the rare places where closed, proprietary, secret systems actually make things better.

    • 1 reply beneath your current threshold.
  • by metempsychotic (1215690) on Tuesday January 08, @06:32PM (#21962424)
    I see at least two big flaws in the business model for the Wikia engine--

    1. Google can do it better if Wikia works

    Insofar as Wikia derives competitive advantage over Google from community support of search result ranking, Google can copy the model easily.

    In order for it to scale within Wikia, a large number of users must contribute to ranking. Once so trained, most of these users would not resist contributing in the same way to Google's rankings. The community of search ranking raters is very different from the kind of community that supports open source development, for example. It is much larger, much lower common denominator, concerned with results, without any axes to grind with the search providor.

    Google, with its large user base, would soon overcome any initial Wikia advantage in stored rating data.

    I do not see any other competitive advantage Wikia might depend on. On the other hand, I can think of a host of advantages Google enjoys (which I will not bother to list here).

    2. The ratio of hard costs is high, and the scale is large

    I suspect that the cost of infrastructure (machines, data centers, power, and people) to host the web in memory to support low-latency search at the Google scale would be difficult to support with a community model.

    Most Wikipedia support comes from individual donations of spare time from the broader community (i.e., articles). I would think that Wikipedia's content-to-hosting cost ratio is orders of magnitude better than Wikia's would be.
  • 6 replies beneath your current threshold.