Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Wikipedia Founder Working on User-Powered Search

Posted by Zonk on Tue Dec 26, 2006 09:26 AM
from the many-hands-make-light-workd dept.
An anonymous reader writes "Jimmy Wales, founder of the Wikia corporation, has revealed plans to offer a user-driven search engine. Ars Technica reports that the plan is to leverage user preferences to pick the 'best' site for any given search term, while at the same time utilizing advertising for commercial gain. The article admits this may not be the ideal solution: 'Users may be reluctant to contribute to the betterment of a commercial site that may end up being bought by a bigger company. Consider, for example, the tragic death of TV Tome, a comprehensive community-driven television content guide that was eventually bought by CNET and transformed into a garish, excessively commercialized Web 2.0 monstrosity of significantly less value to users.' Just the same, Wales seems very enthusiastic in the Times Online article highlighting this venture."

Related Stories

[+] Jimmy Wales Resigns Chair at Wikipedia 71 comments
user24 writes "As reported by Wikipedia's Signpost ezine, 'The Wikimedia Foundation Board of Trustees has moved forward with a step in restructuring the organization by reorganizing itself. Following a planning retreat in Frankfurt, Jimmy Wales stepped down as Chair of the board, and Florence Nibart-Devouard was chosen to replace him ... Wales, meanwhile, will continue to serve on the board and assume the honorary title of Chairman Emeritus. He will remain active in Wikimedia projects, but chose to pass on the responsibility of heading the board, due in part to his commitments to outside projects.'"
[+] Wikipedia Founder to Give Away Web Hosting 108 comments
eldavojohn writes "Wikipedia founder Jimmy Wales is going to be giving away free web hosting from his company's site Wikia. The company announced this 'free culture' movement at the current Le Web 3 conference in Paris. They somehow received a $4 million dollar investment package from Bessemer Venture Partners, Omidyar Network and individual investors with no business model. Is this a dotcom bubble style mistake or just proof of Jimmy Wales' golden touch?" From the article: "Openserving will go further than Wikia's current services, by giving away hosting services and bandwidth, in addition to allowing site creators to keep the advertising revenue generated by the site. 'If we give away the bandwidth and the storage, and we get none of the advertising revenue, what's the business model? Well, I don't know yet,' Penchina said. The software acquired with ArmchairGM will let Openserving customers create collaborative publishing sites, combining elements of blogs and wikis."
This discussion has been archived. No new comments can be posted.
Display Options Threshold:
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
  • Controversial subjects (Score:2, Insightful)

    by Alien54 (180860) on Tuesday December 26 2006, @09:31AM (#17366288)
    (Last Journal: Saturday October 27, @10:19AM)
    I can just imagine the results involving controversial subjects.

    Nevermind something as sedate as GWB or Blair or global warning or religion. What about vi vs emacs?
  • One word: (Score:1, Insightful)

    by Anonymous Coward on Tuesday December 26 2006, @09:35AM (#17366322)
    SPAM.

    Please explain how you're going to handle gaming the system by seo spammers.
    • Re:One word: by Pneuma ROCKS (Score:2) Tuesday December 26 2006, @01:20PM
  • Early rumors had him working with Amazon in the effort, but this [mashable.com] should clear things up.

    Google, Amazon, Opera, Mozilla, all are good ideas but as they expand their reach, they turn to crap. Google is going to Hell, Amazon is there, Opera likes the road, and Moz? They seem to be eyeing it.

    Whatever happened to, "Do what you do best. Forget the rest"?

  • TV Tome Replacement (Score:5, Insightful)

    by RaboKrabekian (461040) on Tuesday December 26 2006, @09:50AM (#17366454)
    (Last Journal: Tuesday May 20 2003, @11:50AM)
    TV Tome really was a fantastic site. A lot of the users that contributed there went over to a great TV wiki that sprung up after TV tome was sold, http://tviv.org/ [tviv.org]

  • Out sourced. (Score:4, Funny)

    by BillGodfrey (127667) on Tuesday December 26 2006, @09:53AM (#17366500)
    (http://billpg.me.uk/)
    He managed to find someone even cheaper than India to outsource?
  • Been there, done that... (Score:5, Interesting)

    by Iphtashu Fitz (263795) on Tuesday December 26 2006, @09:56AM (#17366532)
    Back in 1999 a company called Direct Hit Technologies developed what they coined a "popularity engine" that ranked results based on tracking user behavior. They basically partnered with existing search engines and mined their web logs looking for patterns among users. If a lot of people who entered search term "A" went to website "X" but within a minute or two went to website "Y" then "Y" would be ranked higher than "X". Direct Hit was bought in the middle of the internet boom by Ask Jeeves for a cool $512 million. Some of that technology likely still exists within their full search engine, and I'm sure others like Google, Yahoo, etc. all use similar methods of tracking user behavior for helping with their rankings.

  • Time will tell who is right but (Score:5, Informative)

    by atw (9209) on Tuesday December 26 2006, @09:58AM (#17366542)
    (http://www.majestic12.co.uk/)
    Disclaimer: I am a founder of a competing search engine concept that is based on volunteers running distributed software that crawls pages, conducts partial analysis and indexing before passing over results to central server that will index data into main central index: human aspect here is the people who take part in the project, and participants can actually change ranking formulaes, and shortly will be able to assist in human detection of spam etc.

    Searching the Web is a very challenging problem (that's why few companies do it): volume of data is huge and one only appreciates value of good algorithms when faced with situation when poor algorithms make stuff run for weeks failing near the end and you have to restart the run to wait another week. You can either try to handle this very big problem, which is very hard even if you have the money (look at Amazon's A9 funded with millions, yet they licensed Google's code and database), or you can try to reduce the problem: only focus on a handful of "important" pages - Yahoo did that when they were human edited directory/search engine hybrid.

    It seems to me that Mr Wales entertains the illusion that a very small number of manually checked pages in the Web space will be sufficient to satisfy vast majority (and it has got to be 98%+ as I won't be hopping from one search engine to another) of search queries. If this was the case then we would still be using Yahoo that did pretty much just that, yet almost everyone (including Yahoo) moved to algorithmic search engines because it is the only way to handle billions of pages, and billions of pages you will have to handle: even if you just index homepages of all registered domain names you will be dealing with 100 mln+ pages, that's good 20 times more than articles in Wikipedia and checking pages can be far more duller than reading nice article you have some personal interest in.

    What I find ironic that our own concept of the search engine was removed from Wikipedia because we were supposedly "not noteable enough", that's the sign how they handle problem of "too much data" in Wikipedia - they just reduce the problem by reducing datasets greatly, sometimes this is done wrongly, sometimes rightly and it might well work for Wikipedia, but it sure as hell won't work for Web scale searches. Oh, and by the way who said Google and others don't use human reviewers? They sure do, just check TrustRank [wikipedia.org], this link is ranked as #1 match on Google for search TrustRank! Notice what Wikipedia tells us: "While human experts can easily identify spam, it is too expensive to evaluate manually a large number of pages."

    Human input plays an important (although fairly unknown as they prefer to keep it secret) role in the state of the art search engines, however suggestion that humans can handle billions of pages and/or that a handful of pages will be sufficient for a general purpose search engine is wrong and a very backwards move that will result in exactly the kind of wrong attitude present in Wikipedia now.
  • Doomed to repeat it, I guess (Score:4, Informative)

    by hirschma (187820) on Tuesday December 26 2006, @10:01AM (#17366578)
    OK, perhaps the new generation of Web entrepreneurs had better learn something from their "elders". We're about to see a lot of concepts that didn't work in the 90's be resurrected and funded. This has been tried and failed, badly:

    An orphaned ref to Magellan, the human powered search engine [gocee.com]

    Didn't work before when there were a lot less sites out there, not likely to work this time, either.

    jh
  • by fotbr (855184) on Tuesday December 26 2006, @10:05AM (#17366600)
    In the summary quote he pretty much announces his intention to sell out at some point. More attention now leads to more money later, either through having a higher-profile name, or through suckering more people into developing his search rankings.

    So the guy founded Wikipedia. Good for him. It doesn't mean he walks on water, and the advent of yet another search engine doesn't deserve front page of slashdot. Especially when you know its going to get swamped by spammers (or their bots) and quickly become useless.
  • by Watson Ladd (955755) on Tuesday December 26 2006, @10:23AM (#17366760)
    If Mr. Wales uses some kind of Netflix like system to guard against spam, we will have a problem. People who have entrenched beliefs will read sources that further those beliefs, further entrenching them. Every day we see how it's become harder to have meaningful debate due to polarization. So far, Europe has remained immune, but if the demagogues gain power it will happen there also. If This Goes On-- [wikipedia.org]
  • by KrissyM (1043712) on Tuesday December 26 2006, @10:29AM (#17366840)
    SearchSays.com [searchsays.com] is a user-based search engine that just recently launched, and has a growing contributor base. Looks like it might be a race, but I guess I'm not surprised it's already been started. There are so few original ideas on the Internet these days. As to the "it's already been done and failed" remarks - timing is everything isn't it? I could see something like this taking off with the current Web 2.0 craze.
  • by Yogs (592322) on Tuesday December 26 2006, @10:43AM (#17366974)
    Back in the late 90s, I used to recommend browsing downward to see if there was anything in the hierchical categorization of web content Yahoo set up as a complement to the many, but often irrelevant results people got back from AltaVista.

    Fast forward to days of Google and Wikipedia and you have infinitely better "dumb" search, and an equally easy to use, generally decently accurate, and well contained treatment of a dizzying array of topics.

    So, what's needed to fill the "search for information" gap? I doubt it's an attempt at hierarchical categorization, people don't try and absorb a whole lot of related content at once, they want an answer, and the navigation was a pain even back then. So then, just using people to try and make the dumb search results better? Well yes, but Google is continuously working to fill that gap itself. Nobody really comes close that I've seen. Even back in the late 90s those yahoo directories got stale quickly both in terms of dead links, and missing good newer content. Now, you can automatically test for and prune dead links, but results WILL get dated quickly.

    Humans are SLOW, especially when you consider that their number is limited to those you hire if you want to avoid opening yourself up to the kind of spamming google has to contend with, which they're putting forward as a key differentiator in what they're trying. The web is several orders of magnitude bigger now than it was then, and it didn't work then, so what exactly makes Jimbo think it will work now?
  • by jwales (97533) on Tuesday December 26 2006, @10:53AM (#17367070)
    (http://search.wikia.com/)
    TVTome is an excellent example of why free licensing matters. When a community has free licensing as the social framework to allow for forking, the infrastructure providers are forced to continue to provide good service, to prevent the community from forking and leaving.

    Even Dmoz, for which I have great fondness and respect, has been crippled for years by a non-free license that allowed AOL to run it into the dirt. (See the recent 6 week server outage, for which there is simply no excuse.) (The Dmoz license is not the worst possible, mind you, but it is still problematic in a number of important ways.) And their software is totally non-free.

    • 1 reply beneath your current threshold.
  • Shills? (Score:4, Interesting)

    by Kjella (173770) on Tuesday December 26 2006, @11:26AM (#17367430)
    (http://slashdot.org/)
    And how would he handle shills? Bots? Trolls? Ok, google is struggling bad enough with link farms and the like but I can't really imagine end-users being the answer to this, even with some sort of meta-recommendation system. Personally, I find either a wikipedia search (for example now, recently I've been answering some christmas quizzes which it just excels at) or a very targeted google search (3-4 search terms) finds 99.9% of what I want, and the rest just isn't there.
  • Google already has this (Score:2, Funny)

    by zlogic (892404) on Tuesday December 26 2006, @12:06PM (#17367856)
    (http://zlogic.da.ru/)
    Google already has this, only they've decided that pigeons are best for the job [google.com].
  • Advertising (Score:1)

    by blue.strider (737082) on Tuesday December 26 2006, @12:15PM (#17367932)
    How come advertising is an acceptable business model in polite company?! How come thumping one's chest with no backup data is an acceptable form of communication?! I'm getting REALLY tired by all the Web wannabes that think advertisement is a valid business model. Just give me a clean per-pay service for once!
  • wiki weighted search (Score:5, Insightful)

    by owlnation (858981) on Tuesday December 26 2006, @12:37PM (#17368144)
    Part of me welcomes new methods and new technology where search is concerned. However, the involvement of Mr Wales into this arena isn't one I welcome at all.

    The only good thing about this is that possibly Wikipedia might be ousted from the primary or secondary page rank for most subjects. That is an authority most highly undeserved, and proof of nothing more than how far we need to go in terms of achieving accurate search.

    I think (hope) this is just a piece of self publicity. I doubt they have the technology - judging by the fact that at peak times Wikipedia search shuts down and defaults to Google and Yahoo.

    Interesting too, that while Google employs seriously smart people and is founded by seriously smart people, that Jimbo and whomever he cobbles together from the smart-search-technologists-who-decided-not-to-work -for-google-exactly-why? are likely to succeed where they have failed. Sure, brains aren't necessarily everything, but they really do help. I think no small amount of Google's success is the size of the brains behind it. It's why they have a competitive advantage in most markets they enter.

    We have seen very clearly that Wikipedia is extremely vulnerable to, and tainted with, group-think manipulation. (Jimbo's icon, Ayn Rand as one very tiny example of many). Why would anyone think this search will be in any way different. This looks just as vulnerable and easy to manipulate if you get a group together. Which every SEO blackhat on the planet will do on the day of launch. This looks much easier to manipulate than meta tags, or page rank.

    I'm sure SEO blackhats and right wing organisations are foaming at the mouth with excitement at this wonderful Christmas announcement.
  • Why start there? (Score:1)

    by JPriest (547211) on Tuesday December 26 2006, @12:37PM (#17368146)
    (http://lp.org/)
    Why start by creating an entirely seperate search engine? I often find myself using Google just to search wikipedia becasue I am not sure how to spell the item I am looking for and the search box on Wikipedia needs an exact phrase. Why not improve that first and then consider expanding?
    • 1 reply beneath your current threshold.
  • by Animats (122034) on Tuesday December 26 2006, @12:40PM (#17368212)
    (http://www.animats.com)

    Problems:

    • There are too many websites, changing too rapidly, for volunteer raters to keep up. It's too labor-intensive. In a narrow area (like, say, hotels) it might work, and it's been done.
    • Inducing people to work for free for a profit making company may be illegal. AOL got into trouble over this years ago. [com.com]
    • It didn't work last time (see "Magellan search engine"). [submitcorner.com] Dead and forgotten, the Magellan search site is still up [worldweb.com] but the search box no longer works.
    • There's already the Open Directory Project. [dmoz.org]
    • It's highly vulnerable to spam and manipulation. The people most interested in the rating of X will be the promoters of X.
    • IBM has several patents on this.
  • Classic (Score:1)

    by Shohat (959481) on Tuesday December 26 2006, @12:50PM (#17368324)
    (http://www.sc2blog.com/)
    When all you have is a hammer, everything looks like a nail
  • downvoters (Score:1)

    by Inmatarian (814090) on Tuesday December 26 2006, @01:01PM (#17368426)
    User Driven Search Engine?

    Is that anything like ytmnd's voting system? Because if it is, Wiki will have it's own class of Downvoters.
  • stumbleupon.com? (Score:1)

    by modir (66559) on Tuesday December 26 2006, @01:20PM (#17368558)
    (http://modir.stumbleupon.com/ | Last Journal: Saturday July 05 2003, @01:19PM)
    Isn't http://www.stumbleupon.com/ [stumbleupon.com] already something like this?

    They have a search functionality as well.
  • InfraSearch (Score:2)

    by Darth Cider (320236) on Tuesday December 26 2006, @01:59PM (#17368926)
    InfraSearch [openp2p.com] was another early attempt at collaborative search, based on Gnutella. Sun bought it for about $20 million in stock (estimated) then did nothing with it.
  • Actually this is not going to happen (Score:3, Informative)

    by GerardM (535367) on Tuesday December 26 2006, @03:27PM (#17369724)
    Hoi,
    On a WMF mailing list Angela said that there was no substance to all this. I had also heard from other channels that there is not much to this.

    So even though it is nice to speculate, there is not much to all this.

    Thanks,
            GerardM
  • by BaldBass (49775) on Tuesday December 26 2006, @04:04PM (#17370070)
    Seriously, though, this whole deal of "User-Powered" looks like an attempt to play out the image of Wikipedia if front of ignorant public.

    The fact is that many (of not all) search engines use human input to rank search results. For example, Google's PageRank [google.com] is about links put on pages by whom? Humans, of course.

    OK, so you found a new way of extracting rating info from humans? Let's talk about that, but please stop bringing this "People vs. Computers" nonsense.
  • by John Jorsett (171560) on Tuesday December 26 2006, @04:15PM (#17370146)
    I entered a bunch of my CDs into the music CD database CDDB [wikipedia.org] (now Gracenote), thinking that the database's contents would remain public domain or at least freely usable. Then it was sold and the company that owned it forbade access by media applications without a fee. Thousands of people like me voluntarily built that thing, and now they wanted to sell it back to us. Any organization asking for my help on future projects had better have an ironclad guarantee that my work product will remain free to users.
    • 1 reply beneath your current threshold.
  • by mixenmaxen (857917) <[max] [at] [maximise.dk]> on Tuesday December 26 2006, @04:34PM (#17370330)
    (http://www.maximise.dk/)
    I actually started a company doing much the same thing back in 2000, probably a bit ahead of its time. Even took out a patent [uspto.gov] . Unfortunately investors (actually almost everyone we talked to..) didn't have a clue what we were talking about, so we ran out of money.

    Why do I live in such a small country, where nobody has a clue.... sigh...

    Anybody got a job working with interesting people that can actually think ????
  • Other Priorities? (Score:2)

    by Frosty Piss (770223) on Tuesday December 26 2006, @06:03PM (#17371216)
    (http://www.nojailforpot.com/)
    Before Mr. Wales expands his empire to cash in on search, maybe he ought to invest some development in improving Wikipedia's search feature wich is almost useless, one of the least useful search features I've ever seen. To search Wikipedia, I use Google.
  • by shren (134692) on Tuesday December 26 2006, @10:13PM (#17373218)
    (http://www.io.com/~shren | Last Journal: Monday September 16 2002, @01:55PM)
    You enter your search criteria, and the wikisearch engine tells you that your choice of topic is Not Notable, and for gods sake why not search for something important.
  • Why not create a project that takes the Google search results, and then creates a collaborative layer on top of that? Sort of like GreaseMonkey for Google? I know Google will never alter their automated search results - this is where a project like this could come in. It would have all the power of Google, plus all the power of collaborative human input. Right off the bat, it would be at least as powerful as Google - and assuming human input is more beneficial than detrimental, it would only get better over time...
  • People never Learn (Score:2)

    by NatePWIII (126267) <nathan@wilkersonart.com> on Wednesday December 27 2006, @01:49AM (#17374202)
    (http://www.haidacarver.com/)
    That is what I can never figure out with these community driven things. All these saps putting there time and energy into it, when they know that behind the scenes someone owns it and for them regardless of the personal investment or emotional attachment the bottom line (money) is going to call the shots at the end of the day.

    I have quite a view of my old customers who know me personally from my webhosting company and they still can't understand how I would abandon them and sell the company, not that I wanted to, it was a financial decision.

    Its like with the wikipedia phenomenom, are these people being paid for all the time they put into editing articles and reviewing content? Then possibly a year from now they get bought up by Google and the whole structure of the thing changes, then what?

    People get a clue... why support some other venture, even if it is well organized and ground breaking etc..., get out there and start your own business or try to develop the next big thing! Thats what I did back in 1999.
  • by macraig (621737) on Wednesday December 27 2006, @02:36AM (#17374374)
    (http://macraig.homedns.org/blog/)
    Wikipedia was a reasonably inspired idea... this one, of a user-driven search engine, is NOT. As a not-exactly-run-of-the-mill human who is already nearly drowning in the mediocrity and stupidity that surrounds me, I'm frankly terrified at the prospect that AVERAGE people will be entirely responsible for determining what results are returned by my search engine! I want the exceptional results that *I* can comprehend and appreciate, not the moronic ones that appeal to people of average intelligence and abilities. It already takes far too much time to coax those gems out of existing search engines; Wales' idea would make it utterly impossible. The search engine would drown in the same sea of mediocrity by which I'm engulfed.

    This idea would actually be a search-engine implementation of "tyranny of the majority", to go micely hand-in-hand with spam blacklists ad nauseum. The founders of the United States tried their best to put safeguards in place to prevent said tyranny, and now dear Jimmy Wales wants to thwart that? Apparently Mr. Wales is now more interested in any wild idea that stands a ghost of a chance of trumping his one decent idea than in doing something truly useful or helpful. Useful or helpful are nor words descriptive of user-driven search engine results.
  • 5 replies beneath your current threshold.