English Wikipedia Gets Two Millionth Article
reybrujo writes to inform us of a milestone for the English-language Wikipedia: the posting of its two millionth article. At the time of this posting there is uncertainty over which article achieved the milestone. "Initial reports stated that the two millionth article written was El Hormiguero, which covers a Spanish TV comedy show. Later review of this information found that this article was most likely not the two millionth, and instead a revised list of articles created around the two million mark has been generated, and is believed to be correct to within 3 articles. The Wikimedia Foundation, which operates the site, is expected to make an announcement with a final decision, which may require review of the official servers' logs."
Likely a lot more than 2 million (Score:5, Informative)
However, if they (or anyone else) need a plugin for MediaWiki that will list the pages in order so that you can count them and determine which article was the Nth article, I wrote a plugin called Page Create Order [bloomingpedia.org] that will put a special page called "List Pages By Creation Date" in your wiki. We developed it for Bloomingpedia originally. It's simple, but it does the job. It could easily be modified to count only articles of a certain size as well; the main purpose of this plugin is to see the order in which pages were created.
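For anyone curious what "list the pages in creation order and count to N" amounts to, here is a rough Python sketch of the idea. It is not the plugin's code (the plugin runs inside MediaWiki); the `Page` record, its field names, and the `min_size` cutoff are all made up for illustration.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Iterable, Optional


@dataclass(frozen=True)
class Page:
    # Hypothetical record; a real wiki would read these values from its database.
    title: str
    created: datetime
    size_bytes: int


def nth_created(pages: Iterable[Page], n: int, min_size: int = 0) -> Optional[Page]:
    """Return the n-th page (1-based) in creation order, or None if there is none.

    Pages smaller than min_size bytes are skipped, which is one way to
    leave very short stubs out of the count.
    """
    eligible = [p for p in pages if p.size_bytes >= min_size]
    eligible.sort(key=lambda p: p.created)  # oldest first
    if 1 <= n <= len(eligible):
        return eligible[n - 1]
    return None
```

Calling `nth_created(pages, 2_000_000)` on a full page list would give the two millionth page by creation time; passing a `min_size` changes which page that is, which is part of why the "real" milestone article is arguable.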
Re:Likely a lot more than 2 million (Score:3, Informative)
Either way, something about that length is likely to be a stub and not a 'real' article.
Re:The millionth (Score:3, Informative)
Re:That was quick (Score:3, Informative)
Basically, the situation is this: Notability has its thresholds - either you are notable or not (though where exactly to draw the line is, at times, difficult - but we have a pretty clear picture by now). Articles about people, bands, groups, companies, websites, etc. have to have assertions of notability (i.e. "they're really big in Pakistan and have released three albums", or whatever). Notability has to be backed up by reliable sources.
This leads to the situation that 1) people who are famous for failing at something can be considered notable enough for articles of their own (provided someone noticed and documented that in a reliable source), and 2) worthless celebrities are, alas, notable enough for articles because they probably have had verifiable media appearances.
(Think of it this way: if I had not heard about Paris Hilton before, I'd go to the article, come to the conclusion that she's a worthless celebrity, and be done with it. If there were no article about her, I'd probably ask "hey, this... thing is on TV all the time, what the heck has she done to get there, anyway, and why isn't there an article about her?" =)
You can help review new articles (Score:3, Informative)
http://en.wikipedia.org/wiki/Special:Newpages [wikipedia.org]
This will take you to the list of the most recently created articles. If you find that you have trouble keeping up with other editors who are reviewing the same articles, you might find this link useful:
http://en.wikipedia.org/w/index.php?title=Special:Newpages&limit=250&offset=250&namespace=0 [wikipedia.org]
Which will take you to the same list, but starting from the 250th most recent article.
Typically, it's most useful to
Anyone can do these things, and you can also just improve on any article by adding additional sources, or expanding on the article.
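If you want to jump further back in the list without editing the address by hand, the paged link above can be generated. This is only a sketch: the function name is made up, and the parameters simply mirror the ones visible in that URL (`title`, `limit`, `offset`, `namespace`), with `offset` assumed to behave as described above.

```python
from urllib.parse import urlencode

# Base address copied from the link above.
BASE_URL = "http://en.wikipedia.org/w/index.php"


def newpages_url(limit: int = 250, offset: int = 0, namespace: int = 0) -> str:
    """Build a Special:Newpages link (hypothetical helper).

    limit     -- how many entries to show per page
    offset    -- how many of the most recent entries to skip
    namespace -- 0 is the main article namespace
    """
    query = {
        "title": "Special:Newpages",
        "limit": limit,
        "offset": offset,
        "namespace": namespace,
    }
    # safe=":" keeps the colon in "Special:Newpages" unescaped, as in the original link.
    return f"{BASE_URL}?{urlencode(query, safe=':')}"
```

For example, `newpages_url(limit=250, offset=500)` would point at the page after the one linked above.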
Re:Yeah, but hasn't Wikipedia jumped the shark? (Score:5, Informative)
Re:It would be interesting to know (Score:3, Informative)
Re:Research isn't what I'm talking about. (Score:2, Informative)
A manual presentation layer. I'm content-driven, personally, a slick presentation does not increase my perception of the value of information.
- Everybody says that, but studies show time and time again that the way information is presented has drastic effects on how much information gets across and how it is perceived. Next you're going to tell us ads don't affect you.
Right, so it's an automatic (and thus more up-to-date) presentation layer, which carries quantifiable and repeatable bias by virtue of being algorithmic.
- What you're missing here is that Google indexes links to information; it does not summarize the actual information as Wikipedia does. Even if the information you wanted was always in a Google search, you would still have to collate it, judge sources, etc. Also, quality information is not all, or perhaps even mostly, online right now. The work of summarizing the information is valuable, and if it is already done for you, it can get you further ahead on the task at hand.
Why should a wiki be "stabilized"? Why is "formality" a virtue when wikipedia was created and gained value from non-conformance to traditional models?
- Because the real goal is information quality: demonstrable quality, in a way useful to the reader/researcher. The nonconforming, radically open current system has been shown to be successful in producing content, a smaller portion of it of reasonably high quality. But studies and observation of Wikipedia show that it has extremely high variation in quality, from articles replaced with "YO MAMA SO PHAT..." to widely reviewed articles citing and properly summarizing all the best written material on the subject. Formal peer review can lead to higher information quality, and if that reviewed version is available as an option, default or not, it can allow the best of both worlds (like the Linux kernel and most other software). Then there can be both a radically open article that may be more up to date, balanced, etc., and a stable version that is at least guaranteed not to be vandalized. The amount of stabilization could be as little as that, or as much as the formally reviewed case, or both. Thus the best of both worlds: content is produced, high quality content is available, and the review processes can be demonstrated.
Re:Research isn't what I'm talking about. (Score:3, Informative)
No, the "no original research" rule was instituted to deal with physics crackpots. This is documented on wikipedia itself if you actually delve into the pages about the rule.
There is no good way for wikipedia to differentiate between the personal experiences or knowledge of a 73-year-old rocket scientist wunderkind, a crackpot writing stuff in his garage, or a published scientist dabbling poorly outside his actual area of expertise. So wikipedia just disallows that sort of thing entirely, and relies instead on how hard it is for those people to publish their work in peer-reviewed journals or mainstream publications, setting its thresholds in that direction.
And it's not wikipedia's fault if the knowledge of a 73-year-old-Jim-Yardley knower isn't preserved. Anecdotes and anything else from him can be written down on any web page and preserved for posterity that way. (And if they get media attention because they're not crackpottery, they may make it into wikipedia someday.)
The goal of preserving absolutely everything known by every human, but only the good stuff, is unsatisfiable, and wikipedia aims for the extremely conservative side of the problem. It may not seem like that with all the pop culture crap to be found there, but wikipedia isn't a single coherent entity; it's a teeming mass of random people following the rules to varying degrees of accuracy and with no consistency at all. Somehow people care more about following the rules when it comes to rocket science than when it comes to character summaries of last year's big TV show. And isn't that awesome?