Wikipedia and Plagiarism

Posted by CmdrTaco on Sunday November 05, 2006 @11:28AM from the less-than-college-papers dept.

Spo22a writes "Daniel Brandt found the examples of suspected plagiarism at Wikipedia using a program he created to run a few sentences from about 12,000 articles against Google Inc.'s search engine. He removed matches in which another site appeared to be copying from Wikipedia, rather than the other way around, and examples in which material is in the public domain and was properly attributed. Brandt ended with a list of 142 articles, which he brought to Wikipedia's attention.... 'They present it as an encyclopedia,' Brandt said Friday. 'They go around claiming it's almost as good as Britannica. They are trying to be mainstream respectable.'"

This discussion has been archived. No new comments can be posted.

Re:That doesn't seem like alot (Score:3, Informative)

by tomhudson ( 43916 ) on Sunday November 05, 2006 @12:00PM (#16725325)

... and after an investigation of some of those by Wikipedia, it was found that some were in the public domain, some were culled from government sites, and some were copied from the wiki, and not the other way around. Out of those 12,000 articles, we can now say that the wiki is at least as clean as Ivory soap (99.44%).

Re:ok methodology, bad analysis (Score:3, Informative)

by Skippy_kangaroo ( 850507 ) on Sunday November 05, 2006 @04:41PM (#16727955)

12,000 is easily enough to be statistically effective. Election polling gets acceptable results with samples of about 1,000.

Assuming that it is a binomial distribution, then p=142/12000=0.0118, q=0.9882, n=12000, which means the standard deviation of the count is sqrt(npq)=11.8 (approximately). Thus a 95% confidence interval is about 142 +/- 23: a sample of this size would be expected to turn up between 119 and 165 plagiarised articles, which is a rate of roughly 1.0% to 1.4%.
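For anyone who wants to check the arithmetic, here is a minimal sketch in Python. It assumes the normal approximation to the binomial and z=1.96 for 95%; the function names are made up for illustration and are not from Brandt's program or any library. It gives a standard deviation of about 11.8 and an interval of roughly 119 to 165, and it also shows the familiar margin of about 3 points for a 1,000-person poll.

```python
import math


def binomial_count_interval(hits, n, z=1.96):
    """Normal-approximation interval for a binomial count.

    hits: observed number of flagged articles (142 in the story)
    n:    number of articles checked (about 12,000 in the story)
    z:    normal quantile; 1.96 is the usual 95% value (an assumption here)
    """
    p = hits / n                      # estimated proportion
    sd = math.sqrt(n * p * (1 - p))   # sqrt(npq), spread of the count
    return p, sd, hits - z * sd, hits + z * sd


def poll_margin(n, p=0.5, z=1.96):
    """Worst-case margin of error (as a proportion) for a simple poll of size n."""
    return z * math.sqrt(p * (1 - p) / n)


p, sd, low, high = binomial_count_interval(142, 12000)
print(f"p = {p:.4f}, sd of count = {sd:.1f}")
print(f"95% interval for the count: {low:.0f} to {high:.0f}")
print(f"as a rate: {low / 12000:.2%} to {high / 12000:.2%}")
print(f"margin for a poll of 1,000: +/- {poll_margin(1000):.1%}")
```

The normal approximation is reasonable here because both np and nq are far above the usual rule-of-thumb threshold; with a much smaller sample an exact binomial interval would be the safer choice.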

And this is only plagiarism from on-line sites that are indexed by Google. Plagiarism from dead tree sources could well be significantly more.

This has got nothing to do with faith-based science and low analytical quality. I am once again amazed at how little people seem to know or care about proper statistics and just say "I don't believe it" if something doesn't accord with their preconceived notions.
