Wikipedia and Plagiarism

Slashdot is powered by your submissions, so send in your scoop

Wikipedia and Plagiarism 267

Posted by CmdrTaco on Sunday November 05, 2006 @11:28AM from the less-than-college-papers dept.

Spo22a writes "Daniel Brandt found the examples of suspected plagiarism at Wikipedia using a program he created to run a few sentences from about 12,000 articles against Google Inc.'s search engine. He removed matches in which another site appeared to be copying from Wikipedia, rather than the other way around, and examples in which material is in the public domain and was properly attributed. Brandt ended with a list of 142 articles, which he brought to Wikipedia's attention.... 'They present it as an encyclopedia," Brandt said Friday. "They go around claiming it's almost as good as Britannica. They are trying to be mainstream respectable.'"

This discussion has been archived. No new comments can be posted.

Wikipedia and Plagiarism

Load All Comments

Search 267 Comments Log In/Create an Account

Comments Filter:

That doesn't seem like alot (Score:2, Insightful)

by NinjaFarmer ( 833539 ) writes:

Doesn't Wikipedia have over a million articles (not in English alone, I know)? That would mean that's less than .1% of the articles are plagiarized. Seems reasonable to me that that amount would get by into unnoticed. All it takes is for the original author then to deal with it.
- Re: (Score:2, Insightful)
  
  by sprins ( 717461 ) writes:
  
  Apparently Wikipedia has over 1.5 million english articles alone. So your calculation of the percentage of 'problematic' articles is even more favourable. Of those 142 eledgedly 'problematic' articles only a few really seem to be a problem as the others originated from the public domain to begin with.
  
  Sounds like much ado about nothing once more. *yawn*
  - Re:That doesn't seem like alot (Score:4, Insightful)
    
    by aquaepulse ( 990849 ) writes: on Sunday November 05, 2006 @11:42AM (#16725101)
    
    Well that 142 was found out of his search of 12000, if his methodology was sound you could expect the proportion plagiarized within the 1.5 million to be about 17750. About 1.18%.
    
    Parent Share
    twitter facebook
    - Re: (Score:3, Informative)
      
      by tomhudson ( 43916 ) writes:
      
      ... and after an investigation of some of those by Wikipedia, it was found that some were in the public domain, some were culled from government sites, and some were copied from the wiki, and not the other way around. Of those 12,000, we can now say that the wiki is at least as clean as Ivory soap (99.44%).
      - Re: (Score:2)
        
        by sbaker ( 47485 ) * writes:
        
        Some are also instances of people writing something on their own web site and then later deciding to put it on Wikipedia - so even the instances where the other web site predates the Wiki article may not be copyright violations. Without discussing the matter with every single original author, it's hard to know.
        
        I guess the only thing this study tells us is that an UPPER limit on the number of plagiarisms is of the order of 1%. That's still an alarmingly high number.
        
        Re: (Score:3, Insightful)
        
        by tomhudson ( 43916 ) writes:
        
        Considering that an audit of dead-tree encyclopedias hasn't been done, we can't say. What we CAN say is that its foolish to make a comparison with Britannica, when an audit of Britannica found 10% of 600 articles to be non-factual. The sources cited in those 10% disavowed the articles' contents.
        This isn't all that surprising either, when you think about it. People cite people who cite people, and someone somewhere will mis-interpret what someone else wrote, or come to different conclusions while still ci
        
        Re: (Score:2, Interesting)
        
        by kkwst2 ( 992504 ) writes:
        
        Alarmingly high? You find it alarming that 1 of every 100 articles on a free web-based encyclopedia has plagiarized material. You are clearly much less cynical than I am. I would have guessed at least 5%, probably more.
        
        Re: (Score:2)
        
        by sbaker ( 47485 ) * writes:
        
        If it were (as many people assume) just a case of people sitting down and writing articles which remain in that state for all time - then, yes, I'd guess closer to 5% too. But that's now how Wikipedia works. Pick an article at random - hit the history button - see how many people have worked on it? For plagiariasm to stand, it requires that none of the subsequent editors noticed it. That's much less likely - but still possible - but in addition to that, the general churning up of text tends to change se
  - Re: (Score:2, Insightful)
    
    by account_deleted ( 4530225 ) writes:
    
    Comment removed based on user account deletion
    - Re:That doesn't seem like alot (Score:5, Funny)
      
      by user24 ( 854467 ) writes: on Sunday November 05, 2006 @01:26PM (#16726197)
      
      "It's a wiki. If you find a problem with it, you fix it."
      no, it's a wiki. If you find a problem with it, you add a template telling everyone that someone else should fix it.
      
      Parent Share
      twitter facebook
      - Re: (Score:2)
        
        by TheCarp ( 96830 ) * writes:
        
        wow that made me laugh so hard, I am practically crying.
      - Re: (Score:2)
        
        by GreatBunzinni ( 642500 ) writes:
        
        Actually the Wikipedia procedure for weeding out the copyrighted work is to flag the article as a possible copyright violation (add a {{copyvio}} template to the article) along with the source and then inform the editors about that problem by adding the article into the list of articles with possible copyright violations.
        
        Regarding the article, there is already a very active community weeding out Wikipedia of possible copyright violations. I don't know how this can be considered news.
    - - Re: (Score:2)
        
        by damiangerous ( 218679 ) writes:
        
        That misses the point of the AC parent. I could be updating it if I knew the subject, yes. But what if I didn't know the subject? How would I know it's wrong? I either have to be researching the subject anyway, in which case I wouldn't be using information from Wikipedia or I would have to just have a passing interest in the subject without being particularly concerned if it's wrong.
        Tycho from Penny Arcade said it best [penny-arcade.com], and this is a point that has never been addressed.
- Re: (Score:2)
  
  by nomadic ( 141991 ) writes:
  
  Except the story specifically says he checked only about 12,000 of wikipedia's articles, so that would make it about 1% are plagiarized if you extrapolated. Still not horrible, but I'm guessing it's a lot higher than Brittanica.
  - Re: (Score:2)
    
    by Yvanhoe ( 564877 ) writes:
    
    Right now we can just watch and see how this story end. I doubt this automated procedure could take into account contributor copying their own copyrighted materials insode wikipedia. I think this has already happened, I don't say that 100% of the 142 articles are in this case, but I think he raises an interesting point. Let's now see how this ends
  - Re: (Score:2)
    
    by DragonWriter ( 970822 ) writes:
    
    Except the story specifically says he checked only about 12,000 of wikipedia's articles, so that would make it about 1% are plagiarized if you extrapolated. Which would make sense to do if it was a systematic random sample, rather than a selection conducted by someone who has been on an anti-Wikipedia crusade for quite some time, as this one is. Of course, there is the question of the trustworthiness of the original number, as well, as the material was never independently reviewed, and Wikipedia's own revi
    - - Re: (Score:2)
        
        by interiot ( 50685 ) writes:
        
        He lists where they're plagiarized from on his website... click on each article and read the box at the top.
        He's got a bit more information at these threads: [1] [wikipediareview.com], [2] [wikipediareview.com], [3] [wikipediareview.com] I don't agree with his conclusions, but he said he did put around three weeks of effort going over these by hand to make sure they were legitimate copyvios.
- I hope you're not contributing... (Score:2)
  
  by NineNine ( 235196 ) writes:
  
  ...especially to any math articles. 142 is 1.183...% of 12000. Not "less than 0.1%"
Impressive (Score:4, Interesting)

by Solder Fumes ( 797270 ) writes: on Sunday November 05, 2006 @11:36AM (#16725045)

Wow. Only 142 articles in which average Joe Wiki forgot the proper way to attribute a source. I'm actually amazed there were so few occurrences. This article has the effect of heightening my opinion of Wikipedia's quality.

Share
twitter facebook
- Re: (Score:2)
  
  by porkThreeWays ( 895269 ) writes:
  
  In high school while doing term papers at least 1/3 of most of my papers weren't written by me. They were quotes from other sources. What's the difference? It's only plagiarized if you don't cite the source properly. Legally you are allowed to take small quotes and use them in a publication as long as you cite sources. I'm guessing many of those offenders could go legit just by citing the source alone without removing the quote.
  - Are you going to the prom? (Score:2)
    
    by goombah99 ( 560566 ) writes:
    
    I ask because apparently You did not actually graduate high school yet if you can't understand what the difference is between cited and uncited text.
  - - Depends on what you're writing (Score:2)
      
      by benhocking ( 724439 ) writes:
      
      IIRC, at least 2/3 of what you write should be your own conclusions, described in your own words, with the bulk of the rest expected to be comprise conclusions reached by others, but described in your own words. Direct quotations should not make up more than a very small part of any academic paper.
      If you're writing a summary article (e.g., on the current state of data mining [virginia.edu]), then as little as 10% (or even less) could be your own conclusions. However, if you're writing about your own research [virginia.edu], then you
- Re: (Score:2)
  
  by Penguinoflight ( 517245 ) writes:
  
  First, there's a ton of information which is "common knowlege"; This means that plagarism doesn't apply. Second, unless someone makes a direct quote of something they read, it wont show up as plagarism even if it is. The 142 count just means that all of them were flagrantly plagarized. This still seems rather low, but it makes a little more sense.
- - Is this sample as biased as Wakeman from 151? (Score:2)
    
    by tepples ( 727027 ) writes:
    
    Does the article make any claims as to how Mr. Brandt chose the sample of 12,000 articles? How can we look for biases in the sample?
Not shocking, but not a big deal (Score:3, Interesting)

by Chairboy ( 88841 ) writes: on Sunday November 05, 2006 @11:36AM (#16725049) Homepage

What's missing from the summary is that almost immediately upon getting the list, the articles in question were dealt with and the offenders were blocked or warned.

Wikipedia is written by a large community, and people make mistakes. I have read about other reference tomes that have been caught plagiarizing (for example, some encyclopedias or atlas's will put in a fake piece of data or a fake street so that they can easily determine if they're being copied from), and the turnaround time for fixing it can be years depending on the publishing cycle.

This isn't a condemndation of Wikipedia, despite Mr. Brandt's best efforts, it's a confirmation of why WP works.

Share
twitter facebook
Pfizzle. (Score:2)

by Etherwalk ( 681268 ) writes:

142 out of 12,000, some of which aren't really a problem, and that's numbers generated by a critic?

Yes, it's a problem, but that's actually not a bad score at all. You probably get more plagiarism than that on college papers at good schools. How many of these articles cite what they "plagiarize," even if they don't put it in quotes? Also, to make it legal plagiarizing, all you have to do is re-write each paragraph in your own words.

I see 1.18% of articles as potentially having text lifted from somewhere
- Re: (Score:2)
  
  by Daniel Rutter ( 126873 ) writes:
  
  142 out of 12,000, some of which aren't really a problem, and that's numbers generated by a critic?
  
  And a very... dedicated critic, too [crank.net].
  I must admit there's a certain recursive appeal to the idea of someone being notable enough for a Wikipedia entry purely because of his vehement attempts to avoid being mentioned on Wikipedia.
  As usual, the talk page [wikipedia.org] has lots of entertaining dirt.
  (Uncyclopedia has the real low-down [uncyclopedia.org], of course.)
- - Re: (Score:2)
    
    by Etherwalk ( 681268 ) writes:
    
    Legal != ethical.
    
    A school's honor code may be very different from a nation's copyright laws. (As they should be.) Ideally, if you come up with an idea in conversation with a few friends around a coffee table, and they contribute meaningfully to the genesis of the idea, you'll cite, thank them, or credit them in the finished product. But from a copyright status, while you can copyright the form of an idea, you can't usually copyright the idea itself--which is why you can write a new horror novel, or a new
  - - Re: (Score:2)
      
      by Etherwalk ( 681268 ) writes:
      
      Actually, I think his complaint was about students who didn't have an original idea. (Although I suppose it was worded ambiguously.) If you're analyzing a text and come up with an idea on your own, that's fine; but if you come up with the same idea by reading an article about the text, you should cite the article. That's fair.
The proof of the pudding (Score:2)

by GerardM ( 535367 ) writes:

The proof of the pudding is in the eating; consider Mr Brandt comes up with a computer generated list of potential problematic articles. These are scrutinized and where needed problematic content is removed. The wiki methodology works thanks to Mr Brandt.

Conclusion; the best way of improving Wikipedia is by showing where it has a problem. Mr Brandt disproved his opinion. Live and learn. :)

Thanks,
GerardM
Daniel Brandt, valuable Wikipedia contributor (Score:5, Insightful)

by alienmole ( 15522 ) writes: on Sunday November 05, 2006 @11:49AM (#16725195)

Brandt is doing a great service to Wikipedia — checking for and reporting plagiarism. That takes dedication and hard work. It's ironic that he feels the need to present it as criticims of Wikipedia's model, when in fact he's demonstrating the power of contributions from many people with different motivations. Even if the motivation is anti-Wikipedia, Wikipedia just absorbs the input and grows stronger.

"If you strike me down, I shall become more powerful than you could possibly imagine..." -- Obi Wiki-nobi

Share
twitter facebook
- Re: (Score:2)
  
  by Pharmboy ( 216950 ) writes:
  
  I posted an article on Wikipedia that a copy of a webpage I had written (and own) on another site. It was taken down on Wikipedia within 24 hours until I posted on the FIRST site that the information was under the GPDL, which took me all of 5 minutes, then the article was restored. No harm, no foul, the editor was just taking no chances. I would say they are pretty good at catching potential copyright issues, at least from MY experience.
  
  Besides, 142 out of 1,500,000 articles is only 0.009% of the content
- - I don't know (Score:2)
    
    by khallow ( 566160 ) writes:
    
    Is it such a good idea, checking for and reporting plagiarism? While that takes dedication and hard work, it's notable that he feels the need to present it as criticims of Wikipedia's model, because in fact he's demonstrating the power of plagarism from many people with different motivations. Even if the motivation is anti-Wikipedia, Wikipedia just absorbs the input and grows stronger. That doesn't seem a good thing.
    - Re: (Score:2)
      
      by tgv ( 254536 ) writes:
      
      Generally speaking, checking for plagiarism and reporting it, is a sound idea. It does not take a lot of dedication and hard work. It's understandable Brandt presents the cases as criticism to Wikipedia, but it does not cripple it. With every slash, Wikipedia grows stronger. It won't be long before many people with different motivations will be told they are plagiarizing Wikipedia.
From an ex-wikipedia administrator (Score:2)

by BMIComp ( 87596 ) writes:

I used to be a wikipedia administrator, before resigning due to time constraints. However, we would catch a lot of the copyright issues. I mean, when you're reading an article, and part of its plagerized, it's usually really obvious. The plagarized part usually doesn't fit into the rest of the article.. and you can just tell that the average editor didn't write that copy. (Just as I'm sure a teacher can tell one of his/her students didn't write a plagerized essay) Once you found the possibly infringin
142 out of 12,000? (Score:2)

by MMC Monster ( 602931 ) writes:

142 articles out of 12,000 is certainly a problem, but actually not much of one. I'm sure it he made his script public (I have no idea if he did so. In the /. tradition, I did not RTFArticle) and the wikipedia were to use it, it would be of benefit. Not to automatically tag articles as plagiarism, but at least tag them for further evaluation by an editor.

Buy, hey, 142/12000 is less than 2%. I would have thought the percentage would have been at least 5%.
- Re: (Score:2)
  
  by AtomicBomb ( 173897 ) writes:
  
  I am quite sure the ratio would be the same or even higher if the wiki critic managed to compare published books with existing copyright material. Many so-called experts are no exception to this especially when they are writing "supportive chapters" for their books (e.g. the video hardware technology review for a software research/professor writing a book about OpenGL vs DirectX)....
  - Re: (Score:2)
    
    by John Hasler ( 414242 ) writes:
    
    I wonder what the "plagiarism rate" is in Britannica?
US Gov copyright? (Score:2, Insightful)

by julesh ( 229690 ) writes:

Articles with offending passages have been stripped of most text. An entire paragraph in Alonzo Clark's entry, for instance, was deleted, leaving the article with the bare-bones: "Alonzo M. Clark (August 13, 1868-October 12, 1952) was an American politician who was Governor of Wyoming from 1931 to 1933."

The original article, Brandt said, was copied from a biography on the Wyoming state government site.

Err... I thought works of the US Government were generally free from copyright...?
- Re: (Score:2)
  
  by athmanb ( 100367 ) writes:
  
  Only those of the federal government. Those of most states aren't.
- Re:US Gov copyright? (Score:4, Insightful)
  
  by DragonWriter ( 970822 ) writes: on Sunday November 05, 2006 @12:53PM (#16725857)
  
  Err... I thought works of the US Government were generally free from copyright...?
  
  (1) The Wyoming state government is not the US government: state government works are not generally free from copyright.
  
  (2) Plagiarism is separate from copyright violation, anyway. Using material that is not subject to copyright or is in the public domain that is from one unique identifiable source without crediting the source is plagiarism, as is using copyright material in a way that does not violate copyright without attribution (say, fair use.) Plagiarism isn't a violation of the law, but a violation of commonly accepted standards of integrity when it comes to not claiming other's work as your own.
  
  Parent Share
  twitter facebook
  - Re: (Score:2)
    
    by imsabbel ( 611519 ) writes:
    
    But seeing that the policy of wikipedia FORBITS original reseach or works to be presented, i dont think that plagiatism isnt really that much of a violation here.
    Everybody with half a brain can suggest that the knowledge didnt manifest itself out of thin air, even without citations given.
- Re: (Score:2)
  
  by asuffield ( 111848 ) writes:
  
  Only in theory. They figured out a way to work around that pesky law a long time ago - a private contractor is given the task of 'producing' the work, with assistance supplied by the government. "Assistance" here means that the government supplies all the people who do the actual work on it. The contractor then sells the copyright to the government. This little legal fiction results in a work that was produced entirely by government employees and using government funds, that is copyrighted and owned by the
- - Re: (Score:2)
    
    by AxelBoldt ( 1490 ) writes:
    
    Citations are still required, even for the work of Government officials.
    By (often ignored) Wikipedia policy, which requires sourcing of all statements, but not by law.
Biographical articles. (Score:4, Funny)

by Anonymous Coward writes: on Sunday November 05, 2006 @12:16PM (#16725463)

It's very lazy of of the Wikipedia authors to enter the same biographical information as other sites.
They should write new and interesting histories for all these people rather than using the same old worn out ideas that are on so many places on the net.
All it takes is a little imagination.
A new birth place, better achivements (why could hitler not have discovered the cure for cancer and be the first man on the moon? It's better than the depressing story on Wiki at the moment.) and some creative editing would solve this problem once and for all.

Some Wiki articles are already better and contain things about people that have never happened, but sadly these often get put back to the same old boring stories almost as soon as the changes are made.

Share
twitter facebook
- Re: (Score:2)
  
  by _Sprocket_ ( 42527 ) writes:
  
  why could hitler not have discovered the cure for cancer and be the first man on the moon? It's better than the depressing story on Wiki at the moment.
  
  I also understand he was responsible for trippling the population of African elephants during his lifetime.
- Re: (Score:2)
  
  by MMC Monster ( 602931 ) writes:
  
  I always found this biography on Hitler less dull than the one from wikipedia:
  http://uncyclopedia.org/wiki/Adolf_Hitler [uncyclopedia.org]
ok methodology, bad analysis (Score:2)

by fermion ( 181285 ) * writes:

In this kind of study, basing the conclusion on the presence of few hits would characterize the study as faith based science.
First, the sample size was 12,000. Where did that number come from? Were the samples picked randomly? Assuming so, is 12,000 a statistically an effective sample size? And if the samples are random, and the size is sufficient, is that 142 articles statistically significant, that is, are the number of matches outside the margin of error? In other words, does the sample size, sele
- Re: (Score:3, Informative)
  
  by Skippy_kangaroo ( 850507 ) writes:
  
  12,000 is easily enough to be statistically effective. Election polling gets acceptable results with samples of about 1,000.
  
  Assuming that it is a binomial distribution then p=142/12000=0.0118, q=0.9882, n=12000 which means the standard error is sqrt(npq)=11.5 (approximately). Thus a 95% confidence interval is that the true number of plagiarised articles in the sample lies between 165 and 119.
  
  And this is only plagiarism from on-line sites that are indexed by Google. Plagiarism from dead tree sources could we
Confused? (Score:2)

by superstick58 ( 809423 ) writes:

I'm confused by the concept of plagiarism on wikipedia. For example, the article describes a biography copied from a government website. Isn't the point of Wikipedia to catalog and assemble information? How is copying an openly published biography from a government website considered plagiarism? Wikipedia is not being sold. No one is taking credit for the articles. Most cases, the original info is cited anyway. Anyway, please let me know what I'm missing here (which is probably a lot).
- Re: (Score:2)
  
  by AxelBoldt ( 1490 ) writes:
  
  Plagiarism is not a legal term, it's a term used in journalism and academia to describe taking somebody else's words or ideas and presenting them as your own, without attribution. In these realms, it is considered unethical.
  If you copy somebody's words, and these words are not in the public domain (for instance because the author is long dead or works for the U.S. government), and you can't defend the use as "fair use", then it's a civil offense and they can sue you (in some countries and severe cases it'
Even Virus authors contribute (Score:2)

by tmk ( 712144 ) writes:

Authors of malware are trying to exploit the good reputation of Wikipedia to infect PCs with their malicious software. In a mass e-mail, recipients were told to download a "security update" for windows from a Wikipedia site.

The attackers had used a Wikipedia feature that archives all previous versions of articles when changes have been made. The malicious page thus continued to exist in the archive, and the attackers were able to point to it in mass emails.

See here [heise.de] , here [techworld.com] and here [theregister.co.uk].
- Re: (Score:2)
  
  by hawaiian717 ( 559933 ) writes:
  
  There is a way to deal with this. If an article is deleted, the history gets erased. An administrator could copy the current, clean content of the article, delete the article, then recreate it from the clean version.
  - Re: (Score:2)
    
    by tmk ( 712144 ) writes:
    
    That is neither allowed nor effective. Administrators can purge single revisions of an article and keep the rest. By erasing the whole article they would erase all informations abiout the article authors, whoich is not allowed bei the GFDL.
- - Re: (Score:2)
    
    by tmk ( 712144 ) writes:
    
    I submitted this two days ago with better sources and more details...
Turns out they weren't plagiarized... (Score:2)

by cliveholloway ( 132299 ) writes:

They were just authored by Roland Piquepaille. His articles are always all his own work, so it must be a mistake in the program.
Re: (Score:2)

by account_deleted ( 4530225 ) writes:

Comment removed based on user account deletion
- Re: (Score:2)
  
  by DragonWriter ( 970822 ) writes:
  
  Is plagiarism an issue for Wikipedia?
  Yes.
  ut legally, the real issue here is Copyright, isn't it?
  Not all issues are legal issues.
  There is no copyright in facts. Therefore, nonfiction works are open to have the facts used in Wikipedia. Where a verbatim transcription would not be fair use, someone needs to paraphrase.
  The issue here is verbatim use, anyway. An automated script is going to have more trouble finding use of "facts" from another source that aren't verbatim copies of the presentation.
- Verifiability (Score:2)
  
  by tepples ( 727027 ) writes:
  
  Wikipedia's job is to provide accurate information.
  
  Not exactly. The job of Wikipedia (or for that matter any other general encyclopedia) is to provide verifiable [wikipedia.org] information from reliable sources. Verifiability > truth until the truth becomes verifiable.
Wikipedia bashing du jour (Score:2)

by mabu ( 178417 ) writes:

It seems to be th3 c00l3ss to bash Wiki lately, but the bottom line is there is no encyclopedic reference that comes close. The media and other pseudo-pundits who seem to resent any influential source of information that doesn't have obvious corporate influence (read: money-based control) as a major threat and they do whatever they can to discredit Wikipedia. Aside from a tiny subset of controversial articles that routinely get vandalized, and another tiny subset of plagiarism, this issue is likely to be
Re: (Score:2)

by account_deleted ( 4530225 ) writes:

Comment removed based on user account deletion
Here is the link to my report (Score:2)

by Everyman ( 197621 ) writes:

Why is this news? Maybe because the Associated Press says it's news, and it's in hundreds of newspapers?

Why should Slashdotters care? Because while AP doesn't use links, Slashdot should have the courtesy of linking to the original sources that AP used to generate the report. (Plus AP also checked with Jimmy Wales for a reply, which is expected from professional reporters.)

The report is at http://www.wikipedia-watch.org/psamples.html [wikipedia-watch.org]

Wikipedia's own newsletter reports on it here:
http://en.wikipedia.org/wiki/W [wikipedia.org]
- What Brandt _should_ do, rather than crowing (Score:3, Insightful)
  
  by Howzer ( 580315 ) * writes:
  
  Is release the script or code that he used to generate his 142 plagiarised articles out of 12,000.
  
  Such a script, if tuned and more widely applied, could be extraordinarily useful in weeding out future instances of plagiarism.
  142 articles flagged, 142 articles fixed within hours. That's Wikipedia working as no dead-tree encyclopedia can.
  Of course, Brandt would never do anything as useful as that, but will probably content himself with continuing to "shoot from the hip" and claim this as a blow against
Brandt vs. Wikipedia (Score:2)

by mako1138 ( 837520 ) writes:

Brandt has a long-standing (well, year-old) beef with Wikipedia. You can read about it, ironically enough, in the Wikipedia article about him [wikipedia.org].

He got into a dispute because he didn't like having his biography on WP (though it was constructed from publicly available news sources). He was generally combative and belligerent, and so was blocked and banned various times; check out the Talk archives for details. Afterwards he started a webpage where he attempted to list the real-world identities of the editors in
- Re: (Score:2)
  
  by bigbigbison ( 104532 ) writes:
  
  I'd never heard of Brandt before this, but he sounds like an ass. His namebase.org website is ranked low in google, so he starts google-watch.org. He doesn't like his wikipedia bio, so he starts wikipedia-watch.org. Any bets on how long it takes him to start slashdot-watch.com????
Not an unflattering biography (Score:2)

by iabervon ( 1971 ) writes:

Daniel Brandt is against Wikipedia's portrayal of him not because of it being unflattering (it is, in my opinion, if anything oddly sympathetic to his position, despite his position being that it shouldn't exist at all), but because of his privacy concerns. He's a privacy activist with a particular focus on the actions of information organizing sites, and so he's not unexpectedly against the existance of unauthorized widely-available detailed biographies. He's gone so far as to complain about CIA and NSA we
- Especially since he used to sell such info himself (Score:2)
  
  by Reziac ( 43301 ) * writes:
  
  From the Wiki article:
  
  "From the 1960s onwards, Brandt collected clippings and citations pertaining to influential people and intelligence matters. In the 1980s, through his company Micro Associates, he sold a database of citations of these clippings, books, government reports, and other publications."
  
  Pot, kettle, hello.....??!
Brandt's paper and Wikipedia's response (Score:2)

by AxelBoldt ( 1490 ) writes:

Brandt's original paper is here [wikipedia-watch.org], explaining his methodology and giving the complete list of articles he found. Wikipedia's response is here [wikipedia.org], where people go through the list one by one and also check the other contributions of users who have added copyrighted content. Wikipedia also has a bot [wikipedia.org] which aims to detect newly added copyright violations by searching Google.
142 isn't bad. (Score:2)

by Maxo-Texas ( 864189 ) writes:

It's great this guy created a program to make it easier for them to avoid this problem.

That's the great thing about open source and projects like wiki.

You encounter a problem, it's very easy for people to fix it quickly.

If those 142 items are real, they are probably already being fixed now if not all fixed.
Brandt's odd sayings (Score:2)

by clap_hands ( 320732 ) writes:

"They present it as an encyclopedia," Brandt said Friday.

Well, yes. Not that odd, really, given that it is an encyclopedia.

"They go around claiming it's almost as good as Britannica."

Actually, Wikipedians don't, in my experience. Most are quite sober when it comes to comparisons with Britannica. Brandt may be referring to the journal Nature, which did make such a claim for science articles.

They are trying to be mainstream respectable.
Wikipedia is already pretty darn mainstream, and if by "respectable" Brand
If you equate good with referenced (Score:2)

by Kjella ( 173770 ) writes:

"They present it as an encyclopedia," Brandt said Friday. "They go around claiming it's almost as good as Britannica. They are trying to be mainstream respectable."

Whether something is plagerized or not, doesn't really impact the quality of it. If someone copied a great article into Wikipedia, then Wikipedia has a great article - just through foul play. There's previously been comparisons which have shown Wikipedia to be just as accurate as Britannica. Now, it's been a while since I looked at a dictionary,
Other concerns about Wikipedia (Score:2)

by meburke ( 736645 ) writes:

In the Encyclopaedia Britannica and other published, for-sale reference works, the articles' sources are not only attributed, but the author of the article is attributed and his/her credentials displayed as a guide to their qualifications in providing the article.

Now, an article presenting facts can be written by someone who has no academic qualifications but still represents the facts fairly and accurately, so I don't claim that a person MUST be academically qualified to write a good article, nor do I clai
Plagarism is common but usually promotional (Score:2)

by Animats ( 122034 ) writes:

Plagarism shows up frequently in Wikipedia, but usually it's promotional. Typically, company X copied their "about" page into Wikipedia. Bands and musicians, usually ones that are a legend only in their own minds, try this. A new user associated with the thing being promoted is usually responsible.
Then there are the people with a collector mindset. They create endless minor articles like "Indiana State Highway 22" and biographical articles of long-forgotten city council members. Often by cutting and
How does he know? (Score:2)

by mr_zorg ( 259994 ) writes:

If I wrote an article on some subject and then decided to share that information with Wikipedia, I may well just copy my text verbatim. Does that make it plagiarism? If I wrote the text, why can't I reuse it? How does this guy know that's not what's going on here?
- Re: (Score:2)
  
  by interiot ( 50685 ) writes:
  
  It's at least internal Wikipedia policy that there needs to be verification that the original author is posting the article (either by [wikipedia.org] modifying the original site to note that the article is released under the GFDL, or by sending an email to the Wikimedia Foundation confirming its GFDL status). Without more formal confirmation, it's difficult to say whether the off-wiki author is the same as the on-wiki one, either from a plagiarism standpoint or a legal one.
Who is Daniel Brandt anyway? (Score:2)

by YGingras ( 605709 ) writes:

You might like to know that Daniel Brandt [wikipedia.org] founded Google Watch back in the old days to protest against page rank. Yes, Google Watch was originally just against how Google didn't give mr Brandt a good page rank. Now he added some bits about privacy but I think anyone should visit Google Watch now to see how childish Daniel Brandt is. And using Google to do datamining is against the acceptable use policy anyway.
- Re: (Score:2)
  
  by The MAZZTer ( 911996 ) writes:
  
  So are my term papers.
- Re: (Score:2)
  
  by Tim C ( 15259 ) writes:
  
  But copyright infringement isn't; just being non-commercial won't necessarily save it, if infringement is indeed taking place.
- - - Re: (Score:2)
      
      by Dunbal ( 464142 ) writes:
      
      but if shit is knowledge,
      
      Then tubgirl is a smart woman...
- Re: (Score:2)
  
  by Klaidas ( 981300 ) writes:
  
  You must be new here...
- Re: (Score:2)
  
  by Solder Fumes ( 797270 ) writes:
  
  Plagiarizing on Wikipedia has to be one of the more victimless "crimes" I can think of, especially since entries are essentially anonymous and no one else is really getting quantifiable credit for using someone else's text in a wiki article.
  - victimless?!?!? (Score:2)
    
    by abigsmurf ( 919188 ) writes:
    
    Victimless crime?
    You're not only not buying the book of whoever did the (possibly expensive) research, you're not even crediting them so they get zero credit and because you've got the info you need you're even less likely to seek out the author's work! Just because the perpatrator(sp?) has little to gain commiting the crime doesn't make it victimless!
    - Re: (Score:2)
      
      by MarkByers ( 770551 ) writes:
      
      Yeah because if you took Wikipedia offline I would immediately go out and buy tons of reference books.... not!
      
      If you are the sort of person that needs to buy expensive research papers, you are not in the target audience for Wikipedia! Wikipedia is not intended to be used for professional research, it's just a little fact book that may or may not be correct, with some links to sources on each page. Nothing more. It's not going to be making a dent into your sales figures, so relax!
      
      If you support Wikipedia, ma
      - Re: (Score:2)
        
        by abigsmurf ( 919188 ) writes:
        
        you may not buy his books but you may come across his article on a site he writes for...
  - - Re: (Score:2)
      
      by Solder Fumes ( 797270 ) writes:
      
      Imitation is the sincerest form of flattery...are you coming on to me?
- Re: (Score:2)
  
  by Zeinfeld ( 263942 ) writes:
  
  Any Journal article comprised of 1% plagiarism would be subject to law suits, apologies and the journal would face ostracism.
  There is a big difference between plagarised articles and articles with plagarised passages. Pretty much every medium has a significant plagarism rate, including scholarly journals.
  The methodology in this case is more than a little suspect. At least 50% of Wikipedia is utter crap. There is fancruft, stubs, POV peddling forks. Anyone who is involved with Wikipedia will admit as muc
- - Re: (Score:2)
    
    by makomk ( 752139 ) writes:
    
    That's a very interesting allegation. Got a source for it?
  - Re: (Score:2)
    
    by goombah99 ( 560566 ) writes:
    
    cute but this has nothing to do with plagiarism. Press releases are meant to be copied.
- - Re: (Score:2)
    
    by goombah99 ( 560566 ) writes:
    
    147/12000 = 1.2%
- How works the Wherebot? (Score:2)
  
  by tmk ( 712144 ) writes:
  
  Is there a description how this bot identifies plagiarism? Does he search for random edits?
- Re: (Score:2)
  
  by Fnkmaster ( 89084 ) writes:
  
  Sorry, but Brandt is a fucking nutjob. Just look around on his sites. That is not a stable, coherent person.
  - Re: (Score:2)
    
    by remembertomorrow ( 959064 ) writes:
    
    Wow, you're right.
    
    This guy is almost on the same level as Jack Thompson in terms of stupidity/ignorance.
- Re: (Score:2)
  
  by mabu ( 178417 ) writes:
  
  The guy's got a 501(c)3 corporation dedicated to bashing Wiki. My guess is it's funded by media and other encyclopedia makers. Follow the money and what you probably will find out about these people is much more disgusting than any transgression on the part of Wikipedia.
- Re: (Score:2)
  
  by MostAwesomeDude ( 980382 ) writes:
  
  I'll bite, mostly because people might actually believe what you're saying.
  
  Daniel Brandt doesn't like Wikipedia. His article there was started 'against his wishes,' and although he managed to get it deleted once by a few choice threats. it was quite rapidly created again. Ironically, the community now agrees that his anti-Wikipedia rantings have made him notable enough to be included in the encyclopedia.
  
  Mr. Brandt is certainly not a nice person. While your words "politician" and "Republican" are completely
  - Re: (Score:2)
    
    by DragonWriter ( 970822 ) writes:
    
    However, he is well aware that Wikipedia's "no copyright violations" policy requires users to immediately quash plagarized content.
    How can one be "well aware" of something that isn't true? Wikipedia's copyright policies (WP:C and WP:COPYVIO) address copyright violations, not plagiarism. You can have a copyright violation without plagiarism—for instance, if the use of properly quoted, properly cited material exceeds legal "fair use", it is not plagiarism while it is a copyright violation. And you can l
    - Re: (Score:2)
      
      by MostAwesomeDude ( 980382 ) writes:
      
      Heh, and usually I'M the pedant. You're right, that should read "...quash copyrighted content used improperly. Thanks.
- Not True (Score:2)
  
  by viewtouch ( 1479 ) writes:
  
  Daniel Brandt can't edit Wikipedia so it's not true that anyone can edit it.
- Re: (Score:2)
  
  by Smallpond ( 221300 ) writes:
  
  It's interesting that Wikipedia also removed the edit history [wikipedia.org] so that we can't tell what was there or who contributed it in the first place.
  - Re: (Score:2)
    
    by interiot ( 50685 ) writes:
    The actual contents of the deleted versions obviously won't be visible, since it's a legal issue. The edit history metadata used to be visible to everyone, until vandals started being "funny", and leaving personal information in edit summaries, so unfortunately the edit history isn't automatically visible now. But admins may cut-n-paste the history on request. Here's that one:
    
    07:17, 23 October 2006 Alphachimp (Talk | contribs | block) deleted "Alonzo M. Clark" (g12)
    
    21:18, 21 May 2006 . . Siva1979 (Ta
- - Re: (Score:2)
    
    by John Hasler ( 414242 ) writes:
    
    He did not check all the pages. He only checked 12,000. How did he choose them?

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

That doesn't seem like alot (Score:2, Insightful)

Re: (Score:2, Insightful)

Re:That doesn't seem like alot (Score:4, Insightful)

Re: (Score:3, Informative)

Re: (Score:2)

Re: (Score:3, Insightful)

Re: (Score:2, Interesting)

Re: (Score:2)

Re: (Score:2, Insightful)

Re:That doesn't seem like alot (Score:5, Funny)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

I hope you're not contributing... (Score:2)

Impressive (Score:4, Interesting)

Re: (Score:2)

Are you going to the prom? (Score:2)

Depends on what you're writing (Score:2)

Re: (Score:2)

Is this sample as biased as Wakeman from 151? (Score:2)

Not shocking, but not a big deal (Score:3, Interesting)

Pfizzle. (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

The proof of the pudding (Score:2)

Daniel Brandt, valuable Wikipedia contributor (Score:5, Insightful)

Re: (Score:2)

I don't know (Score:2)

Re: (Score:2)

From an ex-wikipedia administrator (Score:2)

142 out of 12,000? (Score:2)

Re: (Score:2)

Re: (Score:2)

US Gov copyright? (Score:2, Insightful)

Re: (Score:2)

Re:US Gov copyright? (Score:4, Insightful)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Biographical articles. (Score:4, Funny)

Re: (Score:2)

Re: (Score:2)

ok methodology, bad analysis (Score:2)

Re: (Score:3, Informative)

Confused? (Score:2)

Re: (Score:2)

Even Virus authors contribute (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Turns out they weren't plagiarized... (Score:2)

Re: (Score:2)

Re: (Score:2)

Verifiability (Score:2)

Wikipedia bashing du jour (Score:2)

Re: (Score:2)

Here is the link to my report (Score:2)

What Brandt _should_ do, rather than crowing (Score:3, Insightful)

Brandt vs. Wikipedia (Score:2)

Re: (Score:2)

Not an unflattering biography (Score:2)

Especially since he used to sell such info himself (Score:2)

Brandt's paper and Wikipedia's response (Score:2)

142 isn't bad. (Score:2)

Brandt's odd sayings (Score:2)

If you equate good with referenced (Score:2)

Other concerns about Wikipedia (Score:2)

Plagarism is common but usually promotional (Score:2)

How does he know? (Score:2)

Re: (Score:2)

Who is Daniel Brandt anyway? (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)