Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Slashdot Log In

Log In

Create Account  |  Retrieve Password

Infrequent Anonymous Cowards Reliable on Wikipedia

Posted by samzenpus on Wed Oct 17, 2007 08:36 PM
from the a-visit-from-the-correction-fairy dept.
Hugh Pickens writes "Researchers at Dartmouth University have recently discovered that infrequent anonymous contributors, so called "Good Samaritans," are as reliable as registered users who update constantly and have a reputation to maintain. A graph from page 31 of the group's original paper (pdf file) shows that the quality of contributions of anonymous users goes down as the number of edits increases while quality goes up with the number of edits for registered users."
+ -
story

Related Stories

This discussion has been archived. No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More
Loading... please wait.
  • Not news (Score:5, Interesting)

    by Titoxd (1116095) on Wednesday October 17 2007, @08:41PM (#21019655) Homepage
    Unlike what some users may tell you, many anonymous users contribute content and not vandalize. The quality of the edits per se is all over the place, but this is to be expected, as there is no way a new contributor can know all the nuances of the in-house referencing system [wikimedia.org], or the indications made by the Manual of Style [wikipedia.org]. But they do try.

    ~~~~
    • Re:Not news (Score:5, Insightful)

      by hedwards (940851) on Wednesday October 17 2007, @08:57PM (#21019773)
      I don't think that this is really all that surprising, people that just do one or two edits aren't typically doing it because the have an investment, they are generally doing it because they found an error.

      Some presumably do deface the pages, but I don't find it terribly surprising that somebody that primarily uses wikipedia would be more reliable than somebody that spends most of their time building a reputation. There's just so much more incentive to fix it if you are using it. That isn't to say that named contributers are inherently bad.
        • Re:Of course... (Score:4, Interesting)

          by maggard (5579) <michael@michaelmaggard.com> on Wednesday October 17 2007, @11:41PM (#21021049) Homepage Journal

          Really? I just checked the half dozen edits Ive made over the past six months. The 5 trivial ones are all intact, and the extensive one (transportation in the town I live in) was edited & rearranged, for the better.

          Perhaps the quality of the edits is important.

        • Re: (Score:3, Insightful)

          I was tempted to reply with "rv or" but seriously your post is a wild claim. I have 200+ pages on my watchlist, and while I cannot speak for other pages, I can tell you that anonymous edits do not get reverted by default. I've personally reverted some edits, and others have reverted others, but by far most edits have stayed in one form or another.
          • Re:Of course... (Score:4, Informative)

            by 1u3hr (530656) on Wednesday October 17 2007, @11:09PM (#21020875)
            But I have yet to see a single edit being wrongly reverted by a bot.

            Happened to me once. I noticed a list of "movies about the Mafia" was full of titles just about crime, so I deleted those I knew were not Mafia related. Then later I see they've been reverted by some asshole (later I worked out it was a bot) that had decided I was a vandal (as stated in the comment).

            • Re:Of course... (Score:4, Informative)

              by Titoxd (1116095) on Wednesday October 17 2007, @11:30PM (#21020971) Homepage
              Well, did you provide a reason why you deleted the content?

              The bots are not infallible. They do catch a ton of the really ridiculous crap that people add to Wikipedia, but they miss some, and have a few false positives as well.

              If you are not some random vandal, one thing that you could (actually, should do, as I strongly recommend it) is that you specify why you remove content in the "Edit summary" box. If you say, "Removing movies unrelated to mafia", the bot leaves you alone, or if someone sees the bot revert your removal for an invalid reasons, they can always revert the bot. I've done that myself many a time.

              Remember: Humans watch the Recent changes [wikipedia.org] feed too. If you provide a reason for the human, the human may leave you alone. Otherwise, you're just a random IP that is removing content for no reason whatsoever, which happens all day, every day. ~~~~
                  • Re:Of course... (Score:5, Insightful)

                    by Carthag (643047) on Thursday October 18 2007, @12:52AM (#21021403) Homepage
                    1. Look at the edit summary 2. Find the name of the bot that reverted you & click it 3. Find the user who runs the but and go to his page 4. Talk with him. You're way too angry for way too little reason. I'd understand if you got angrty if someone shot your or something, but getting angry over a minor misunderstanding is ... a bit much.
                    • Re: (Score:3, Interesting)

                      1,2,3...

                      Yes, I did try to communicate with him, and no I wasn't abusive. He ignored me.

                      I'm not "way too angry". I only mentioned it at all in answer to a direct question. At the time I was pissed because I'd spent time and thought "giving back" to the community only to have it deleted and be insulted for my trouble. I got over it a few minutes later and haven't mentioned it to anyone till now. Now I know Wikipedia is infested with self-important twats who like to play power games, so I don't waste my

          • This anti vandal bot's talk page [wikipedia.org] has plenty of examples of mistakes. ~~~~
    • I probably edit two entries a month (mainly corrections, sometimes minor additions). I'm not registered and I cannot see any real benefiit in doing so. Perhaps being registered allows you to add pages or modify GW Bush's entry or something but I have not been motivated enough to find out what the benefits are.
      • Re: (Score:3, Informative)

        It allows you to do both those things (create pages, and edit semi-protected pages). It also allows you to not have an annoying captcha pop up when you try to add an external link, and reduces the chance of your edits being mistaken for vandalism and reverted even though they were perfectly good.
      • Re: (Score:3, Informative)

        There are several tangible benefits to having a registered account. The primary one I would think of is GFDL attribution: since your IP is not guaranteed to be stable (at least for most people), your edits cannot be attributed back to you as easily.

        Additionally, as an anonymous editor, you can't edit semi-protected pages, but you cannot upload images either. You cannot move pages either, nor create pages in the article namespace. You can still create talk pages, but if you want to create an article, you
    • Re: (Score:3, Interesting)

      Even for a registered user like myself this is difficult. I never got around to editing much, but when I did I'd get a notice criticizing my edit for putting periods and commas "inside quotation marks," as opposed to "outside". It seems like the other editors just disregarded the fact that I corrected some major grammatical flaws. I soon stopped making edits while logged in and just waited on other people to fix most problems. :(

      (By the way, in US english, commas and periods should ALWAYS go inside the q
      • Re:Not news (Score:5, Funny)

        by Whiteox (919863) <htcstech AT gmail DOT com> on Wednesday October 17 2007, @10:43PM (#21020663) Journal

        (By the way, in US english, commas and periods should ALWAYS go inside the quotes.)
        * Citation Required

      • Re: (Score:3, Funny)

        by Anonymous Coward
        Regarding the placement of commas, periods and quotes, I just had to share this. http://worsethanfailure.com/Articles/Its-a-Different-Set-of-Rules.aspx [worsethanfailure.com]
      • Re:Not news (Score:5, Interesting)

        by FuturePastNow (836765) on Wednesday October 17 2007, @11:25PM (#21020953)
        I am not a registered Wikipedia user, so anything I do is anonymous. I've only made a few (3 or 4) edits to articles, always to fix minor typos or spelling errors I've seen while reading.

        Every time I have done so, it has been rolled back within minutes, which I assume means that registered editors are watching for anonymous changes and removing them no matter what. As a result, my current attitude towards Wiki editors can be summarized with the words "fuck you."

        Hopefully, some of those pricks will read this article and change their attitude, but I doubt it.

        • Re: (Score:3, Funny)

          I've only made a few (3 or 4) edits to articles, always to fix minor typos or spelling errors I've seen while reading. Every time I have done so, it has been rolled back within minutes

          [citation needed]

          • Re: (Score:3, Insightful)

            If the edit you're making is minor (e.g. fixing a typo) the last thing you feel like doing is getting into a pissing contest over it.
      • Re: (Score:3, Interesting)

        (By the way, in US english, commas and periods should ALWAYS go inside the quotes.)

        Yes, but for a strange reason. American newspapermen couldn't be bothered with the nuances of the English language, even though some of the nuances were linguistically valuable. In this case, placement of punctuation inside or outside quotation marks relays information about the quote. I'm American so, you know, screw the redcoats and all that, but I think our stateside grammarians dropped the ball on this one.
        • Re:Not news (Score:5, Interesting)

          by dgatwood (11270) on Thursday October 18 2007, @01:01AM (#21021449) Journal

          The American style is inconsistent. We put question marks and exclamation points inside the quotes only if it is part of the quoted content. Note the difference in these sentences:

          Are you "working a late night"? (Translation: are you sleeping with your secretary? See comment below about terms used ironically.)
          I asked him, "Are you working a late night?" (I asked an innocent question.)

          Similarly, we treat punctuation with regards to parentheses in this way. A period goes inside the parentheses only if it is a complete sentence.

          I am smarter then you (but you knew that).
          I am smarter than you. (My dog is smarter than you.)

          However, the American style says to put periods and commas inside the quotation marks in all cases. I would argue that American usage is simply wrong here, as it is thoroughly inconsistent with all other punctuation combinations. Thus, I make it a point (despite being American) to ignore it and follow the much more rational British rules. The rules for punctuation should logically be determined by whether the punctuation is truly part of what is being quoted or not.

          For example, if I use a term in an ironic way, I might put that in quotes.

          The "nice girl" dumped me yesterday.
          In that context, if that were at the beginning or end of a sentence, punctuation should logically go outside the quotes.

          I got dumped by a "nice girl".

          Putting it inside the quotes implies that you are quoting the period as though "nice girl" were a complete sentence or some reasonable facsimile thereof. It just doesn't make sense. If a wookiee can live on Endor....

          The same holds true if you are using quotes to define new terminology (though this is less frequently done with quotes these days and more frequently done with boldface text or other typographic conventions).

          This type of memory is known as "Random Access Memory".

          This stands in contrast with cases in which punctuation would logically be part of the quote.

          "This is stupid," he said. (The comma takes the place of a period.)
          He said, "This is stiupid."

          But I digress.

    • Re:Not news (Score:4, Interesting)

      by Petrushka (815171) on Wednesday October 17 2007, @10:05PM (#21020375)

      The graph they show is practically meaningless.

      • First, the graph shows that the retention rate for all contributors is below 1%. I find that pretty hard to swallow in the first place, and it makes me doubt that it's based on good data, though I suppose the decimal point could be an error (I'm hoping so).
      • Secondly, the graph shows a continuous line, though the x axis is clearly discrete. Based on the graph, the retention rate for non-registered contributors who have made one edit is about 0.72%; all well and good. How is it that we are given a retention rate for contributors who have made one and a half edits? How is it that we are given a retention rate for contributors who have made no edits? Has any thought at all gone into this?

      The rest of the study may possibly have some good stuff in it, but the incompetence that has gone into this graph leaves me with grave reservations. I notice, for example, that the main body on p. 15 refers to "FIGURE 2 ABOUT HERE". I also notice that there is no figure 2. I don't think this study has a shred of credibility.

  • ha! (Score:5, Funny)

    by Anonymous Coward on Wednesday October 17 2007, @08:42PM (#21019657)
    RESPECT ME!
  • by Actually, I do RTFA (1058596) on Wednesday October 17 2007, @08:42PM (#21019661)

    That study was published by Dartmouth College. Dartmouth University is an unrelated entity in Canada.

  • Or... (Score:4, Insightful)

    by MyLongNickName (822545) on Wednesday October 17 2007, @08:43PM (#21019667) Journal
    Look at it another way... registered users who are "experts" are no better than the riff-raff.
  • by Nrbelex (917694) on Wednesday October 17 2007, @08:43PM (#21019671) Homepage
    Another interesting study might determine how many posts a person usually makes before becoming registered...
    • by Fallingcow (213461) on Wednesday October 17 2007, @08:57PM (#21019777) Homepage
      I made a few grammar, punctuation, and spelling fixes before I ever bothered to register.

      If I'm not already logged in and see a minor problem in an article, I'll usually fix anonymously. Not worth the time to log in.
    • by Raul654 (453029) on Wednesday October 17 2007, @09:20PM (#21019979) Homepage
      Doing such a study requires checkuser [wikipedia.org] access, which is something only a few people on Wikipedia have. Fortunately, I am one of them. I just sampled ten users out of the new user log [wikipedia.org]. I am assuming a 1:1 mapping between IP and user (that is, that a user made no anonymous edits except with the IP he used to register his account). The number of anonymous edits prior to registeration for each user was:

      A - 0
      B - 0
      C - 0
      D - 2
      E - 0
      F - 0
      G - 0
      H - 0
      I - 0
      J - 0

      In short: most of the people registering accounts had made no edits prior to registering. It's common knowledge on Wikipedia that something like half of all accounts registered never make any edits at all, so this makes sense.
      • Re: (Score:3, Informative)

        A large portion of new users never make edits after they register. See here the new user log [wikipedia.org]. New users that have made edits will have the word "contribs" shown in blue; otherwise, it is shown in red. For many new users that have made edits... those edits turn out to be vandalism and the account, a vandalism-only account that is blocked.

        How many of those new users you selected have made edits since registering? I think many of those you sampled will never edit, period. Not before, not after. To mak
        • Re: (Score:3, Informative)

          This is a troll, and is patently false, but I'll bite nonetheless: the Foundation's privacy policy [wikimediafoundation.org] (which governs the use of checkuser) strictly limits the conditions under which "personally identifiable data collected in the server logs, or through records in the database via the CheckUser feature" may be released. The release of aggregated, anonomyized data, such as I did above, is perfectly acceptable under the privacy policy and is a common practice in web traffic analysis
    • Re: (Score:3, Insightful)

      by Anonymous Coward

      Another interesting study might determine how many posts a person usually makes before becoming registered...

      Can't speak for the anonymous posters at Wikipedia or even any of the ACs here but for myself. The validity of a statement is the greatest when it can stand on its on without the benefit or detriment allocated to the statement by its maker. Posting as AC often draws extra scrutiny as to the validity and worth of the posting. If a moderator perceivers value in it worth their mod point application then

  • Lost Passwords? (Score:4, Interesting)

    by mochan_s (536939) on Wednesday October 17 2007, @08:44PM (#21019677) Homepage
    Maybe someone just forgot his/her password or don't want to login in library computers.
  • by User 956 (568564) on Wednesday October 17 2007, @08:44PM (#21019679) Homepage
    Researchers at Dartmouth University have recently discovered that infrequent anonymous contributors, so called "Good Samaritans," are as reliable as registered users who update constantly and have a reputation to maintain.

    Even better, the number of these "Good Samaritans" has tripled in the last six months!
  • well duh (Score:5, Insightful)

    by ILuvRamen (1026668) on Wednesday October 17 2007, @08:45PM (#21019689)
    This is to be expected. A lot of people read wikipedia to look up stuff and learn and all that. They never really wanted to edit it though cuz they're lazu. And then when they look up a topic near and dear to their heart like a specific video game or show and find something incorrect or totally lacking and just can't bear to not do something about it. But that's as far as the motivation takes them. I'd assume the majority of editors are like that. Who has like hours and hours to write really good articles all the time?
      • Re:well duh (Score:4, Funny)

        by Firehed (942385) on Wednesday October 17 2007, @09:37PM (#21020139) Homepage

        The giving-geeks-a-bad-name mom's-basement-dwelling sociopaths who adopt obscure TV series, comic book characters, or musicians and then write and champion the articles about their outre darlings.


        Digg users? Come on now, that's just being unfair. They have no "darlings" in any sense of the word.
      • Re:well duh (Score:5, Insightful)

        by kebes (861706) on Wednesday October 17 2007, @09:44PM (#21020169) Journal

        Who has like hours and hours to write really good articles all the time?
        The nuts. The fanatics. The giving-geeks-a-bad-name mom's-basement-dwelling sociopaths
        With all due respect, that is a rather narrow-minded view of the people who spend significant time contributing to Wikipedia. Do you similarly think of people who volunteer their time at soup kitchens as "Nuts. Fanatics. The giving-hard-working-people-a-bad-name social rejects." Or perhaps you think that open-source software coders are "Nuts. Fanatics. The giving-coders-a-bad-name time-wasters."

        Luckily, not everyone views volunteering as a waste of time, or indicative of fanaticism. Many people contribute to Wikipedia because they value information and education. They enjoy challenging their mind. This is their hobby (instead of Sudoku and crossword puzzles), or perhaps even their passion. This is their way of contributing to a greater good.

        We don't want to edit it because we are *adults* with lives and jobs and families and deadlines who want our encyclopedias to be encyclopedias and not some kind of bring-your-own-violin pick-up jazz concert.
        You are more than welcome to ignore the free spread of information and impromptu musical gatherings, and focus on all the important things in your adult life. However it is rather unfortunate that you cannot see the value in what other communities achieve when they willingly devote time from their busy schedules to a communal project.
      • Re: (Score:3, Funny)

        by Anonymous Coward
        "We don't want to edit it because we are *adults* with lives and jobs and families and deadlines" ... AND PLENTY OF TIME TO POST TO FUCKING SLASHDOT. You lazy bastard.
  • Depends, (Score:3, Funny)

    by Fengpost (907072) on Wednesday October 17 2007, @08:49PM (#21019715)
    Just don't cite wikipedia as a reliable source of the elephant population.
    • Re: (Score:3, Insightful)

      Lol.. I wouldn't cite it as a reliable source ever. I would use it as a starting point to get familiarity with a topic or subject or to point some one to where they could. But I wouldn't stake a grade or even an argument on the accurateness of it.

      Wikkipedia has had it's share of experts that amount to people lieing about their credentials to fix a page in a certain way and keep the tones of pages agenda driven. This is especially true for anything political or even emotional. I don't know how many times I h
  • by Anonymous Coward on Wednesday October 17 2007, @08:56PM (#21019767)
    Which is why I should be allowed to accrue karma.
  • by Raul654 (453029) on Wednesday October 17 2007, @08:59PM (#21019803) Homepage
    The graph on page 31 is the retention rate of characters contributed by an editor relative to the total edits by that editor to the article ("The dependent variable is the retention rate, R, of contributions, measured as the percent of characters retained per contribution by each contributor.")

    This metric makes sense if the wiki is new, and most of the edits are adding new content. The metric is virtually meaningless if the wiki is established, and most of the edits by a group of people are vandalism or reverts - people fixing the article will have a lower score by virtue of the fact that they are making the same edit (more or less) over and over again.

    Normally, you'd expect that the more edits a user makes, the more trustworthy he is. If he were vandalizing, he wouldn't make more than a few before being blocked. If he's making hundreds, he should be considered more trust worthy (and have a higher retention rate) than if he's new. The results here show the exact opposite for anonymous users. In short, the methodology is flawed and the results are wrong.
      • Re: (Score:3, Interesting)

        "That said, I wonder if they took reversions into account in their analysis." - yes, they did, in a negative way. If you have an otherwise high trust metric and revert someone's edit back to the old version, then your retention rate as a percentage of characters divided by total edits goes down - in other words, you become less trust worthy after doing anti-vandal patrolling.
  • by SamP2 (1097897) on Wednesday October 17 2007, @09:10PM (#21019895)
    I fully admire the eagerness of individual contributors, anonymous or not, to improve Wikipedia.

    Unfortunately, not all edits which are good-intended actually contribute to the overall quality. Of course, edits which fix simple things like revert vandalism, fix a typo, update a number etc, are all good. But the rest pose a potential problem. First off, newcomers, while well-intented, simply do not know the way Wikipedia works. They may include unsourced or poorly sourced material, insert a POV without even realising it, piss off another editor by being careless (and thus start an edit war) etc.

    But even those edits which do not break any Wikipedia rules or guidelines still can cause damage, this time much more subtle. The thing is, a (good) Wikipedia article is not just a collection of facts, even if every single fact is relevant, neutral, sourced, and deserves to be in the article. An article is a unified piece of work. It should flow to the reader, not bump. Information must be properly organized and related to each other. A major suffering of Wikipedia is the so-called "contribution creep", where people just keep dumping more and more facts into the article. The result is grossly disproportional coverage of some sections compared to others, a huge overemphasis on bullet-point lists rather than coherent paragraphs, lots of small factoids which while each good on their own right, do not belong together, parts of articles being outdated compared to other parts, and a lot of other problems which make Wikipedia look like a search result by Google rather than a real encyclopedia.

    Early on, Wikipedia's first priority was to fill its databank with stuff, and all contributions (other than those breaking policy) were welcome. Recently, WP is at the stage of more stringent enforcement of policies, as well as guidelines and styleguides. And by all means, that is very important and should be the first priority. But it's not enough to be a good encyclopedia. Making sure everything is neutral, notable, verifiable, attributed, legal, and formatted according to style, is all sub-article tasks, which you apply to a particular sentence, paragraph, or image. But then you have to pause for a moment and look at an article at the big picture. Does it flow smoothly? Are all sections balanced? Are all parts equally updated? Would an average reader get a proportional representation from the article?

    You can easily handle the sub-article problems (those that break a clearcut policy or guideline) contributions from anonymous edits (as well as non-anonymous edits). But "Contribution creep" is biggest problem to the overall article, where there is no clearcut right or wrong. And that's why, no matter how important anonymous edits are to Wikipedia (and they certainly are), the already developed articles should be marked as "revised" and new contributions screened before updating them. Not because of potential vandalism or policy violations (those are easy to fix), but precisely to manage contribution creep and make sure well-intented contributions don't introduce speedbumps to an article and break its coherent organization and flow.
    • by Titoxd (1116095) on Wednesday October 17 2007, @09:44PM (#21020171) Homepage
      You have described the dilemma of the Wikipedia editor/administrator.

      There are edits that are obviously unhelpful; there are others that are clearly helpful. But there is a gray area of edits that falls in between, and for which editors' reactions vary a lot.

      A good example is an anonymous/new editor adding unsourced information to a carefully-sourced Featured article. You can't let the information just remain there, as editors have gone through that page, double-checked the citations and validity of the statements, and generally polished the article to have its prose crisp and clean. But you cannot just revert the edit wholesale, as the edit was not done in bad faith. While sometimes the edits can be fixed, there are many times that the edits are incorrigible, and need to be completely reworked or removed (such as introducing widespread, irrelevant rumors on the biography of a celebrity).

      So, at this time, some editors remove the text, with an explanation in the edit summary. Sometimes anonymous editors read the edit summaries, sometimes they don't. Often they wonder why their text got removed, justifiably so. Some users take that personally and begin accusing us of being "grammar Nazis", or even "suppressors of the truth" (I've heard that one before). But in a way, we're just trying to keep everything in order.

      ~~~~
  • This is true... (Score:4, Interesting)

    by FuzzyDaddy (584528) on Wednesday October 17 2007, @09:36PM (#21020125) Journal
    I've made only one edit ever on Wikipedia, in the section on heat pipes. I happened to notice a minor, but not insignificant, omission.

    I would imagine that most single edits are like that - someone with a good depth of knowledge on a subject, noticing something that's not quite right. The threshold for action is high enough that you'd only do it if it was worthwhile.

  • ease of logging in (Score:5, Interesting)

    by sh3l1 (981741) * on Wednesday October 17 2007, @09:45PM (#21020181) Homepage
    I often don't log in when i want to edit an article, because i don't care how my "rep" is, i just want to fix something.
    • I'm surprised someone actually got money to research this.

      Research is useful even if it's obvious. Previously we couldn't cite anyone if we wanted to say that anons who edit once or twice make good edits. Now, thanks to this research, we can. While it's true that these researchers could spend their time and money in better questions, for example examining P=NP, but this research is still useful, if not for everything else, at least for putting it in the references of some other wiki-related research. Now, if I want to write a paper on wikis, I can cite their