July 2011
Sun Mon Tue Wed Thu Fri Sat
          1 2
3 4 5 6 7 8 9
10 11 12 13 14 15 16
17 18 19 20 21 22 23
24 25 26 27 28 29 30
31            

Authors' Committee

Chair:

Matt Blackwell (Gov)

Members:

Martin Andersen (HealthPol)
Kevin Bartz (Stats)
Deirdre Bloome (Social Policy)
John Graves (HealthPol)
Rich Nielsen (Gov)
Maya Sen (Gov)
Gary King (Gov)

Weekly Research Workshop Sponsors

Alberto Abadie, Lee Fleming, Adam Glynn, Guido Imbens, Gary King, Arthur Spirling, Jamie Robins, Don Rubin, Chris Winship

Weekly Workshop Schedule

Recent Comments

Recent Entries

Categories

Blogroll

SMR Blog
Brad DeLong
Cognitive Daily
Complexity & Social Networks
Developing Intelligence
EconLog
The Education Wonks
Empirical Legal Studies
Free Exchange
Freakonomics
Health Care Economist
Junk Charts
Language Log
Law & Econ Prof Blog
Machine Learning (Theory)
Marginal Revolution
Mixing Memory
Mystery Pollster
New Economist
Political Arithmetik
Political Science Methods
Pure Pedantry
Science & Law Blog
Simon Jackman
Social Science++
Statistical modeling, causal inference, and social science

Archives

Notification

Powered by
Movable Type 4.24-en


« June 2011 | Main | September 2011 »

19 July 2011

Detecting (edit) wars

A fun modeling project from a group of physicists on Edit wars in Wikipedia:

We present a new, efficient method for automatically detecting severe conflicts `edit wars’ in Wikipedia and evaluate this method on six different language WPs. We discuss how the number of edits, reverts, the length of discussions, the burstiness of edits and reverts deviate in such pages from those following the general workflow, and argue that earlier work has significantly over-estimated the contentiousness of the Wikipedia editing process.

Burstiness is new to me and appears to be popular in studying communication networks. Bursty (?) processes have many occurrences in short interval and long gaps between these bursts. Where is the burstiness in the social sciences? Would it measure anything interesting?

Also, maybe I don’t look much at the acknowledgements from other disciplines, but I have never seen such a clear delineation of work contributed:

Sumi developed the main classifier, Yasseri the temporal profiles, and Rung selected the seed examples and generated the data for supervised training. Kornai advised on multilingual natural language processing and statistical analysis. Kertesz designed the research, advised on temporal studies and on modeling.

Posted by Matt Blackwell at 9:40 PM