Crowd-sourced Text Analysis: Reproducible and Agile Production of Political Data

被引:120
|
作者
Benoit, Kenneth [1 ,2 ]
Conway, Drew [3 ]
Lauderdale, Benjamin E. [4 ]
Laver, Michael [3 ]
Mikhaylov, Slava [5 ]
机构
[1] London Sch Econ, London, England
[2] Trinity Coll Dublin, Dublin, Ireland
[3] NYU, New York, NY 10003 USA
[4] London Sch Econ & Polit Sci, London, England
[5] UCL, London WC1E 6BT, England
基金
欧洲研究理事会;
关键词
PARTY; RELIABILITY;
D O I
10.1017/S0003055416000058
中图分类号
D0 [政治学、政治理论];
学科分类号
0302 ; 030201 ;
摘要
Empirical social science often relies on data that are not observed in the field, but are transformed into quantitative variables by expert researchers who analyze and interpret qualitative raw sources. While generally considered the most valid way to produce data, this expert-driven process is inherently difficult to replicate or to assess on grounds of reliability. Using crowd-sourcing to distribute text for reading and interpretation by massive numbers of nonexperts, we generate results comparable to those using experts to read and interpret the same texts, but do so far more quickly and flexibly. Crucially, the data we collect can be reproduced and extended transparently, making crowd-sourced datasets intrinsically reproducible. This focuses researchers' attention on the fundamental scientific objective of specifying reliable and replicable methods for collecting the data needed, rather than on the content of any particular dataset. We also show that our approach works straightforwardly with different types of political text, written in different languages. While findings reported here concern text analysis, they have far-reaching implications for expert-generated data in the social sciences.
引用
收藏
页码:278 / 295
页数:18
相关论文
共 50 条
  • [21] Transportation hazard spatial analysis using crowd-sourced social network data
    Ghandour, Ali J.
    Hammoud, Huda
    Telesca, Luciano
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2019, 520 : 309 - 316
  • [22] The political is personal: an analysis of crowd-sourced political ideas and images from a Massive Open Online Course
    Humphrey, Mathew
    Umbach, Maiken
    Clulow, Zeynep
    JOURNAL OF POLITICAL IDEOLOGIES, 2019, 24 (02) : 121 - 138
  • [23] Processing of Crowd-sourced Data from an Internet of Floating Things
    Montella, Raffaele
    Di Luccio, Diana
    Marcellino, Livia
    Galletti, Ardelio
    Kosta, Sokol
    Brizius, Alison
    Foster, Ian
    PROCEEDINGS OF WORKS 2017: 12TH WORKSHOP ON WORKFLOWS IN SUPPORT OF LARGE-SCALE SCIENCE, 2017,
  • [24] A Crowd-Sourced Data Based Analytical Framework for Urban Planning
    Li Dong
    Long Ying
    China City Planning Review, 2015, 24 (01) : 49 - 57
  • [25] Road Grade Estimation Using Crowd-Sourced Smartphone Data
    Gupta, Abhishek
    Hu, Shaohan
    Zhong, Weida
    Sadek, Adel
    Su, Lu
    Qiao, Chunming
    2020 19TH ACM/IEEE INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING IN SENSOR NETWORKS (IPSN 2020), 2020, : 313 - 324
  • [26] Robust CNNs for detecting collapsed buildings with crowd-sourced data
    Gibson, Matthew J.
    Kaushik, Dhruv
    Sowmya, Arcot
    2019 JOINT URBAN REMOTE SENSING EVENT (JURSE), 2019,
  • [27] Route Recommendations to Business Travelers Exploiting Crowd-Sourced Data
    Collerton, Thomas
    Marrella, Andrea
    Mecella, Massimo
    Catarci, Tiziana
    MOBILE WEB AND INTELLIGENT INFORMATION SYSTEMS, MOBIWIS 2017, 2017, 10486 : 3 - 17
  • [28] Detecting Label Errors in Crowd-Sourced Smartphone Sensor Data
    Bo, Xiao
    Poellabauer, Christian
    O'Brien, Megan K.
    Mummidisetty, Chaithanya Krishna
    Jayaraman, Arun
    3RD INTERNATIONAL WORKSHOP ON SOCIAL SENSING (SOCIALSENS 2018), 2018, : 20 - 25
  • [29] Mining Urban Traffic Condition from Crowd-Sourced Data
    Mai-Tan H.
    Pham-Nguyen H.-N.
    Long N.X.
    Minh Q.T.
    SN Computer Science, 2020, 1 (4)
  • [30] Collecting Weighted Coercions from Crowd-Sourced Lexical Data for Compositional Semantic Analysis
    Lafourcade, Mathieu
    Mery, Bruno
    Mirzapour, Mehdi
    Moot, Richard
    Retore, Christian
    NEW FRONTIERS IN ARTIFICIAL INTELLIGENCE (JSAI-ISAI 2017), 2018, 10838 : 214 - 230