Understanding Editing Behaviors in Multilingual Wikipedia

被引:9
|
作者
Kim, Suin [1 ]
Park, Sungjoon [1 ]
Hale, Scott A. [2 ]
Kim, Sooyoung [1 ]
Byun, Jeongmin [1 ]
Oh, Alice H. [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Sch Comp, Daejeon, South Korea
[2] Univ Oxford, Oxford Internet Inst, Oxford, England
来源
PLOS ONE | 2016年 / 11卷 / 05期
基金
英国经济与社会研究理事会;
关键词
NETWORK;
D O I
10.1371/journal.pone.0155305
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Multilingualism is common offline, but we have a more limited understanding of the ways multilingualism is displayed online and the roles that multilinguals play in the spread of content between speakers of different languages. We take a computational approach to studying multilingualism using one of the largest user-generated content platforms, Wikipedia. We study multilingualism by collecting and analyzing a large dataset of the content written by multilingual editors of the English, German, and Spanish editions of Wikipedia. This dataset contains over two million paragraphs edited by over 15,000 multilingual users from July 8 to August 9, 2013. We analyze these multilingual editors in terms of their engagement, interests, and language proficiency in their primary and non-primary (secondary) languages and find that the English edition of Wikipedia displays different dynamics from the Spanish and German editions. Users primarily editing the Spanish and German editions make more complex edits than users who edit these editions as a second language. In contrast, users editing the English edition as a second language make edits that are just as complex as the edits by users who primarily edit the English edition. In this way, English serves a special role bringing together content written by multilinguals from many language editions. Nonetheless, language remains a formidable hurdle to the spread of content: we find evidence for a complexity barrier whereby editors are less likely to edit complex content in a second language. In addition, we find that multilinguals are less engaged and show lower levels of language proficiency in their second languages. We also examine the topical interests of multilingual editors and find that there is no significant difference between primary and non-primary editors in each language.
引用
收藏
页数:22
相关论文
共 50 条
  • [11] MultiWiBi: The multilingual Wikipedia bitaxonomy project
    Flati, Tiziano
    Vannella, Daniele
    Pasini, Tommaso
    Navigli, Roberto
    [J]. ARTIFICIAL INTELLIGENCE, 2016, 241 : 66 - 102
  • [12] Multilingual schema matching for Wikipedia infoboxes
    Nguyen, Thanh
    Moreira, Viviane
    Nguyen, Huong
    Nguyen, Hoa
    Freire, Juliana
    [J]. International Journal of Computer Science Issues, 2012, 9 (03): : 133 - 144
  • [13] INFORMATION OVERLAP IN MULTILINGUAL WIKIPEDIA AND SUMMARIZATION
    Filatova, Elena
    [J]. INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS, 2012, 21 (04) : 383 - 403
  • [14] Wikipedia as Multilingual Source of Comparable Corpora
    Gamallo Otero, Pablo
    Gonzalez Lopez, Isaac
    [J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 21 - 25
  • [15] Multilingual Schema Matching for Wikipedia Infoboxes
    Thanh Nguyen
    Moreira, Viviane
    Huong Nguyen
    Hoa Nguyen
    Freire, Juliana
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2011, 5 (02): : 133 - 144
  • [16] Bipartite Editing Prediction in Wikipedia
    Chang, Yang-Jui
    Tsai, Yu-Chuan
    Kao, Hung-Yu
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2014, 30 (03) : 587 - 603
  • [17] Wikipedia editing history in DBpedia
    Gandon, Fabien
    Boyer, Raphael
    Corby, Olivier
    Monnin, Alexandre
    [J]. 2016 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2016), 2016, : 479 - 482
  • [18] EDITING IN A MULTILINGUAL ENVIRONMENT
    MARX, JM
    [J]. JOURNAL OF RESEARCH COMMUNICATION STUDIES, 1982, 3 (04): : 405 - 409
  • [19] Modeling Popularity and Reliability of Sources in Multilingual Wikipedia
    Lewoniewski, Wlodzimierz
    Wecel, Krzysztof
    Abramowicz, Witold
    [J]. INFORMATION, 2020, 11 (05)
  • [20] Effectively Mining Wikipedia for Clustering Multilingual Documents
    Kumar, N. Kiran
    Santosh, G. S. K.
    Varma, Vasudeva
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 2011, 6716 : 254 - 257