Automatic Quality Assessment of Content Created Collaboratively by Web Communities: A Case Study of Wikipedia

Authors
Dalip, Daniel Hasan [1]
Goncalves, Marcos Andre [1]
Cristo, Marco
Calado, Pavel
Affiliations
[1] Univ Fed Minas Gerais, Dept Comp Sci, Belo Horizonte, MG, Brazil
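Keywords
Quality Assessment; Wikipedia; Machine Learning; SVM
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract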
The old dream of a universal repository containing all human knowledge and culture is becoming possible through the Internet and the Web. Moreover, this is happening with the direct, collaborative participation of people. Wikipedia is a great example. It is an enormous repository of information with free access and editing, created by the community in a collaborative manner. However, this large amount of information, made available democratically and virtually without any control, raises questions about its relative quality. In this work we explore a significant number of quality indicators, some of them proposed by us and used here for the first time, and study their capability to assess the quality of Wikipedia articles. Furthermore, we explore machine learning techniques to combine these quality indicators into one single assessment judgment. Through experiments, we show that the most important quality indicators are the easiest ones to extract, namely, textual features related to length, structure and style. We were also able to determine which indicators did not contribute significantly to the quality assessment. These were, coincidentally, the most complex features, such as those based on link analysis. Finally, we compare our combination method with a state-of-the-art solution and show significant improvements in terms of effective quality prediction.
Pages: 295-304
Number of pages: 10