Mining the History Sections of Wikipedia Articles on Science and Technology

被引:0
|
作者
Kircheis, Wolfgang [1 ,2 ]
Schmidt, Marion [3 ]
Simons, Arno [4 ]
Stein, Benno [5 ]
Potthast, Martin [1 ,2 ]
机构
[1] Univ Leipzig, Leipzig, Germany
[2] ScaDS AI, Leipzig, Germany
[3] DZHW, Hannover, Germany
[4] Tech Univ Berlin, Berlin, Germany
[5] Bauhaus Univ Weimar, Weimar, Germany
关键词
Wikipedia; Science Studies; Priority Disputes; Science; Technology; Science and Technology; Innovation;
D O I
10.1109/JCDL57899.2023.00037
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Priority conflicts and the attribution of contributions to important scientific breakthroughs to individuals and groups play an important role in science, its governance, and evaluation. Debates and dynamics around these processes are analyzed by science studies. Our objective is to transform Wikipedia into an accessible, traceable primary source for analyzing such debates. In this paper, we introduce Webis-WikiSciTech-23, a new corpus consisting of science and technology Wikipedia articles, focusing on the identification of their history sections. We extract such articles from Wikipedia dumps through iterative filtering of the category network. The identification of passages covering the historical development of innovations is achieved by combining heuristics for section heading analysis and classifiers trained on a ground truth of articles with designated history sections.
引用
收藏
页码:200 / 204
页数:5
相关论文
共 50 条
  • [1] SEMI-AUTOMATIC GENERATION OF A CORPUS OF WIKIPEDIA ARTICLES ON SCIENCE AND TECHNOLOGY
    Minguillon, Julia
    Lerga, Maura
    Aibar, Eduard
    Llados-Masllorens, Josep
    Meseguer-Artola, Antoni
    PROFESIONAL DE LA INFORMACION, 2017, 26 (05): : 995 - 1004
  • [2] Generating Quizzes for History Learning Based on Wikipedia Articles
    Tamura, Yoshihiro
    Takase, Yutaka
    Hayashi, Yuki
    Nakano, Yukiko I.
    LEARNING AND COLLABORATION TECHNOLOGIES, LCT 2015, 2015, 9192 : 337 - 346
  • [3] Articles on the history of Medicine and Science
    Fischer, Klaus-Dietrich
    NTM, 2009, 17 (03): : 353 - 354
  • [5] Quality Evaluation of Wikipedia Articles through Edit History and Editor Groups
    Wang, Se
    Iwaihara, Mizuho
    WEB TECHNOLOGIES AND APPLICATIONS, 2011, 6612 : 188 - 199
  • [6] Improving Wikipedia's Credibility: References and Citations in a Sample of History Articles
    Luyt, Brendan
    Tan, Daniel
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2010, 61 (04): : 715 - 722
  • [7] Improving wikipedia's credibility: References and citations in a sample of history articles
    Wee Kim Wee School of Communication and Information, Nanyang Technological University, Singapore
    J. Am. Soc. Inf. Sci. Technol., 4 (715-722):
  • [9] INFORMATION UNIQUENESS IN WIKIPEDIA ARTICLES
    Kirtsis, Nikos
    Stamou, Sofia
    Tzekou, Paraskevi
    Zotos, Nikos
    WEBIST 2010: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGY, VOL 2, 2010, : 137 - 143
  • [10] Societal Controversies in Wikipedia Articles
    Borra, Erik
    Kaltenbrunner, Andreas
    Mauri, Michele
    Weltevrede, Esther
    Laniado, David
    Rogers, Richard
    Ciuccarelli, Paolo
    Magni, Giovanni
    Venturini, Tommaso
    CHI 2015: PROCEEDINGS OF THE 33RD ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2015, : 193 - 196