Library adoption in public software repositories

被引:0
|
作者
Rachel Krohn
Tim Weninger
机构
[1] University of Notre Dame,Department of Computer Science and Engineering
来源
关键词
Information adoption; Software libraries; GitHub; Python; StackOverflow; Classification; SVM; Modelling; Git; Repository; Commit; Software development; Cognitive science; Text mining;
D O I
暂无
中图分类号
学科分类号
摘要
We study the the spread and adoption of libraries within Python projects hosted in public software repositories on GitHub. By modelling the use of Git pull, merge, commit, and other actions as deliberate cognitive activities, we are able to better understand the dynamics of what happens when users adopt new and cognitively demanding information. For this task we introduce a large corpus containing all commits, diffs, messages, and source code from 259,690 Python repositories (about 13% of all Python projects on Github), including all Git activity data from 89,311 contributing users. In this initial work we ask two primary questions: (1) What kind of behavior change occurs near an adoption event? (2) Can we model future adoption activity of a user? Using a fine-grained analysis of user behavior, we show that library adoptions are followed by higher than normal activity within the first 6 h, implying that a higher than normal cognitive effort is involved with an adoption. Further study is needed to understand the specific types of events that surround the adoption of new information, and the cause of these dynamics. We also show that a simple linear model is capable of classifying future commits as being an adoption or not, based on the commit contents and the preceding history of the user and repository. Additional work in this vein may be able to predict the content of future commits, or suggest new libraries to users.
引用
收藏
相关论文
共 50 条
  • [31] Innovation and the library: The adoption of new ideas in public libraries - Pungitore,VL
    Schroeder, AT
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1996, 47 (06): : 479 - 479
  • [32] Innovation and the library: The adoption of new ideas in public libraries - Pungitore,VL
    Salter, JL
    [J]. RQ, 1996, 35 (03): : 440 - 441
  • [33] Towards Library 2.0: The Adoption of Web 2.0 Technologies in Public Libraries
    Anttiroiko, Ari-Veikko
    Savolainen, Reijo
    [J]. LIBRI, 2011, 61 (02): : 87 - 99
  • [34] Innovation and the library: The adoption of new ideas in public libraries - Pungitore,VL
    Altman, E
    [J]. LIBRARY QUARTERLY, 1996, 66 (03): : 321 - 322
  • [35] A Generalized Model for Visualizing Library Popularity, Adoption, and Diffusion within a Software Ecosystem
    Kula, Raula Gaikovina
    De Roover, Coen
    German, Daniel M.
    Ishio, Takashi
    Inoue, Katsuro
    [J]. 2018 25TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION AND REENGINEERING (SANER 2018), 2018, : 288 - 299
  • [36] University repositories: An extension of the library cooperative
    Anon
    [J]. OCLC Newsletter, 2003, (261):
  • [37] Software for digital repositories: evaluation for the management and preservation of bibliographic materials and archival documents of the National Library of Brazil
    da Silva, Wellington
    Antonio Merlo-Vega, Jose
    [J]. REVISTA IBERO-AMERICANA DE CIENCIA DA INFORMACAO, 2020, 13 (01): : 301 - 315
  • [38] Software Process Simulation based on Mining Software Repositories
    Honsel, Verena
    Honsel, Daniel
    Grabowski, Jens
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2014, : 828 - 831
  • [39] Qualitative Clustering of Software Repositories Based on Software Metrics
    Bugayenko, Yegor
    Daniakin, Kirill
    Farina, Mirko
    Kholmatova, Zamira
    Kruglov, Artem
    Pedrycz, Witold
    Succi, Giancarlo
    [J]. IEEE ACCESS, 2023, 11 : 14716 - 14727
  • [40] Comparison of software repositories for their usability in software process reconstruction
    Jankovic, Marko
    Bajec, Marko
    [J]. 2015 IEEE 9TH INTERNATIONAL CONFERENCE ON RESEARCH CHALLENGES IN INFORMATION SCIENCE (RCIS), 2015, : 298 - 308