Library adoption in public software repositories

Authors
Rachel Krohn
Tim Weninger
Affiliation
[1] University of Notre Dame, Department of Computer Science and Engineering
Source
Journal of Big Data, 2019, 6 (01)
Keywords
Information adoption; Software libraries; GitHub; Python; StackOverflow; Classification; SVM; Modelling; Git; Repository; Commit; Software development; Cognitive science; Text mining
DOI
Not available
CLC classification number
Subject classification number
Abstract
We study the spread and adoption of libraries within Python projects hosted in public software repositories on GitHub. By modelling the use of Git pull, merge, commit, and other actions as deliberate cognitive activities, we are able to better understand the dynamics of what happens when users adopt new and cognitively demanding information. For this task we introduce a large corpus containing all commits, diffs, messages, and source code from 259,690 Python repositories (about 13% of all Python projects on GitHub), including all Git activity data from 89,311 contributing users. In this initial work we ask two primary questions: (1) What kind of behavior change occurs near an adoption event? (2) Can we model the future adoption activity of a user? Using a fine-grained analysis of user behavior, we show that library adoptions are followed by higher-than-normal activity within the first 6 h, implying that an adoption involves higher-than-normal cognitive effort. Further study is needed to understand the specific types of events that surround the adoption of new information, and the cause of these dynamics. We also show that a simple linear model is capable of classifying future commits as adoptions or non-adoptions, based on the commit contents and the preceding history of the user and repository. Additional work in this vein may be able to predict the content of future commits, or suggest new libraries to users.
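The abstract does not define an adoption event operationally. As a rough illustration only, the sketch below assumes that an adoption is the first commit in which a given user adds an import of a given top-level library; the paper's own definition may differ. The function names `imported_libraries` and `find_adoptions`, and the `(timestamp, user, added_lines)` commit layout, are hypothetical and not taken from the paper.

```python
import re
from collections import defaultdict

# Matches "import x", "import x.y as z" and "from x.y import z".
# Simplification: for "import a, b" only "a" is captured, and
# relative imports such as "from . import x" are ignored.
IMPORT_RE = re.compile(r"^\s*(?:import|from)\s+([A-Za-z_][A-Za-z0-9_]*)")


def imported_libraries(added_lines):
    """Return the top-level library names imported in the added lines of a diff."""
    libs = set()
    for line in added_lines:
        match = IMPORT_RE.match(line)
        if match:
            libs.add(match.group(1))
    return libs


def find_adoptions(commits):
    """Return (timestamp, user, library) for each first-time import by a user.

    `commits` is assumed to be an iterable of (timestamp, user, added_lines)
    tuples, already sorted by timestamp.
    """
    seen = defaultdict(set)  # user -> libraries that user has already imported
    events = []
    for timestamp, user, added_lines in commits:
        libs = imported_libraries(added_lines)
        for lib in sorted(libs - seen[user]):
            events.append((timestamp, user, lib))
        seen[user] |= libs
    return events


# Toy, made-up commit history for illustration.
commits = [
    (1, "alice", ["import numpy as np", "x = 1"]),
    (2, "alice", ["import numpy", "from pandas.core import frame"]),
    (3, "bob", ["import numpy"]),
]
adoptions = find_adoptions(commits)
```

Events found this way could serve as positive labels for a linear classifier over commit features, in the spirit of the second question the abstract poses.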
Related papers
50 in total
  • [1] Library adoption in public software repositories
    Krohn, Rachel
    Weninger, Tim
    [J]. JOURNAL OF BIG DATA, 2019, 6 (01)
  • [2] Mining Software Repositories to Identify Library Experts
    Santos, Adriano
    Souza, Mauricio
    Oliveira, Johnatan
    Figueiredo, Eduardo
    [J]. XII BRAZILIAN SYMPOSIUM ON SOFTWARE COMPONENTS, ARCHITECTURES, AND REUSE (SBCARS), 2018, : 83 - 91
  • [3] The promise of public software engineering data repositories
    Cukic, B
    [J]. IEEE SOFTWARE, 2005, 22 (06) : 20 - 22
  • [4] Repositories with public data about software development
    Gonzalez-Barahona, Jesus M.
    Izquierdo-Cortazar, Daniel
    Squire, Megan
    [J]. International Journal of Open Source Software and Processes, 2010, 2 (02) : 1 - 13
  • [5] Adoption of Electronic Knowledge Repositories: Influencing Factors in the Indian Software Industry
    Chugh, Mitali
    Upadhayay, Rajesh Kumar
    [J]. JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2024, 23 (04)
  • [6] Open source software adoption for development of institutional repositories in university libraries of Islamabad
    Khan, Muhammad Kabir
    Sheikh, Arslan
    [J]. INFORMATION DISCOVERY AND DELIVERY, 2023, 51 (01) : 47 - 55
  • [7] API trustworthiness: an ontological approach for software library adoption
    Eghan, Ellis E.
    Alqahtani, Sultan S.
    Forbes, Christopher
    Rilling, Juergen
    [J]. SOFTWARE QUALITY JOURNAL, 2019, 27 (03) : 969 - 1014
  • [8] API trustworthiness: an ontological approach for software library adoption
    Ellis E. Eghan
    Sultan S. Alqahtani
    Christopher Forbes
    Juergen Rilling
    [J]. Software Quality Journal, 2019, 27 : 969 - 1014
  • [9] Adoption of Public Library 2.0: Librarians' and teens' perspective
    Abidin, M. Ismail
    Kiran, K.
    Abrizah, A.
    [J]. MALAYSIAN JOURNAL OF LIBRARY & INFORMATION SCIENCE, 2013, 18 (03) : 75 - 90
  • [10] Mining software repositories
    [J]. 1600, Japan Society for Software Science and Technology (30):