A multivariate classification of open source developers

被引:36
|
作者
di Bella, Enrico [1 ]
Sillitti, Alberto [2 ]
Succi, Giancarlo [2 ]
机构
[1] Univ Genoa, Fac Econ, Genoa, Italy
[2] Libera Univ Bolzano, Fac Comp Sci, I-39100 Bolzano, Italy
关键词
Open source; Development process; Empirical studies; Software metrics;
D O I
10.1016/j.ins.2012.09.031
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Open source software development is becoming always more relevant. Understanding the behavior of developers in open source software projects and identifying the kinds of their contributions is an essential step to improve the efficiency of the development process and to organize the development teams more effectively. Moreover, understanding the level of participation of the different developers helps to understand which members of the development team are more important than others and who are the actual key developers. This paper investigates the behavior of open source developers and the structure of the development of open source projects through the analysis of a very large dataset: 10 well-known and widely used open source software projects for a total of more than 4 MLOC (millions of lines of code) modified distributed in more than 200 K versions. This study builds on the top of other studies in this area applying a set of rigorous statistical techniques, analyzing how developers contribute to the projects. Its novelty is in the fine gain analysis of the developers that have commit rights on the repository of the project they work on, in the automated identification of key contributors of the project, in the size of the analyzed datasets, and in the statistical techniques used to classify the behavior of the developers in an automated way. To collect such large volume of data and to ensure their integrity, a tool to automatically mine open source version control systems has been used. The main result of this study is the identification of a recurrent pattern of four kinds of contributors with the same characteristics in all the projects analyzed even if the projects are very different in domain, size, language, etc. (C) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:72 / 83
页数:12
相关论文
共 50 条
  • [21] What do developers talk about open source software licensing?
    Kapitsaki, Georgia M.
    Papoutsoglou, Maria
    German, Daniel M.
    Angelis, Lefteris
    [J]. 2020 46TH EUROMICRO CONFERENCE ON SOFTWARE ENGINEERING AND ADVANCED APPLICATIONS (SEAA 2020), 2020, : 72 - 79
  • [22] The characteristics and motivations of library open source software developers: An empirical study
    Choi, Namjoo
    Pruett, Joseph A.
    [J]. LIBRARY & INFORMATION SCIENCE RESEARCH, 2015, 37 (02) : 109 - 117
  • [23] CVExplorer: Identifying Candidate Developers by Mining and Exploring Their Open Source Contributions
    Greene, Gillian J.
    Fischer, Bernd
    [J]. 2016 31ST IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE), 2016, : 804 - 809
  • [24] Trust and involvement of Cameroonian software developers in open-source projects
    University of Douala, Cameroon to University of Douala, BP, Douala
    2701, Cameroon
    不详
    78180, France
    [J]. Data Sci. Manag., 2024, 4 (332-339): : 332 - 339
  • [25] How Peripheral Developers Contribute to Open-Source Software Development
    Setia, Pankaj
    Rajagopalan, Balaji
    Sambamurthy, Vallabh
    Calantone, Roger
    [J]. INFORMATION SYSTEMS RESEARCH, 2012, 23 (01) : 144 - 163
  • [26] An empirical analysis of open source software developers' motivations and continuance intentions
    Wu, Chorng-Guang
    Gerlach, James H.
    Young, Clifford E.
    [J]. INFORMATION & MANAGEMENT, 2007, 44 (03) : 253 - 262
  • [27] Why Do Developers Reject Refactorings in Open-Source Projects?
    Pantiuchina, Jevgenija
    Lin, Bin
    Zampetti, Fiorella
    Di Penta, Massimiliano
    Lanza, Michele
    Bavota, Gabriele
    [J]. ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2022, 31 (02)
  • [28] An analysis of the evolution of developers' role in open-source software community
    Wang, Wenjuan
    Li, Bing
    He, Peng
    [J]. Complex Systems and Complexity Science, 2015, 12 (01) : 1 - 7
  • [29] Identification and Mitigation of Toxic Communications Among Open Source Software Developers
    Sarker, Jaydeb
    [J]. PROCEEDINGS OF THE 37TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE 2022, 2022,
  • [30] An empirical study of the first contributions of developers to open source projects on GitHub
    Subramanian, Vikram N.
    [J]. 2020 ACM/IEEE 42ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: COMPANION PROCEEDINGS (ICSE-COMPANION 2020), 2020, : 116 - 118