Continuously mining distributed version control systems: an empirical study of how Linux uses Git

被引:21
|
作者
German, Daniel M. [1 ]
Adams, Bram [2 ]
Hassan, Ahmed E. [3 ]
机构
[1] Univ Victoria, Victoria, BC, Canada
[2] Polytech Montreal, Montreal, PQ, Canada
[3] Queens Univ, Kingston, ON, Canada
关键词
Mining software repositories; Distributed version control; Rebasing; Empirical software engineering; Measuring bias; Linux; Open source development;
D O I
10.1007/s10664-014-9356-2
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Distributed version control systems (D-VCSs - such as git and mercurial) and their hosting services (such as Github and Bitbucket) have revolutionalized the way in which developers collaborate by allowing them to freely exchange and integrate code changes in a peer-to-peer fashion. However, this flexibility comes at a price: code changes are hard to track because of the proliferation of code repositories and because developers modify ("rebase") and filter ("cherry-pick") the history of these changes to streamline their integration into the repositories of other developers. As a consequence, researchers and practitioners, who typically only consider the (cleaned up) history in the official project repository, are unaware of important elements and activities in the collaborative software development process. In this paper, we present a method that continuously mines all known D-VCSs of a software project to uncover the complete development history of a project. We use this method to (1) show the divergence between the code development history in the official Linux kernel repository and the complete kernel development history, and (2) to investigate the characteristics of the ecosystem of git repositories of the Linux kernel. Finally, we discuss how continuous mining could be adopted by current D-VCS hosting services.
引用
收藏
页码:260 / 299
页数:40
相关论文
共 50 条
  • [31] Empirical Study of PLC Authentication Protocols in Industrial Control Systems
    Ayub, Adeen
    Yoo, Hyunguk
    Ahmed, Irfan
    2021 IEEE SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS (SPW 2021), 2021, : 383 - 397
  • [32] An empirical study of control parameters for the third version of Generalized Differential Evolution (GDE3)
    Kukkonen, Saku
    Lampinen, Jouni
    2006 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-6, 2006, : 1987 - +
  • [33] An Empirical Study on Data Retrievability in Decentralized Erasure Code Based Distributed Storage Systems
    Lin, Hsiao-Ying
    Tung, Li-Ping
    Lin, Bao-Shuh P.
    2013 IEEE 7TH INTERNATIONAL CONFERENCE ON SOFTWARE SECURITY AND RELIABILITY (SERE), 2013, : 30 - 39
  • [34] HOW TO COMPARE DISTRIBUTED CONTROL-SYSTEMS USING COMPUTER-BASED ANALYSIS
    REMEDIOS, AM
    PULP & PAPER-CANADA, 1987, 88 (10) : 17 - 21
  • [35] Control of Distributed Autonomous Systems - How to Get a Swarm to Succeed over an Overwhelming Challenge
    von Stackelberg, Josef
    AUTONOMOUS SYSTEMS: DEVELOPMENTS AND TRENDS, 2011, 391 : 41 - 48
  • [36] How technological constraints could influence competitiveness? An empirical study of avionics systems
    Beaugency, Aurelie
    Gatti, Marc
    Regis, Didier
    2014 IEEE/AIAA 33RD DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC), 2014,
  • [37] How integrated are environmental, quality and other standardized management systems? An empirical study
    Bernardo, Merce
    Casadesus, Marti
    Karapetrovic, Stanislav
    Heras, Inaki
    JOURNAL OF CLEANER PRODUCTION, 2009, 17 (08) : 742 - 750
  • [38] DISTRIBUTED CONTROL FOR SWITCHING SYSTEMS - A SYSTEMS STUDY TO IMPROVE COST AND MAINTAINABILITY OF SWITCHING SYSTEMS.
    Gueldenpfenning, Klaus
    1975, : 6 - 12
  • [39] HOW TECHNOLOGICAL CONSTRAINTS COULD INFLUENCE COMPETITIVENESS? AN EMPIRICAL STUDY OF AVIONICS SYSTEMS
    Beaugency, Aurelie
    Gatti, Marc
    Regis, Didier
    2014 IEEE/AIAA 33RD DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC), 2014,
  • [40] How integrated are multiple Management Systems? An empirical study of construction firms in China
    Cai Gai
    Fu Chun
    Zhou Qing-Zhu
    PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON SUSTAINABLE CONSTRUCTION & RISK MANAGEMENT, VOLS I AND II, 2010, : 1096 - 1100