GrimoireLab: A toolset for software development analytics

Cited by: 16
Authors
Duenas, Santiago [1]
Cosentino, Valerio [1]
Gonzalez-Barahona, Jesus M. [2]
Felix, Alvaro del Castillo San [1]
Izquierdo-Cortazar, Daniel [1]
Canas-Diaz, Luis [1]
Garcia-Plaza, Alberto Perez [1]
Affiliations
[1] Bitergia, Madrid, Spain
[2] Univ Rey Juan Carlos, Escuela Super Ingn Telecomunicac, Madrid, Spain
Keywords
Mining software repositories; Empirical software engineering; Software development; Software analytics; Datasets; Toolset; Software development visualization; REPOSITORY
DOI
10.7717/peerj-cs.601
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Background: After many years of research on software repositories, the knowledge needed to build mature, reusable tools for data retrieval, storage and basic analytics is readily available. However, there is still room for improvement in reusable tools that implement this knowledge. Goal: To produce a reusable toolset that supports the most common tasks in retrieving, curating and visualizing data from software repositories, allows data sets ready for more complex analytics to be reproduced easily, and spares the researcher or analyst most of the tasks that can be automated. Method: Use our experience in building tools in this domain to identify a collection of scenarios where a reusable toolset would be convenient, and the main components of such a toolset. Then build those components, and refine them incrementally using feedback from their use in commercial, community-based, and academic environments. Results: GrimoireLab, an efficient toolset composed of five main components, supporting about 30 different kinds of data sources related to software development. It has been tested in many environments, for performing different kinds of studies and providing different kinds of services. It features a common API for accessing the retrieved data, facilities for relating items from different data sources, semi-structured storage that eases later analysis and reproduction, and basic facilities for visualization, preliminary analysis and drill-down in the data. It is also modular, making it easy to support new kinds of data sources and analysis. Conclusions: We present a mature toolset, widely tested in the field, that can help improve the availability of reusable tools for mining software repositories. We show some scenarios where it has already been used. We expect it to reduce the effort needed to carry out studies or provide services in this area, leading to advances in reproducibility and comparison of results.
Pages: 53