PyDriller: Python Framework for Mining Software Repositories

Cited by: 173
Authors
Spadini, Davide [1]
Aniche, Maurício [2]
Bacchelli, Alberto [3]
Affiliations
[1] Delft Univ Technol, Software Improvement Grp, Delft, Netherlands
[2] Delft Univ Technol, Delft, Netherlands
[3] Univ Zurich, Zurich, Switzerland
Funding
Swiss National Science Foundation; European Union Horizon 2020
Keywords
Mining Software Repositories; GitPython; Git; Python
DOI
10.1145/3236024.3264598
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
Software repositories contain valuable historical information about the overall development of software systems. Mining software repositories (MSR) is nowadays considered one of the most interesting growing fields within software engineering. MSR focuses on extracting and analyzing data available in software repositories to uncover interesting, useful, and actionable information about the system. Even though MSR plays an important role in software engineering research, few tools have been created and made public to support developers in extracting information from Git repositories. In this paper, we present PyDriller, a Python framework that eases the process of mining Git. We compare our tool against the state-of-the-art Python framework GitPython, demonstrating that PyDriller can achieve the same results with, on average, 50% less LOC and significantly lower complexity.
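To give a feel for the kind of mining task the abstract describes, the following is a minimal sketch, not code taken from the paper. It assumes the PyDriller 2.x API, in which `Repository(path).traverse_commits()` yields commits that expose `modified_files` (the 1.x releases contemporary with the paper named these `RepositoryMining` and `modifications`). The helper names `count_file_changes` and `mine`, and the stand-in history, are hypothetical and exist only for illustration.

```python
from collections import Counter
from types import SimpleNamespace


def count_file_changes(commits):
    """Count how many commits touched each file name.

    `commits` is any iterable of objects exposing `modified_files`,
    each item having a `filename` attribute (the shape assumed here
    for PyDriller 2.x commit objects).
    """
    touched = Counter()
    for commit in commits:
        for modified in commit.modified_files:
            touched[modified.filename] += 1
    return touched


def mine(repo_path):
    """Hypothetical wrapper: run the count over a real Git repository."""
    # Imported lazily so the helper above works without PyDriller installed.
    from pydriller import Repository

    return count_file_changes(Repository(repo_path).traverse_commits())


# Stand-in history so the helper can be tried without a repository on disk.
fake_history = [
    SimpleNamespace(modified_files=[SimpleNamespace(filename="a.py"),
                                    SimpleNamespace(filename="b.py")]),
    SimpleNamespace(modified_files=[SimpleNamespace(filename="a.py")]),
]
demo_counts = count_file_changes(fake_history)
```

With PyDriller installed, `mine("path/to/repo")` would run the same count over an actual repository; the path is a placeholder.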
Pages: 908-911
Page count: 4