Mining Software Repositories with a Collaborative Heuristic Repository

Cited by: 2
Authors
Babii, Hlib [1]
Prenner, Julian Aron [1]
Stricker, Laurin [1]
Karmakar, Anjan [1]
Janes, Andrea [1]
Robbes, Romain [1]
Affiliations
[1] Free Univ Bozen Bolzano, Bolzano, Italy
DOI
10.1109/ICSE-NIER52604.2021.00030
Chinese Library Classification (CLC) number
TP31 [Computer software];
Subject classification code
081202 ; 0835 ;
Abstract
Many software engineering studies or tasks rely on categorizing software engineering artifacts. In practice, this is done either by defining simple but often imprecise heuristics, or by manual labelling of the artifacts. Unfortunately, errors in these categorizations impact the tasks that rely on them. To improve the precision of these categorizations, we propose to gather heuristics in a collaborative heuristic repository, to which researchers can contribute a large number of diverse heuristics for a variety of tasks on a variety of SE artifacts. These heuristics are then leveraged by state-of-the-art weak supervision techniques to train high-quality classifiers, thus improving the categorizations. We present an initial version of the heuristic repository, which we applied to the concrete task of commit classification.
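The idea in the abstract, combining many imprecise heuristics into labels for commit classification, can be illustrated with a minimal sketch. This is not the authors' repository or their weak supervision pipeline: the heuristic functions, label names, and keyword lists below are hypothetical, and a plain majority vote stands in for the state-of-the-art label models the abstract refers to.

```python
from collections import Counter
from typing import Callable, List, Optional

# Hypothetical label names; the paper's actual label set is not given here.
BUGFIX = "bugfix"
NON_BUGFIX = "non_bugfix"
ABSTAIN = None  # a heuristic returns None when it has no opinion

Heuristic = Callable[[str], Optional[str]]


def mentions_fix(message: str) -> Optional[str]:
    """Vote BUGFIX if any word starts with 'fix' or 'bug'."""
    words = message.lower().split()
    if any(w.startswith(("fix", "bug")) for w in words):
        return BUGFIX
    return ABSTAIN


def mentions_docs(message: str) -> Optional[str]:
    """Vote NON_BUGFIX for documentation-looking commits."""
    text = message.lower()
    if "readme" in text or "typo" in text or "doc" in text:
        return NON_BUGFIX
    return ABSTAIN


def mentions_feature(message: str) -> Optional[str]:
    """Vote NON_BUGFIX if the message opens with a feature verb."""
    if message.lower().startswith(("add ", "implement ")):
        return NON_BUGFIX
    return ABSTAIN


# Illustrative stand-in for a shared pool of contributed heuristics.
HEURISTICS: List[Heuristic] = [mentions_fix, mentions_docs, mentions_feature]


def majority_vote(message: str,
                  heuristics: List[Heuristic] = HEURISTICS) -> Optional[str]:
    """Combine heuristic votes; abstain when there are no votes or a tie."""
    votes = [v for v in (h(message) for h in heuristics) if v is not ABSTAIN]
    if not votes:
        return ABSTAIN
    ranked = Counter(votes).most_common()
    if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
        return ABSTAIN  # conflicting heuristics with equal support
    return ranked[0][0]
```

Labels produced this way are noisy. In a weak supervision setup they would serve as training targets for a classifier rather than as final categorizations, and a learned label model would typically weight heuristics by their estimated accuracy instead of counting votes equally.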
Pages: 106-110
Number of pages: 5
Related papers
50 in total
  • [1] Mining software repositories
    [J]. 1600, Japan Society for Software Science and Technology (30):
  • [2] Mining Individual Performance Indicators in Collaborative Development Using Software Repositories
    Zhang, Shen
    Wang, Yongji
    Xiao, Junchao
    [J]. APSEC 2008:15TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE, PROCEEDINGS, 2008, : 247 - 254
  • [3] Mining Open Software Repositories
    Alonso Abad, Jesus
    Lopez Nozal, Carlos
    Maudes Raedo, Jesus M.
    [J]. ERCIM NEWS, 2014, (99): : 23 - 24
  • [4] Ethics in the mining of software repositories
    Nicolas E. Gold
    Jens Krinke
    [J]. Empirical Software Engineering, 2022, 27
  • [5] A Survey on Mining Software Repositories
    Jung, Woosung
    Lee, Eunjoo
    Wu, Chisu
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (05): : 1384 - 1406
  • [6] Ethics in the mining of software repositories
    Gold, Nicolas E.
    Krinke, Jens
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2022, 27 (01)
  • [7] Tools in Mining Software Repositories
    Chaturvedi, K. K.
    Singh, V. B.
    Singh, Prashast
    [J]. PROCEEDINGS OF THE 2013 13TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ITS APPLICATIONS (ICCSA 2013), 2013, : 89 - 98
  • [8] Process mining software repositories
    Poncin, Wouter
    Serebrenik, Alexander
    van den Brand, Mark
    [J]. 2011 15TH EUROPEAN CONFERENCE ON SOFTWARE MAINTENANCE AND REENGINEERING (CSMR), 2011, : 5 - 13
  • [9] HEURISTIC ORGANIZATION AND DOMAIN ANALYSIS OF SOFTWARE REPOSITORIES
    TOPTSIS, AA
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 1995, 5 (02) : 193 - 210
  • [10] Creating and Analyzing Source Code Repository Models A Model-based Approach to Mining Software Repositories
    Scheidgen, Markus
    Smidt, Martin
    Fischer, Joachim
    [J]. MODELSWARD: PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON MODEL-DRIVEN ENGINEERING AND SOFTWARE DEVELOPMENT, 2017, : 329 - 336