Automatic Traceability Maintenance via Machine Learning Classification

被引:33
|
作者
Mills, Chris [1 ]
Escobar-Avila, Javier [1 ]
Haiduc, Sonia [1 ]
机构
[1] Florida State Univ, Dept Comp Sci, Tallahassee, FL 32306 USA
关键词
REQUIREMENTS TRACEABILITY; RECOVERY; LINKS; CODE; IR;
D O I
10.1109/ICSME.2018.00045
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Previous studies have shown that software traceability, the ability to link together related artifacts from different sources within a project (e.g., source code, use cases, documentation, etc.), improves project outcomes by assisting developers and other stakeholders with common tasks such as impact analysis, concept location, etc. Establishing traceability links in a software system is an important and costly task, but only half the struggle. As the project undergoes maintenance and evolution, new artifacts are added and existing ones are changed, resulting in outdated traceability information. Therefore, specific steps need to be taken to make sure that traceability links are maintained in tandem with the rest of the project. In this paper we address this problem and propose a novel approach called TRAIL for maintaining traceability information in a system. The novelty of TRAIL stands in the fact that it leverages previously captured knowledge about project traceability to train a machine learning classifier which can then be used to derive new traceability links and update existing ones. We evaluated TRAIL on 11 commonly used traceability datasets from six software systems and compared it to seven popular Information Retrieval (IR) techniques including the most common approaches used in previous work. The results indicate that TRAIL outperforms all IR approaches in terms of precision, recall, and F-score.
引用
收藏
页码:369 / 380
页数:12
相关论文
共 50 条
  • [1] Automatic traceability link recovery via active learning
    Du, Tian-bao
    Shen, Guo-hua
    Huang, Zhi-qiu
    Yu, Yao-shen
    Wu, De-xiang
    [J]. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2020, 21 (08) : 1217 - 1225
  • [2] Automatic traceability link recovery via active learning
    Tian-bao Du
    Guo-hua Shen
    Zhi-qiu Huang
    Yao-shen Yu
    De-xiang Wu
    [J]. Frontiers of Information Technology & Electronic Engineering, 2020, 21 : 1217 - 1225
  • [3] A Machine Learning based Traceability Links Classification: A Preliminary Investigation
    Workneh, Hika
    Reddivari, Sandeep
    [J]. 2023 IEEE 47TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC, 2023, : 989 - 990
  • [4] Automatic classification of magnetocardiograms with the machine learning approach
    Fenici, R
    Brisinda, D
    Meloni, AM
    Fenici, P
    [J]. EUROPEAN HEART JOURNAL, 2004, 25 : 560 - 560
  • [5] Automatic flow classification using machine learning
    Anantavrasilp, Isara
    Schoeler, Thorsten
    [J]. SOFTCOM 2007: 15TH INTERNATIONAL CONFERENCE ON SOFTWARE, TELECOMMUNICATIONS AND COMPUTER NETWORKS, 2007, : 390 - +
  • [6] Automatic Vulnerability Classification Using Machine Learning
    Gawron, Marian
    Cheng, Feng
    Meinel, Christoph
    [J]. RISKS AND SECURITY OF INTERNET AND SYSTEMS, CRISIS 2017, 2018, 10694 : 3 - 17
  • [7] Automatic Classification for Vulnerability Based on Machine Learning
    Shuai, Bo
    Li, Haifeng
    Li, Mengjun
    Zhang, Quan
    Tang, Chaojing
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION (ICIA), 2013, : 312 - 318
  • [8] Towards the Automatic Classification of Traceability Links
    Mills, Chris
    [J]. PROCEEDINGS OF THE 2017 32ND IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE'17), 2017, : 1018 - 1021
  • [9] Motor Classification with Machine Learning Methods for Predictive Maintenance
    Kammerer, Christoph
    Gaust, Michael
    Kuestner, Micha
    Starke, Pascal
    Radtke, Roman
    Jesser, Alexander
    [J]. IFAC PAPERSONLINE, 2021, 54 (01): : 1059 - 1064
  • [10] Maintenance Requests Labeling Using Machine Learning Classification
    Lafi, Mohammed
    Hawashin, Bilal
    AlZu'bi, Shadi
    [J]. 2020 SEVENTH INTERNATIONAL CONFERENCE ON SOFTWARE DEFINED SYSTEMS (SDS), 2020, : 245 - 249