Estimating the number of remaining links in traceability recovery

被引:0
|
作者
Davide Falessi
Massimiliano Di Penta
Gerardo Canfora
Giovanni Cantone
机构
[1] California Polytechnic State University,Department of Computer Science
[2] University of Sannio,Department of Engineering
[3] University of Rome Tor Vergata,Department of Civil Engineering and Computer Science
[4] DICII,undefined
来源
关键词
Information retrieval; Traceability link recovery; Metrics and measurement;
D O I
暂无
中图分类号
学科分类号
摘要
Although very important in software engineering, establishing traceability links between software artifacts is extremely tedious, error-prone, and it requires significant effort. Even when approaches for automated traceability recovery exist, these provide the requirements analyst with a, usually very long, ranked list of candidate links that needs to be manually inspected. In this paper we introduce an approach called Estimation of the Number of Remaining Links (ENRL) which aims at estimating, via Machine Learning (ML) classifiers, the number of remaining positive links in a ranked list of candidate traceability links produced by a Natural Language Processing techniques-based recovery approach. We have evaluated the accuracy of the ENRL approach by considering several ML classifiers and NLP techniques on three datasets from industry and academia, and concerning traceability links among different kinds of software artifacts including requirements, use cases, design documents, source code, and test cases. Results from our study indicate that: (i) specific estimation models are able to provide accurate estimates of the number of remaining positive links; (ii) the estimation accuracy depends on the choice of the NLP technique, and (iii) univariate estimation models outperform multivariate ones.
引用
收藏
页码:996 / 1027
页数:31
相关论文
共 50 条
  • [1] Estimating the number of remaining links in traceability recovery
    Falessi, Davide
    Di Penta, Massimiliano
    Canfora, Gerardo
    Cantone, Giovanni
    EMPIRICAL SOFTWARE ENGINEERING, 2017, 22 (03) : 996 - 1027
  • [2] Estimating the Number of Remaining Links in Traceability Recovery (Journal-First Abstract)
    Falessi, Davide
    Di Penta, Massimiliano
    Canfora, Gerardo
    Cantone, Giovanni
    PROCEEDINGS OF THE 2018 33RD IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMTED SOFTWARE ENGINEERING (ASE' 18), 2018, : 953 - 953
  • [3] On estimating the number of defects remaining in software
    Cai, KY
    JOURNAL OF SYSTEMS AND SOFTWARE, 1998, 40 (02) : 93 - 114
  • [4] Semantic Recovery of Traceability Links between System Artifacts
    Maria Alvarez-Rodriguez, Jose
    Mendieta, Roy
    Moreno, Valentin
    Sanchez-Puebla, Miguel
    Llorens, Juan
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2020, 30 (10) : 1415 - 1442
  • [5] Recovery of traceability links between software documentation and source code
    Marcus, A
    Maletic, JI
    Sergeyev, A
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2005, 15 (05) : 811 - 836
  • [6] Leveraging execution traces to enhance traceability links recovery in BPMN models
    Lapena, Raul
    Perez, Francisca
    Pastor, Oscar
    Cetina, Carlos
    INFORMATION AND SOFTWARE TECHNOLOGY, 2022, 146
  • [7] Ontological approach for the semantic recovery of traceability links between software artefacts
    Zhang, Y.
    Witte, R.
    Rilling, J.
    Haarslev, V.
    IET SOFTWARE, 2008, 2 (03) : 185 - 203
  • [8] A Literature Review of Automatic Traceability Links Recovery for Software Change Impact Analysis
    Aung, Thazin Win Win
    Huo, Huan
    Sui, Yulei
    2020 IEEE/ACM 28TH INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION, ICPC, 2020, : 14 - 24
  • [9] Improving Traceability Links Recovery in Process Models Through an Ontological Expansion of Requirements
    Lapena, Raul
    Perez, Francisca
    Cetina, Carlos
    Pastor, Oscar
    ADVANCED INFORMATION SYSTEMS ENGINEERING (CAISE 2019), 2019, 11483 : 261 - 275
  • [10] Automated Recovery and Visualization of Test-to-Code Traceability (TCT) Links: An Evaluation
    Aljawabrah, Nadera
    Gergely, Tamas
    Misra, Sanjay
    Fernandez-Sanz, Luis
    IEEE ACCESS, 2021, 9 : 40111 - 40123