Estimating the number of remaining links in traceability recovery

被引:0
|
作者
Davide Falessi
Massimiliano Di Penta
Gerardo Canfora
Giovanni Cantone
机构
[1] California Polytechnic State University,Department of Computer Science
[2] University of Sannio,Department of Engineering
[3] University of Rome Tor Vergata,Department of Civil Engineering and Computer Science
[4] DICII,undefined
来源
关键词
Information retrieval; Traceability link recovery; Metrics and measurement;
D O I
暂无
中图分类号
学科分类号
摘要
Although very important in software engineering, establishing traceability links between software artifacts is extremely tedious, error-prone, and it requires significant effort. Even when approaches for automated traceability recovery exist, these provide the requirements analyst with a, usually very long, ranked list of candidate links that needs to be manually inspected. In this paper we introduce an approach called Estimation of the Number of Remaining Links (ENRL) which aims at estimating, via Machine Learning (ML) classifiers, the number of remaining positive links in a ranked list of candidate traceability links produced by a Natural Language Processing techniques-based recovery approach. We have evaluated the accuracy of the ENRL approach by considering several ML classifiers and NLP techniques on three datasets from industry and academia, and concerning traceability links among different kinds of software artifacts including requirements, use cases, design documents, source code, and test cases. Results from our study indicate that: (i) specific estimation models are able to provide accurate estimates of the number of remaining positive links; (ii) the estimation accuracy depends on the choice of the NLP technique, and (iii) univariate estimation models outperform multivariate ones.
引用
收藏
页码:996 / 1027
页数:31
相关论文
共 50 条
  • [21] TRIAD: Automated Traceability Recovery based on Biterm-enhanced Deduction of Transitive Links among Artifacts
    Gao, Hui
    Kuang, Hongyu
    Assuncao, Wesley K. G.
    Mayr-Dorn, Christoph
    Rong, Guoping
    Zhang, He
    Ma, Xiaoxing
    Egyed, Alexander
    Proceedings - International Conference on Software Engineering, 2024, : 2668 - 2680
  • [22] Domination number and traceability of graphs
    Li, Rao
    DISCRETE MATHEMATICS LETTERS, 2020, 4 : 27 - 30
  • [23] Establishing Traceability Links among Software Artefacts
    Wijesinghe, Diunuge B.
    Kamalabalan, Karthigesu
    Uruththirakodeeswaran, Thanuj A.
    Thiyagalingam, Gitanjali
    Perera, Indika
    Meedeniya, Dulani
    14TH INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER) 2014, 2014, : 55 - 62
  • [24] Traceability Recovery for Innovation Processes
    Beyhl, Thomas
    Giese, Holger
    2015 IEEE/ACM 8TH INTERNATIONAL SYMPOSIUM ON SOFTWARE AND SYSTEMS TRACEABILITY, 2015, : 22 - 28
  • [25] Recovering traceability links in multilingual Web sites
    Tonella, P
    Ricca, F
    Pianta, E
    Girardi, C
    WSE 2001: 3RD INTERNATIONAL WORKSHOP ON WEB SITE EVOLUTION, 2001, : 14 - 21
  • [26] Optimizing design for variability using traceability links
    Riebisch, Matthias
    Brcina, Robert
    FIFTEENTH IEEE INTERNATIONAL CONFERENCE AND WORKSHOPS ON THE ENGINEERING OF COMPUTER-BASED SYSTEMS, PROCEEDINGS, 2008, : 235 - 244
  • [27] A Study on the Effect of Traceability Links in Software Maintenance
    Jaber, Khaled
    Sharif, Bonita
    Liu, Chang
    IEEE ACCESS, 2013, 1 : 726 - 741
  • [28] Recovering traceability links between code and documentation
    Antoniol, G
    Canfora, G
    Casazza, G
    De Lucia, A
    Merlo, E
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2002, 28 (10) : 970 - 983
  • [29] An Improved Approach to the Recovery of Traceability Links between Requirement Documents and Source Codes Based on Latent Semantic Indexing
    Shao, Jianwei
    Wu, Wei
    Geng, Peng
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2013, PT V, 2013, 7975 : 547 - 557
  • [30] Estimating the Number of Substance Use Disorder Recovery Homes in the United States
    Jason, Leonard A.
    Wiedbusch, Elzbieta
    Bobak, Ted J.
    Taullahu, David
    ALCOHOLISM TREATMENT QUARTERLY, 2020, 38 (04) : 506 - 514