Estimating the number of remaining links in traceability recovery

被引:0
|
作者
Davide Falessi
Massimiliano Di Penta
Gerardo Canfora
Giovanni Cantone
机构
[1] California Polytechnic State University,Department of Computer Science
[2] University of Sannio,Department of Engineering
[3] University of Rome Tor Vergata,Department of Civil Engineering and Computer Science
[4] DICII,undefined
来源
关键词
Information retrieval; Traceability link recovery; Metrics and measurement;
D O I
暂无
中图分类号
学科分类号
摘要
Although very important in software engineering, establishing traceability links between software artifacts is extremely tedious, error-prone, and it requires significant effort. Even when approaches for automated traceability recovery exist, these provide the requirements analyst with a, usually very long, ranked list of candidate links that needs to be manually inspected. In this paper we introduce an approach called Estimation of the Number of Remaining Links (ENRL) which aims at estimating, via Machine Learning (ML) classifiers, the number of remaining positive links in a ranked list of candidate traceability links produced by a Natural Language Processing techniques-based recovery approach. We have evaluated the accuracy of the ENRL approach by considering several ML classifiers and NLP techniques on three datasets from industry and academia, and concerning traceability links among different kinds of software artifacts including requirements, use cases, design documents, source code, and test cases. Results from our study indicate that: (i) specific estimation models are able to provide accurate estimates of the number of remaining positive links; (ii) the estimation accuracy depends on the choice of the NLP technique, and (iii) univariate estimation models outperform multivariate ones.
引用
收藏
页码:996 / 1027
页数:31
相关论文
共 50 条
  • [31] Estimating the (Remaining) lifetime for maintenance and reuse
    Middendorf, A
    Griese, H
    Pötter, H
    Reichl, H
    Stobbe, I
    ELECTRONICS GOES GREEN 2000 (PLUS): A CHALLENGE FOR THE NEXT MILLENNIUM, VOL 1, PROCEEDINGS, 2000, : 467 - 472
  • [32] Estimating remaining life of airfieId pavements
    Wade, M
    Peshkin, D
    Smith, K
    Yu, HT
    ADVANCING AIRFIELD PAVEMENTS, 2001, : 133 - 147
  • [33] Minimum degree, leaf number and traceability
    Mukwembi, Simon
    CZECHOSLOVAK MATHEMATICAL JOURNAL, 2013, 63 (02) : 539 - 545
  • [34] The number remaining at risk: An adjunct to the number needed to treat
    Massel, D
    Cruickshank, MK
    CANADIAN JOURNAL OF CARDIOLOGY, 2002, 18 (03) : 254 - 258
  • [35] Minimum degree, leaf number and traceability
    Simon Mukwembi
    Czechoslovak Mathematical Journal, 2013, 63 : 539 - 545
  • [36] Establishing Multilevel Test-to-Code Traceability Links
    White, Robert
    Krinke, Jens
    Tan, Raymond
    2020 ACM/IEEE 42ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2020), 2020, : 861 - 872
  • [37] Dynamic traceability links supported by a system architecture description
    Tryggeseth, E
    Nytro, O
    INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE, PROCEEDINGS, 1997, : 180 - 187
  • [38] Recovering Transitive Traceability Links among Software Artifacts
    Nishikawa, Kazuki
    Washizaki, Hironori
    Fukazawa, Yoshiaki
    Oshima, Keishi
    Mibe, Ryota
    2015 31ST INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME) PROCEEDINGS, 2015, : 576 - 580
  • [39] Query-driven soft traceability links for models
    Ábel Hegedüs
    Ákos Horváth
    István Ráth
    Rodrigo Rizzi Starr
    Dániel Varró
    Software & Systems Modeling, 2016, 15 : 733 - 756
  • [40] Cross-Artifact Traceability Using Lightweight Links
    Ratanotayanon, Sukanya
    Sim, Susan Elliott
    Raycraft, Derek J.
    2009 ICSE WORKSHOP ON TRACEABILITY IN EMERGING FORMS OF SOFTWARE ENGINEERING (TEFSE), 2009, : 57 - 64