Applications of natural language processing in software traceability: A systematic mapping study?

被引:8
|
作者
Pauzi, Zaki [1 ]
Capiluppi, Andrea [1 ]
机构
[1] Univ Groningen, Bernoulli Inst, Nijenborgh 9, NL-9747 AG Groningen, Netherlands
关键词
Software traceability; Information retrieval; Natural language processing; SOURCE-CODE; BUG REPORTS; LINKS; REQUIREMENTS; INFORMATION; LOCATION; TIQI;
D O I
10.1016/j.jss.2023.111616
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
A key part of software evolution and maintenance is the continuous integration from collaborative efforts, often resulting in complex traceability challenges between software artifacts: features and modules remain scattered in the source code, and traceability links become harder to recover. In this paper, we perform a systematic mapping study dealing with recent research recovering these links through information retrieval, with a particular focus on natural language processing (NLP). Our search strategy gathered a total of 96 papers in focus of our study, covering a period from 2013 to 2021. We conducted trend analysis on NLP techniques and tools involved, and traceability efforts (applying NLP) across the software development life cycle (SDLC). Based on our study, we have identified the following key issues, barriers, and setbacks: syntax convention, configuration, translation, explainability, properties representation, tacit knowledge dependency, scalability, and data availability. Based on these, we consolidated the following open challenges: representation similarity across artifacts, the effectiveness of NLP for traceability, and achieving scalable, adaptive, and explainable models. To address these challenges, we recommend a holistic framework for NLP solutions to achieve effective traceability and efforts in achieving interoperability and explainability in NLP models for traceability. (c) 2023 The Author(s). Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Natural Language Processing for Requirements Engineering: A Systematic Mapping Study
    Zhao, Liping
    Alhoshan, Waad
    Ferrari, Alessio
    Letsholo, Keletso J.
    Ajagbe, Muideen A.
    Chioasca, Erol-Valeriu
    Batista-Navarro, Riza T.
    [J]. ACM COMPUTING SURVEYS, 2022, 54 (03)
  • [2] Software product lines traceability: A systematic mapping study
    Vale, Tassio
    de Almeida, Eduardo Santana
    Alves, Vander
    Kulesza, Uira
    Niu, Nan
    de Lima, Ricardo
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2017, 84 : 1 - 18
  • [3] Support for Traceability Management of Software Artefacts using Natural Language Processing
    Arunthavanathan, A.
    Shanmugathasan, S.
    Ratnavel, S.
    Thiyagarajah, V.
    Perera, I.
    Meedeniya, D.
    Balasubramaniam, D.
    [J]. 2ND INTERNATIONAL MERCON 2016 MORATUWA ENGINEERING RESEARCH CONFERENCE, 2016, : 18 - 23
  • [4] Applications of Literature Mining and Natural Language Processing to Evidence Mapping and Systematic Review
    Wikoff, Danielle S.
    [J]. ENVIRONMENTAL AND MOLECULAR MUTAGENESIS, 2022, 63 : 18 - 18
  • [5] The Applications of Natural Language Processing (NLP) for Software Requirement Engineering - A Systematic Literature Review
    Nazir, Farhana
    Butt, Wasi Haider
    Anwar, Muhammad Waseem
    Khattak, Muazzam A. Khan
    [J]. INFORMATION SCIENCE AND APPLICATIONS 2017, ICISA 2017, 2017, 424 : 485 - 493
  • [6] Traceability of Architectural Design Decisions and Software Artifacts: A Systematic Mapping Study
    Hyun, Santiago
    Hurtado, Julio Ariel
    [J]. FOUNDATIONS OF COMPUTING AND DECISION SCIENCES, 2023, 48 (04) : 401 - 423
  • [7] Applications of natural language processing in radiology: A systematic review
    Linna, Nathaniel
    Kahn, Charles E., Jr.
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2022, 163
  • [8] Empirical studies on software traceability: A mapping study
    Charalampidou, Sofia
    Ampatzoglou, Apostolos
    Karountzos, Evangelos
    Avgeriou, Paris
    [J]. JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2021, 33 (02)
  • [9] Applications of natural language processing
    Blandon Andrade, Juan Carlos
    [J]. ENTRE CIENCIA E INGENIERIA, 2022, 16 (31): : 7 - 8
  • [10] Natural Language Processing-Based Software Testing: A Systematic Literature Review
    Boukhlif, Mohamed
    Hanine, Mohamed
    Kharmoum, Nassim
    Ruigomez Noriega, Atenea
    Garcia Obeso, David
    Ashraf, Imran
    [J]. IEEE ACCESS, 2024, 12 : 79383 - 79400