Automated demarcation of requirements in textual specifications: a machine learning-based approach

被引:0
|
作者
Sallam Abualhaija
Chetan Arora
Mehrdad Sabetzadeh
Lionel C. Briand
Michael Traynor
机构
[1] University of Luxembourg,SnT Centre for Security, Reliability, and Trust
[2] Deakin University,School of Information Technology
[3] University of Ottawa,School of Electrical Engineering and Computer Science
[4] QRA Corp,undefined
来源
关键词
Textual requirements; Requirements identification and classification; Machine learning; Natural language processing;
D O I
暂无
中图分类号
学科分类号
摘要
A simple but important task during the analysis of a textual requirements specification is to determine which statements in the specification represent requirements. In principle, by following suitable writing and markup conventions, one can provide an immediate and unequivocal demarcation of requirements at the time a specification is being developed. However, neither the presence nor a fully accurate enforcement of such conventions is guaranteed. The result is that, in many practical situations, analysts end up resorting to after-the-fact reviews for sifting requirements from other material in a requirements specification. This is both tedious and time-consuming. We propose an automated approach for demarcating requirements in free-form requirements specifications. The approach, which is based on machine learning, can be applied to a wide variety of specifications in different domains and with different writing styles. We train and evaluate our approach over an independently labeled dataset comprised of 33 industrial requirements specifications. Over this dataset, our approach yields an average precision of 81.2% and an average recall of 95.7%. Compared to simple baselines that demarcate requirements based on the presence of modal verbs and identifiers, our approach leads to an average gain of 16.4% in precision and 25.5% in recall. We collect and analyze expert feedback on the demarcations produced by our approach for industrial requirements specifications. The results indicate that experts find our approach useful and efficient in practice. We developed a prototype tool, named DemaRQ, in support of our approach. To facilitate replication, we make available to the research community this prototype tool alongside the non-proprietary portion of our training data.
引用
收藏
页码:5454 / 5497
页数:43
相关论文
共 50 条
  • [1] Automated demarcation of requirements in textual specifications: a machine learning-based approach
    Abualhaija, Sallam
    Arora, Chetan
    Sabetzadeh, Mehrdad
    Briand, Lionel C.
    Traynor, Michael
    EMPIRICAL SOFTWARE ENGINEERING, 2020, 25 (06) : 5454 - 5497
  • [2] A Machine Learning-Based Approach for Demarcating Requirements in Textual Specifications
    Abualhaija, Sallam
    Arora, Chetan
    Sabetzadeh, Mehrdad
    Briand, Lionel C.
    Vaz, Eduardo
    2019 27TH IEEE INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE (RE 2019), 2019, : 51 - 62
  • [3] Capturing creative requirements via requirements reuse: A machine learning-based approach
    Quoc Anh Do
    Bhowmick, Tanmay
    Bradshaw, Gary L.
    JOURNAL OF SYSTEMS AND SOFTWARE, 2020, 170 (170)
  • [4] A Machine Learning-based Approach for Automated Vulnerability Remediation Analysis
    Zhang, Fengli
    Huff, Philip
    McClanahan, Kylie
    Li, Qinghua
    2020 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2020,
  • [5] A Machine Learning-based Approach for Advanced Monitoring of Automated Equipment for the Entertainment Industry
    Berno, Michele
    Canil, Marco
    Chiarello, Nicola
    Piazzon, Luca
    Berti, Fabio
    Ferrari, Francesca
    Zaupa, Alessandro
    Ferro, Nicola
    Rossi, Michele
    Susto, Gian Antonio
    2021 IEEE INTERNATIONAL WORKSHOP ON METROLOGY FOR INDUSTRY 4.0 & IOT (IEEE METROIND4.0 & IOT), 2021, : 386 - 391
  • [6] Identification of non-functional requirements in textual specifications: A semi-supervised learning approach
    Casamayor, Agustin
    Godoy, Daniela
    Campo, Marcelo
    INFORMATION AND SOFTWARE TECHNOLOGY, 2010, 52 (04) : 436 - 445
  • [7] A Machine Learning-Based Approach for Multi-AGV Dispatching at Automated Container Terminals
    Gao, Yinping
    Chen, Chun-Hsien
    Chang, Daofang
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (07)
  • [8] Machine Learning-Based Model Categorization Using Textual and Structural Features
    Khalilipour, Alireza
    Bozyigit, Fatma
    Utku, Can
    Challenger, Moharram
    NEW TRENDS IN DATABASE AND INFORMATION SYSTEMS, ADBIS 2022, 2022, 1652 : 425 - 436
  • [9] An Automated Machine Learning-Based Quantitative Multiparametric Approach for Mitral Regurgitation Severity Grading
    Sadeghpour, Anita
    Jiang, Zhubo
    Hummel, Yoran M.
    Frost, Matthew
    Lam, Carolyn S. P.
    Shah, Sanjiv J.
    Lund, Lars H.
    Stone, Gregg W.
    Swaminathan, Madhav
    Weissman, Neil J.
    Asch, Federico M.
    JACC-CARDIOVASCULAR IMAGING, 2025, 18 (01) : 1 - 12
  • [10] Landscape of Requirements Engineering for Machine Learning-based AI Systems
    Yoshioka, Nobukazu
    Husen, Jati H.
    Tun, Hnin Thandar
    Chen, Zhenxiang
    Washizaki, Hironori
    Fukazawa, Yoshiaki
    2021 28TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE WORKSHOPS (APSECW 2021), 2021, : 5 - 8