Automated demarcation of requirements in textual specifications: a machine learning-based approach

被引：0

作者：

Sallam Abualhaija

Chetan Arora

Mehrdad Sabetzadeh

Lionel C. Briand

Michael Traynor

机构：

[1] University of Luxembourg,SnT Centre for Security, Reliability, and Trust

[2] Deakin University,School of Information Technology

[3] University of Ottawa,School of Electrical Engineering and Computer Science

[4] QRA Corp,undefined

来源：

Empirical Software Engineering | 2020年 / 25卷

关键词：

Textual requirements; Requirements identification and classification; Machine learning; Natural language processing;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

A simple but important task during the analysis of a textual requirements specification is to determine which statements in the specification represent requirements. In principle, by following suitable writing and markup conventions, one can provide an immediate and unequivocal demarcation of requirements at the time a specification is being developed. However, neither the presence nor a fully accurate enforcement of such conventions is guaranteed. The result is that, in many practical situations, analysts end up resorting to after-the-fact reviews for sifting requirements from other material in a requirements specification. This is both tedious and time-consuming. We propose an automated approach for demarcating requirements in free-form requirements specifications. The approach, which is based on machine learning, can be applied to a wide variety of specifications in different domains and with different writing styles. We train and evaluate our approach over an independently labeled dataset comprised of 33 industrial requirements specifications. Over this dataset, our approach yields an average precision of 81.2% and an average recall of 95.7%. Compared to simple baselines that demarcate requirements based on the presence of modal verbs and identifiers, our approach leads to an average gain of 16.4% in precision and 25.5% in recall. We collect and analyze expert feedback on the demarcations produced by our approach for industrial requirements specifications. The results indicate that experts find our approach useful and efficient in practice. We developed a prototype tool, named DemaRQ, in support of our approach. To facilitate replication, we make available to the research community this prototype tool alongside the non-proprietary portion of our training data.

引用

页码：5454 / 5497

页数：43

共 50 条

[1] Automated demarcation of requirements in textual specifications: a machine learning-based approach
Abualhaija, Sallam
Arora, Chetan
Sabetzadeh, Mehrdad
Briand, Lionel C.
Traynor, Michael
EMPIRICAL SOFTWARE ENGINEERING, 2020, 25 (06) : 5454 - 5497
[2] A Machine Learning-Based Approach for Demarcating Requirements in Textual Specifications
Abualhaija, Sallam
Arora, Chetan
Sabetzadeh, Mehrdad
Briand, Lionel C.
Vaz, Eduardo
2019 27TH IEEE INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE (RE 2019), 2019, : 51 - 62
[3] Capturing creative requirements via requirements reuse: A machine learning-based approach
Quoc Anh Do
Bhowmick, Tanmay
Bradshaw, Gary L.
JOURNAL OF SYSTEMS AND SOFTWARE, 2020, 170 (170)
[4] A Machine Learning-based Approach for Automated Vulnerability Remediation Analysis
Zhang, Fengli
Huff, Philip
McClanahan, Kylie
Li, Qinghua
2020 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2020,
[5] A Machine Learning-based Approach for Advanced Monitoring of Automated Equipment for the Entertainment Industry
Berno, Michele
Canil, Marco
Chiarello, Nicola
Piazzon, Luca
Berti, Fabio
Ferrari, Francesca
Zaupa, Alessandro
Ferro, Nicola
Rossi, Michele
Susto, Gian Antonio
2021 IEEE INTERNATIONAL WORKSHOP ON METROLOGY FOR INDUSTRY 4.0 & IOT (IEEE METROIND4.0 & IOT), 2021, : 386 - 391
[6] Identification of non-functional requirements in textual specifications: A semi-supervised learning approach
Casamayor, Agustin
Godoy, Daniela
Campo, Marcelo
INFORMATION AND SOFTWARE TECHNOLOGY, 2010, 52 (04) : 436 - 445
[7] A Machine Learning-Based Approach for Multi-AGV Dispatching at Automated Container Terminals
Gao, Yinping
Chen, Chun-Hsien
Chang, Daofang
JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (07)
[8] Machine Learning-Based Model Categorization Using Textual and Structural Features
Khalilipour, Alireza
Bozyigit, Fatma
Utku, Can
Challenger, Moharram
NEW TRENDS IN DATABASE AND INFORMATION SYSTEMS, ADBIS 2022, 2022, 1652 : 425 - 436
[9] An Automated Machine Learning-Based Quantitative Multiparametric Approach for Mitral Regurgitation Severity Grading
Sadeghpour, Anita
Jiang, Zhubo
Hummel, Yoran M.
Frost, Matthew
Lam, Carolyn S. P.
Shah, Sanjiv J.
Lund, Lars H.
Stone, Gregg W.
Swaminathan, Madhav
Weissman, Neil J.
Asch, Federico M.
JACC-CARDIOVASCULAR IMAGING, 2025, 18 (01) : 1 - 12
[10] Landscape of Requirements Engineering for Machine Learning-based AI Systems
Yoshioka, Nobukazu
Husen, Jati H.
Tun, Hnin Thandar
Chen, Zhenxiang
Washizaki, Hironori
Fukazawa, Yoshiaki
2021 28TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE WORKSHOPS (APSECW 2021), 2021, : 5 - 8

← 1 2 3 4 5 →