Automated demarcation of requirements in textual specifications: a machine learning-based approach

Cited by: 0

Authors:

Sallam Abualhaija

Chetan Arora

Mehrdad Sabetzadeh

Lionel C. Briand

Michael Traynor

Affiliations:

[1] University of Luxembourg, SnT Centre for Security, Reliability, and Trust

[2] Deakin University, School of Information Technology

[3] University of Ottawa, School of Electrical Engineering and Computer Science

[4] QRA Corp

Source:

Empirical Software Engineering | 2020 / Volume 25

Keywords:

Textual requirements; Requirements identification and classification; Machine learning; Natural language processing

DOI:

Not available

Abstract:

A simple but important task during the analysis of a textual requirements specification is to determine which statements in the specification represent requirements. In principle, by following suitable writing and markup conventions, one can provide an immediate and unequivocal demarcation of requirements at the time a specification is being developed. However, neither the presence nor a fully accurate enforcement of such conventions is guaranteed. The result is that, in many practical situations, analysts end up resorting to after-the-fact reviews for sifting requirements from other material in a requirements specification. This is both tedious and time-consuming. We propose an automated approach for demarcating requirements in free-form requirements specifications. The approach, which is based on machine learning, can be applied to a wide variety of specifications in different domains and with different writing styles. We train and evaluate our approach over an independently labeled dataset comprised of 33 industrial requirements specifications. Over this dataset, our approach yields an average precision of 81.2% and an average recall of 95.7%. Compared to simple baselines that demarcate requirements based on the presence of modal verbs and identifiers, our approach leads to an average gain of 16.4% in precision and 25.5% in recall. We collect and analyze expert feedback on the demarcations produced by our approach for industrial requirements specifications. The results indicate that experts find our approach useful and efficient in practice. We developed a prototype tool, named DemaRQ, in support of our approach. To facilitate replication, we make available to the research community this prototype tool alongside the non-proprietary portion of our training data.
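The abstract compares the approach against simple baselines that look for modal verbs and identifiers, and reports results as precision and recall. As a rough illustration only, the sketch below shows what a modal-verb baseline and its evaluation might look like. The modal list, the example sentences, and the function names are all assumptions made for this sketch; they are not taken from the paper, and the identifier-based part of the baseline is not modeled.

```python
import re

# Hypothetical modal-verb list; the abstract does not say which modals the baseline uses.
MODALS = ("shall", "must", "should", "will")
MODAL_RE = re.compile(r"\b(?:" + "|".join(MODALS) + r")\b", re.IGNORECASE)


def baseline_is_requirement(sentence: str) -> bool:
    """Flag a sentence as a requirement if it contains any listed modal verb."""
    return MODAL_RE.search(sentence) is not None


def precision_recall(predicted, actual):
    """Compute precision and recall for boolean predictions against boolean labels."""
    tp = sum(p and a for p, a in zip(predicted, actual))
    fp = sum(p and not a for p, a in zip(predicted, actual))
    fn = sum(a and not p for p, a in zip(predicted, actual))
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Invented sentences with invented labels (True = requirement), for illustration only.
SENTENCES = [
    ("The system shall log every failed login attempt.", True),
    ("This section describes the logging subsystem.", False),
    ("Operators will be trained before deployment.", False),
    ("The response time is at most two seconds.", True),
]

predicted = [baseline_is_requirement(text) for text, _ in SENTENCES]
actual = [label for _, label in SENTENCES]
precision, recall = precision_recall(predicted, actual)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

On these invented sentences the baseline both over-flags (a non-requirement containing "will") and under-flags (a requirement with no modal), which is the kind of error the abstract's precision and recall gains refer to.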

Pages: 5454-5497

Number of pages: 43
