Automated demarcation of requirements in textual specifications: a machine learning-based approach

被引:0
|
作者
Sallam Abualhaija
Chetan Arora
Mehrdad Sabetzadeh
Lionel C. Briand
Michael Traynor
机构
[1] University of Luxembourg,SnT Centre for Security, Reliability, and Trust
[2] Deakin University,School of Information Technology
[3] University of Ottawa,School of Electrical Engineering and Computer Science
[4] QRA Corp,undefined
来源
关键词
Textual requirements; Requirements identification and classification; Machine learning; Natural language processing;
D O I
暂无
中图分类号
学科分类号
摘要
A simple but important task during the analysis of a textual requirements specification is to determine which statements in the specification represent requirements. In principle, by following suitable writing and markup conventions, one can provide an immediate and unequivocal demarcation of requirements at the time a specification is being developed. However, neither the presence nor a fully accurate enforcement of such conventions is guaranteed. The result is that, in many practical situations, analysts end up resorting to after-the-fact reviews for sifting requirements from other material in a requirements specification. This is both tedious and time-consuming. We propose an automated approach for demarcating requirements in free-form requirements specifications. The approach, which is based on machine learning, can be applied to a wide variety of specifications in different domains and with different writing styles. We train and evaluate our approach over an independently labeled dataset comprised of 33 industrial requirements specifications. Over this dataset, our approach yields an average precision of 81.2% and an average recall of 95.7%. Compared to simple baselines that demarcate requirements based on the presence of modal verbs and identifiers, our approach leads to an average gain of 16.4% in precision and 25.5% in recall. We collect and analyze expert feedback on the demarcations produced by our approach for industrial requirements specifications. The results indicate that experts find our approach useful and efficient in practice. We developed a prototype tool, named DemaRQ, in support of our approach. To facilitate replication, we make available to the research community this prototype tool alongside the non-proprietary portion of our training data.
引用
收藏
页码:5454 / 5497
页数:43
相关论文
共 50 条
  • [21] Machine learning-based software requirements identification for a large number of features
    Talele, Pratvina
    Phalnikar, Rashmi
    Talele, Pratvina (pratvina.talele@mitwpu.edu.in), 1600, Inderscience Publishers (06): : 255 - 260
  • [22] Machine Learning-Based Approach for Fake News Detection
    Gururaj H.L.
    Lakshmi H.
    Soundarya B.C.
    Flammini F.
    Janhavi V.
    Journal of ICT Standardization, 2022, 10 (04): : 509 - 530
  • [23] Machine Learning-Based Approach for Hardware Faults Prediction
    Khalil, Kasem
    Eldash, Omar
    Kumar, Ashok
    Bayoumi, Magdy
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2020, 67 (11) : 3880 - 3892
  • [24] A Machine Learning-Based Approach to Synthesize Multilayer Metasurfaces
    Naseri, Parinaz
    Hum, Sean, V
    2020 IEEE INTERNATIONAL SYMPOSIUM ON ANTENNAS AND PROPAGATION AND NORTH AMERICAN RADIO SCIENCE MEETING, 2020, : 933 - 934
  • [25] Machine Learning-Based Approach for the Gambling Problem Identification
    Kozak, Jan
    Probierz, Barbara
    Juszczuk, Przemyslaw
    Dziczkowski, Grzegorz
    Jach, Tomasz
    Stefanski, Piotr
    Glowania, Szymon
    Hrabia, Anita
    Wolek, Gabriel
    Sznapka, Wojciech
    Swierk, Lukasz
    Joniec, Natalia
    VIETNAM JOURNAL OF COMPUTER SCIENCE, 2025,
  • [26] Subtyping of hepatocellular adenoma: a machine learning-based approach
    Yongjun Liu
    Yao-Zhong Liu
    Lifu Sun
    Yoh Zen
    Chie Inomoto
    Matthew M. Yeh
    Virchows Archiv, 2022, 481 : 49 - 61
  • [27] A Machine Learning-Based Probabilistic Approach for Irrigation Scheduling
    Srivastava, Shivendra
    Kumar, Nishant
    Malakar, Arindam
    Choudhury, Sruti Das
    Ray, Chittaranjan
    Roy, Tirthankar
    WATER RESOURCES MANAGEMENT, 2024, 38 (05) : 1639 - 1653
  • [28] Machine learning-based new approach to films review
    Jassim, Mustafa Abdalrassual
    Abd, Dhafar Hamed
    Omri, Mohamed Nazih
    SOCIAL NETWORK ANALYSIS AND MINING, 2023, 13 (01)
  • [29] A Machine learning-based approach to determining stress in rails
    Belding, Matthew
    Enshaeian, Alireza
    Rizzo, Piervincenzo
    STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2023, 22 (01): : 639 - 656
  • [30] Evaluating a Machine Learning-based Approach for Cache Configuration
    Ribeiro, Lucas
    Jacobi, Ricardo
    Junior, Francisco
    da Silva, Jones Yudi
    Silva, Ivan Saraiva
    2022 IEEE 13TH LATIN AMERICAN SYMPOSIUM ON CIRCUITS AND SYSTEMS (LASCAS), 2022, : 180 - 183