Automated demarcation of requirements in textual specifications: a machine learning-based approach

被引:0
|
作者
Sallam Abualhaija
Chetan Arora
Mehrdad Sabetzadeh
Lionel C. Briand
Michael Traynor
机构
[1] University of Luxembourg,SnT Centre for Security, Reliability, and Trust
[2] Deakin University,School of Information Technology
[3] University of Ottawa,School of Electrical Engineering and Computer Science
[4] QRA Corp,undefined
来源
关键词
Textual requirements; Requirements identification and classification; Machine learning; Natural language processing;
D O I
暂无
中图分类号
学科分类号
摘要
A simple but important task during the analysis of a textual requirements specification is to determine which statements in the specification represent requirements. In principle, by following suitable writing and markup conventions, one can provide an immediate and unequivocal demarcation of requirements at the time a specification is being developed. However, neither the presence nor a fully accurate enforcement of such conventions is guaranteed. The result is that, in many practical situations, analysts end up resorting to after-the-fact reviews for sifting requirements from other material in a requirements specification. This is both tedious and time-consuming. We propose an automated approach for demarcating requirements in free-form requirements specifications. The approach, which is based on machine learning, can be applied to a wide variety of specifications in different domains and with different writing styles. We train and evaluate our approach over an independently labeled dataset comprised of 33 industrial requirements specifications. Over this dataset, our approach yields an average precision of 81.2% and an average recall of 95.7%. Compared to simple baselines that demarcate requirements based on the presence of modal verbs and identifiers, our approach leads to an average gain of 16.4% in precision and 25.5% in recall. We collect and analyze expert feedback on the demarcations produced by our approach for industrial requirements specifications. The results indicate that experts find our approach useful and efficient in practice. We developed a prototype tool, named DemaRQ, in support of our approach. To facilitate replication, we make available to the research community this prototype tool alongside the non-proprietary portion of our training data.
引用
收藏
页码:5454 / 5497
页数:43
相关论文
共 50 条
  • [41] Machine Learning-Based Automated Fault Detection and Diagnostics in Building Systems
    Nelson, William
    Dieckert, Christopher
    ENERGIES, 2024, 17 (02)
  • [42] Towards A Machine Learning-Based Framework For Automated Design of Networking Protocols
    Pasandi, Hannaneh Barahouei
    2019 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS (PERCOM WORKSHOPS), 2019, : 433 - 434
  • [43] Towards a Machine Learning-based Model for Automated Crop Type Mapping
    Dakir, Asmae
    Barramou, Fatimazahra
    Alami, Omar Bachir
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (01) : 772 - 779
  • [44] Automated machine learning-based building energy load prediction method
    Zhang, Chaobo
    Tian, Xiangning
    Zhao, Yang
    Lu, Jie
    JOURNAL OF BUILDING ENGINEERING, 2023, 80
  • [45] A Machine Learning-based Triage methodology for automated categorization of digital media
    Marturana, Fabio
    Tacconi, Simone
    DIGITAL INVESTIGATION, 2013, 10 (02) : 193 - 204
  • [46] Machine learning-based automated phenotyping of inflammatory nocifensive behavior in mice
    Wotton, Janine M.
    Peterson, Emma
    Anderson, Laura
    Murray, Stephen A.
    Braun, Robert E.
    Chesler, Elissa J.
    White, Jacqueline K.
    Kumar, Vivek
    MOLECULAR PAIN, 2020, 16
  • [47] Deep learning-based approach for automated assessment of PTEN status
    Jamaspishvili, Tamara
    Harmon, Stephanie
    Patel, Palak
    Sanford, Thomas
    Caven, Isabelle
    Iseman, Rachael
    Mehralivand, Sherif
    Choyke, Peter L.
    Berman, David Monty
    Turkbey, Baris
    JOURNAL OF CLINICAL ONCOLOGY, 2020, 38 (06)
  • [48] A machine learning approach to textual entailment recognition
    Zanzotto, Fabio Massimo
    Pennacchiotti, Marco
    Moschitti, Alessandro
    NATURAL LANGUAGE ENGINEERING, 2009, 15 : 551 - 582
  • [49] SoK: Machine vs. machine - A systematic classification of automated machine learning-based CAPTCHA solvers
    Dionysiou, Antreas
    Athanasopoulos, Elias
    COMPUTERS & SECURITY, 2020, 97
  • [50] Evidence-driven Requirements Engineering for Uncertainty of Machine Learning-based Systems
    Ishikawa, Fuyuki
    Matsuno, Yutaka
    2020 28TH IEEE INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE (RE'20), 2020, : 346 - 351