Testing machine learning based systems: a systematic mapping

Cited by: 109
Authors
Riccio, Vincenzo [1]
Jahangirova, Gunel [1]
Stocco, Andrea [1]
Humbatova, Nargiz [1]
Weiss, Michael [1]
Tonella, Paolo [1]
Affiliations
[1] Univ Svizzera Italiana USI, Software Inst, Via Buffi 13, Lugano, Switzerland
Keywords
Systematic mapping; Systematic review; Software testing; Machine learning; GENERATION;
DOI
10.1007/s10664-020-09881-0
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
Context: A Machine Learning based System (MLS) is a software system including one or more components that learn how to perform a task from a given data set. The increasing adoption of MLSs in safety critical domains such as autonomous driving, healthcare, and finance has fostered much attention towards the quality assurance of such systems. Despite the advances in software testing, MLSs bring novel and unprecedented challenges, since their behaviour is defined jointly by the code that implements them and the data used for training them.
Objective: To identify the existing solutions for functional testing of MLSs, and classify them from three different perspectives: (1) the context of the problem they address, (2) their features, and (3) their empirical evaluation. To report demographic information about the ongoing research. To identify open challenges for future research.
Method: We conducted a systematic mapping study about testing techniques for MLSs driven by 33 research questions. We followed existing guidelines when defining our research protocol so as to increase the repeatability and reliability of our results.
Results: We identified 70 relevant primary studies, mostly published in the last years. We identified 11 problems addressed in the literature. We investigated multiple aspects of the testing approaches, such as the used/proposed adequacy criteria, the algorithms for test input generation, and the test oracles.
Conclusions: The most active research areas in MLS testing address automated scenario/input generation and test oracle creation. MLS testing is a rapidly growing and developing research area, with many open challenges, such as the generation of realistic inputs and the definition of reliable evaluation metrics and benchmarks.
Pages: 5193 - 5254
Page count: 62
Related papers
50 in total
  • [1] Testing machine learning based systems: a systematic mapping
    Vincenzo Riccio
    Gunel Jahangirova
    Andrea Stocco
    Nargiz Humbatova
    Michael Weiss
    Paolo Tonella
    [J]. Empirical Software Engineering, 2020, 25 : 5193 - 5254
  • [2] A systematic mapping study on testing of machine learning programs
    Sherin, Salman
    Khan, Muhammad Uzair
    Iqbal, Muhammad Zohaib
    [J]. arXiv, 2019,
  • [3] Using Machine Learning on Testing IoT Applications: a systematic mapping
    Freitas, Lavinia
    Lelli, Valeria
    [J]. PROCEEDINGS OF THE 28TH BRAZILIAN SYMPOSIUM ON MULTIMEDIA AND THE WEB, WEBMEDIA 2022, 2022, : 348 - 358
  • [4] Machine Learning Applied to Software Testing: A Systematic Mapping Study
    Durelli, Vinicius H. S.
    Durelli, Rafael S.
    Borges, Simone S.
    Endo, Andre T.
    Eler, Marcelo M.
    Dias, Diego R. C.
    Guimaraes, Marcelo P.
    [J]. IEEE TRANSACTIONS ON RELIABILITY, 2019, 68 (03) : 1189 - 1212
  • [5] Challenges of Testing Machine Learning Based Systems
    Marijan, Dusica
    Gotlieb, Arnaud
    Ahuja, Mohit Kumar
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE TESTING (AITEST), 2019, : 101 - 102
  • [6] Grammar Based Directed Testing of Machine Learning Systems
    Udeshi, Sakshi
    Chattopadhyay, Sudipta
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2021, 47 (11) : 2487 - 2503
  • [7] Machine Learning in Gamification and Gamification in Machine Learning: A Systematic Literature Mapping
    Swacha, Jakub
    Gracel, Michal
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (20):
  • [8] Serverless on Machine Learning: A Systematic Mapping Study
    Barrak, Amine
    Petrillo, Fabio
    Jaafar, Fehmi
    [J]. IEEE ACCESS, 2022, 10 : 99337 - 99352
  • [9] A systematic review: machine learning based recommendation systems for e-learning
    Khanal, Shristi Shakya
    Prasad, P. W. C.
    Alsadoon, Abeer
    Maag, Angelika
    [J]. EDUCATION AND INFORMATION TECHNOLOGIES, 2020, 25 (04) : 2635 - 2664