A Search-Based Testing Approach for Deep Reinforcement Learning Agents

被引:12
|
作者
Zolfagharian, Amirhossein [1 ]
Abdellatif, Manel [2 ,3 ]
Briand, Lionel C. [1 ,4 ]
Bagherzadeh, Mojtaba [3 ,5 ]
Ramesh, S.
机构
[1] Univ Ottawa, Sch Elect Engn & Comp Sci EECS, Ottawa, ON K1N 6N5, Canada
[2] Ecole Technol Super, Software & Informat Technol Engn Dept, Montreal, PQ H3C 1K3, Canada
[3] Univ Ottawa, Sch EECS, Ottawa, ON K1N 6N5, Canada
[4] Univ Luxembourg, SnT Ctr Secur Reliabil & Trust, L-4365 Esch Sur Alzette, Luxembourg
[5] Cisco, Ottawa, ON K1N 6N5, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Genetic algorithm; machine learning; reinforcement learning; state abstraction; testing; GENETIC ALGORITHM;
D O I
10.1109/TSE.2023.3269804
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Deep Reinforcement Learning (DRL) algorithms have been increasingly employed during the last decade to solve various decision-making problems such as autonomous driving, trading decisions, and robotics. However, these algorithms have faced great challenges when deployed in safety-critical environments since they often exhibit erroneous behaviors that can lead to potentially critical errors. One of the ways to assess the safety of DRL agents is to test them to detect possible faults leading to critical failures during their execution. This raises the question of how we can efficiently test DRL policies to ensure their correctness and adherence to safety requirements. Most existing works on testing DRL agents use adversarial attacks that perturb states or actions of the agent. However, such attacks often lead to unrealistic states of the environment. Furthermore, their main goal is to test the robustness of DRL agents rather than testing the compliance of the agents' policies with respect to requirements. Due to the huge state space of DRL environments, the high cost of test execution, and the black-box nature of DRL algorithms, exhaustive testing of DRL agents is impossible. In this paper, we propose a Search-based Testing Approach of Reinforcement Learning Agents (STARLA) to test the policy of a DRL agent by effectively searching for failing executions of the agent within a limited testing budget. We rely on machine learning models and a dedicated genetic algorithm to narrow the search toward faulty episodes (i.e., sequences of states and actions produced by the DRL agent). We apply STARLA on Deep-Q-Learning agents trained on two different RL problems widely used as benchmarks and show that STARLA significantly outperforms Random Testing by detecting more faults related to the agent's policy. We also investigate how to extract rules that characterize faulty episodes of the DRL agent using our search results. Such rules can be used to understand the conditions under which the agent fails and thus assess the risks of deploying it.
引用
收藏
页码:3715 / 3735
页数:21
相关论文
共 50 条
  • [1] Search-Based Testing of Reinforcement Learning
    Tappler, Martin
    Cordoba, Filip Cano
    Aichernig, Bernhard K.
    Koenighofer, Bettina
    PROCEEDINGS OF THE THIRTY-FIRST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2022, 2022, : 503 - 510
  • [2] DeepEvolution: A Search-Based Testing Approach for Deep Neural Networks
    Ben Braiek, Houssem
    Khomh, Foutse
    2019 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2019), 2019, : 454 - 458
  • [3] Novel Cuckoo Search-Based Metaheuristic Approach for Deep Learning Prediction of Depression
    Jawad, Khurram
    Mahto, Rajul
    Das, Aryan
    Ahmed, Saboor Uddin
    Aziz, Rabia Musheer
    Kumar, Pavan
    APPLIED SCIENCES-BASEL, 2023, 13 (09):
  • [4] DiverGet: a Search-Based Software Testing approach for Deep Neural Network Quantization assessment
    Yahmed, Ahmed Haj
    Ben Braiek, Houssem
    Khomh, Foutse
    Bouzidi, Sonia
    Zaatour, Rania
    EMPIRICAL SOFTWARE ENGINEERING, 2022, 27 (07)
  • [5] DiverGet: a Search-Based Software Testing approach for Deep Neural Network Quantization assessment
    Ahmed Haj Yahmed
    Houssem Ben Braiek
    Foutse Khomh
    Sonia Bouzidi
    Rania Zaatour
    Empirical Software Engineering, 2022, 27
  • [6] Testing of Deep Reinforcement Learning Agents with Surrogate Models
    Biagiola, Matteo
    Tonella, Paolo
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2024, 33 (03)
  • [7] A Framework for Continuous Regression and Integration Testing in IoT Systems Based on Deep Learning and Search-Based Techniques
    Medhat, Noha
    Moussa, Sherin M.
    Badr, Nagwa Lotfy
    Tolba, Mohamed F.
    IEEE ACCESS, 2020, 8 : 215716 - 215726
  • [8] A Deep Reinforcement Learning-Based Approach for Android GUI Testing
    Gao, Yuemeng
    Tao, Chuanqi
    Guo, Hongjing
    Gao, Jerry
    WEB AND BIG DATA, PT III, APWEB-WAIM 2022, 2023, 13423 : 262 - 276
  • [9] Leveraging Deep Learning Representation for Search-based Image Annotation
    Kashani, Mahya Mohammadi
    Amiri, S. Hamid
    2017 19TH CSI INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2017, : 156 - 161
  • [10] New Hybrid Deep Neural Architectural Search-Based Ensemble Reinforcement Learning Strategy for Wind Power Forecasting
    Jalali, Seyed Mohammad Jafar
    Osorio, Gerardo J.
    Ahmadian, Sajad
    Lotfi, Mohamed
    Campos, Vasco M. A.
    Shafie-khah, Miadreza
    Khosravi, Abbas
    Catalao, Joao P. S.
    IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2022, 58 (01) : 15 - 27