Testing the Plasticity of Reinforcement Learning-based Systems

被引:9
|
作者
Biagiola, Matteo [1 ]
Tonella, Paolo [1 ]
机构
[1] Univ Svizzera Italiana, CH-6900 Lugano, Switzerland
基金
欧洲研究理事会;
关键词
Software testing; reinforcement learning; empirical software engineering; NEURAL-NETWORKS;
D O I
10.1145/3511701
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The dataset available for pre-release training of a machine-learning based system is often not representative of all possible execution contexts that the system will encounter in the field. Reinforcement Learning (RL) is a prominent approach among those that support continual learning, i.e., learning continually in the field, in the post-release phase. No study has so far investigated any method to test the plasticity of RL-based systems, i.e., their capability to adapt to an execution context that may deviate from the training one. We propose an approach to test the plasticity of RL-based systems. The output of our approach is a quantification of the adaptation and anti-regression capabilities of the system, obtained by computing the adaptation frontier of the system in a changed environment. We visualize such frontier as an adaptation/anti-regression heatmap in two dimensions, or as a clustered projection when more than two dimensions are involved. In this way, we provide developers with information on the amount of changes that can be accommodated by the continual learning component of the system, which is key to decide if online, in-the-field learning can be safely enabled or not.
引用
收藏
页数:46
相关论文
共 50 条
  • [41] Deep Reinforcement Learning-Based Operation of Distribution Systems Using Surrogate Model
    Bu, Van -Hai
    Zarrabian, Sina
    Su, Wencong
    [J]. 2023 IEEE POWER & ENERGY SOCIETY GENERAL MEETING, PESGM, 2023,
  • [42] Reinforcement Learning-based Admission Control in Delay-sensitive Service Systems
    Raeis, Majid
    Tizghadam, Ali
    Leon-Garcia, Alberto
    [J]. 2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [43] Reinforcement Learning-based Active Disturbance Rejection Control for Nonlinear Systems with Disturbance
    Kong, Xiangyu
    Xia, Yuanqing
    [J]. 2023 2ND CONFERENCE ON FULLY ACTUATED SYSTEM THEORY AND APPLICATIONS, CFASTA, 2023, : 799 - 804
  • [44] Towards the portability of knowledge in reinforcement learning-based systems for automatic drone navigation
    Barreiro, Jose M.
    Lara, Juan A.
    Manrique, Daniel
    Smith, Peter
    [J]. PEERJ COMPUTER SCIENCE, 2023, 9
  • [45] Reinforcement learning-based output feedback control of nonlinear systems with input constraints
    He, P
    Jagannathan, S
    [J]. PROCEEDINGS OF THE 2004 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2004, : 2563 - 2568
  • [46] Reinforcement learning-based robust optimal tracking control for disturbed nonlinear systems
    Fan, Zhong-Xin
    Tang, Lintao
    Li, Shihua
    Liu, Rongjie
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (33): : 23987 - 23996
  • [47] Reinforcement Learning-Based Adaptive Optimal Control for Nonlinear Systems With Asymmetric Hysteresis
    Zheng, Licheng
    Liu, Zhi
    Wang, Yaonan
    Chen, C. L. Philip
    Zhang, Yun
    Wu, Zongze
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 10
  • [48] Reinforcement Learning-Based Model Predictive Control for Discrete-Time Systems
    Lin, Min
    Sun, Zhongqi
    Xia, Yuanqing
    Zhang, Jinhui
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3312 - 3324
  • [49] Reinforcement learning-based group navigation approach for multiple autonomous robotic systems
    Azouaoui, O.
    Cherifi, A.
    Bensalem, R.
    Farah, A.
    Achour, K.
    [J]. ADVANCED ROBOTICS, 2006, 20 (05) : 519 - 542
  • [50] Multi-agent Transfer Learning in Reinforcement Learning-based Ride-sharing Systems
    Castagna, Alberto
    Dusparic, Ivana
    [J]. ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2022, : 120 - 130