Testing the Plasticity of Reinforcement Learning-based Systems

被引：9

作者：

Biagiola, Matteo ^{[1
]}

Tonella, Paolo ^{[1
]}

机构：

[1] Univ Svizzera Italiana, CH-6900 Lugano, Switzerland

来源：

ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY | 2022年 / 31卷 / 04期

基金：

欧洲研究理事会;

关键词：

Software testing; reinforcement learning; empirical software engineering; NEURAL-NETWORKS;

D O I：

10.1145/3511701

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

The dataset available for pre-release training of a machine-learning based system is often not representative of all possible execution contexts that the system will encounter in the field. Reinforcement Learning (RL) is a prominent approach among those that support continual learning, i.e., learning continually in the field, in the post-release phase. No study has so far investigated any method to test the plasticity of RL-based systems, i.e., their capability to adapt to an execution context that may deviate from the training one. We propose an approach to test the plasticity of RL-based systems. The output of our approach is a quantification of the adaptation and anti-regression capabilities of the system, obtained by computing the adaptation frontier of the system in a changed environment. We visualize such frontier as an adaptation/anti-regression heatmap in two dimensions, or as a clustered projection when more than two dimensions are involved. In this way, we provide developers with information on the amount of changes that can be accommodated by the continual learning component of the system, which is key to decide if online, in-the-field learning can be safely enabled or not.

引用

页数：46

共 50 条

[41] Deep Reinforcement Learning-Based Operation of Distribution Systems Using Surrogate Model
Bu, Van -Hai
Zarrabian, Sina
Su, Wencong
[J]. 2023 IEEE POWER & ENERGY SOCIETY GENERAL MEETING, PESGM, 2023,
[42] Reinforcement Learning-based Admission Control in Delay-sensitive Service Systems
Raeis, Majid
Tizghadam, Ali
Leon-Garcia, Alberto
[J]. 2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
[43] Reinforcement Learning-based Active Disturbance Rejection Control for Nonlinear Systems with Disturbance
Kong, Xiangyu
Xia, Yuanqing
[J]. 2023 2ND CONFERENCE ON FULLY ACTUATED SYSTEM THEORY AND APPLICATIONS, CFASTA, 2023, : 799 - 804
[44] Towards the portability of knowledge in reinforcement learning-based systems for automatic drone navigation
Barreiro, Jose M.
Lara, Juan A.
Manrique, Daniel
Smith, Peter
[J]. PEERJ COMPUTER SCIENCE, 2023, 9
[45] Reinforcement learning-based output feedback control of nonlinear systems with input constraints
He, P
Jagannathan, S
[J]. PROCEEDINGS OF THE 2004 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2004, : 2563 - 2568
[46] Reinforcement learning-based robust optimal tracking control for disturbed nonlinear systems
Fan, Zhong-Xin
Tang, Lintao
Li, Shihua
Liu, Rongjie
[J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (33): : 23987 - 23996
[47] Reinforcement Learning-Based Adaptive Optimal Control for Nonlinear Systems With Asymmetric Hysteresis
Zheng, Licheng
Liu, Zhi
Wang, Yaonan
Chen, C. L. Philip
Zhang, Yun
Wu, Zongze
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 10
[48] Reinforcement Learning-Based Model Predictive Control for Discrete-Time Systems
Lin, Min
Sun, Zhongqi
Xia, Yuanqing
Zhang, Jinhui
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3312 - 3324
[49] Reinforcement learning-based group navigation approach for multiple autonomous robotic systems
Azouaoui, O.
Cherifi, A.
Bensalem, R.
Farah, A.
Achour, K.
[J]. ADVANCED ROBOTICS, 2006, 20 (05) : 519 - 542
[50] Multi-agent Transfer Learning in Reinforcement Learning-based Ride-sharing Systems
Castagna, Alberto
Dusparic, Ivana
[J]. ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2022, : 120 - 130

← 1 2 3 4 5 →