Improving Generalization in Reinforcement Learning with Mixture Regularization

被引:0
|
作者
Wang, Kaixin [1 ]
Kang, Bingyi [1 ]
Shao, Jie [2 ]
Feng, Jiashi [1 ]
机构
[1] Natl Univ Singapore, Singapore, Singapore
[2] ByteDance AI Lab, Singapore, Singapore
关键词
NEURAL-NETWORKS; GAME; GO;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep reinforcement learning (RL) agents trained in a limited set of environments tend to suffer overfitting and fail to generalize to unseen testing environments. To improve their generalizability, data augmentation approaches (e.g. cutout and random convolution) are previously explored to increase the data diversity. However, we find these approaches only locally perturb the observations regardless of the training environments, showing limited effectiveness on enhancing the data diversity and the generalization performance. In this work, we introduce a simple approach, named mixreg, which trains agents on a mixture of observations from different training environments and imposes linearity constraints on the observation interpolations and the supervision (e.g. associated reward) interpolations. Mixreg increases the data diversity more effectively and helps learn smoother policies. We verify its effectiveness on improving generalization by conducting extensive experiments on the large-scale Procgen benchmark. Results show mixreg outperforms the well-established baselines on unseen testing environments by a large margin. Mixreg is simple, effective and general. It can be applied to both policy-based and value-based RL algorithms. Code is available at https://github.com/kaixin96/mixreg.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Regularization Guarantees Generalization in Bayesian Reinforcement Learning through Algorithmic Stability
    Tamar, Aviv
    Soudry, Daniel
    Zisselman, Ev
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8423 - 8431
  • [2] A new regularization learning method for improving generalization capability of neural network
    Wu, Y
    Zhang, LM
    [J]. PROCEEDINGS OF THE 4TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-4, 2002, : 2011 - 2015
  • [3] IMPROVING GENERALIZATION OF REINFORCEMENT LEARNING USING A BILINEAR POLICY NETWORK
    Fang, Fen
    Liang, Wenyu
    Wu, Yan
    Xu, Qianli
    Lim, Joo-Hwee
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 991 - 995
  • [4] Exemplar Generalization in Reinforcement Learning: Improving Performance with Fewer Exemplars
    Matsushima, Hiroyasu
    Hattori, Kiyohiko
    Takadama, Keiki
    [J]. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2009, 13 (06) : 683 - 690
  • [5] Improving Policy Generalization for Teacher-Student Reinforcement Learning
    Xudong, Gong
    Hongda, Jia
    Xing, Zhou
    Dawei, Feng
    Bo, Ding
    Jie, Xu
    [J]. KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2020), PT II, 2020, 12275 : 39 - 47
  • [6] Improving generalization performance of natural gradient learning using optimized regularization by NIC
    Park, H
    Murata, N
    Amari, S
    [J]. NEURAL COMPUTATION, 2004, 16 (02) : 355 - 382
  • [7] Improving Generalization of Meta-learning with Inverted Regularization at Inner-level
    Wang, Lianzhe
    Zhou, Shiji
    Zhang, Shanghang
    Chu, Xu
    Chang, Heng
    Zhu, Wenwu
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 7826 - 7835
  • [8] Improving Generalization of Deep Reinforcement Learning-based TSP Solvers
    Ouyang, Wenbin
    Wang, Yisen
    Han, Shaochen
    Jin, Zhejian
    Weng, Paul
    [J]. 2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
  • [9] Improving Offline Reinforcement Learning With In-Sample Advantage Regularization for Robot Manipulation
    Ma, Chengzhong
    Yang, Deyu
    Wu, Tianyu
    Liu, Zeyang
    Yang, Houxue
    Chen, Xingyu
    Lan, Xuguang
    Zheng, Nanning
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [10] Improving Generalization of Reinforcement Learning with Minimax Distributional Soft Actor-Critic
    Ren, Yangang
    Duan, Jingliang
    Li, Shengbo Eben
    Guan, Yang
    Sun, Qi
    [J]. 2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,