Improving the Performance of Batch-Constrained Reinforcement Learning in Continuous Action Domains via Generative Adversarial Networks

被引:0
|
作者
Saglam, Baturay [1 ]
Dalmaz, Onat [1 ]
Gonc, Kaan [2 ]
Kozat, Suleyman S. [1 ]
机构
[1] Bilkent Univ, Elekt & Elekt Muhendisligi Bolumu, Ankara, Turkey
[2] Bilkent Univ, Bilgisayar Muhendisligi Bolumu, Ankara, Turkey
来源
2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU | 2022年
关键词
deep reinforcement learning; batch-constrained reinforcement learning; offline reinforcement learning;
D O I
10.1109/SIU55565.2022.9864786
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The Batch-Constrained Q-learning algorithm is shown to overcome the extrapolation error and enable deep reinforcement learning agents to learn from a previously collected fixed batch of transitions. However, due to conditional Variational Autoencoders (VAE) used in the data generation module, the BCQ algorithm optimizes a lower variational bound and hence, it is not generalizable to environments with large state and action spaces. In this paper, we show that the performance of the BCQ algorithm can be further improved with the employment of one of the recent advances in deep learning, Generative Adversarial Networks. Our extensive set of experiments shows that the introduced approach significantly improves BCQ in all of the control tasks tested. Moreover, the introduced approach demonstrates robust generalizability to environments with large state and action spaces in the OpenAI Gym control suite.
引用
收藏
页数:4
相关论文
共 50 条
  • [21] Physically constrained generative adversarial networks for improving precipitation fields from Earth system models
    Philipp Hess
    Markus Drüke
    Stefan Petri
    Felix M. Strnad
    Niklas Boers
    Nature Machine Intelligence, 2022, 4 : 828 - 839
  • [22] RCFL-GAN: Resource-Constrained Federated Learning with Generative Adversarial Networks
    Quan, Yuyan
    Guo, Songtao
    Qiao, Dewen
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 513 - 518
  • [23] Using generative adversarial networks to evaluate robustness of reinforcement learning agents against uncertainties
    Khayatian, Fazel
    Nagy, Zoltan
    Bollinger, Andrew
    ENERGY AND BUILDINGS, 2021, 251
  • [24] Robustifying Reinforcement Learning Agents via Action Space Adversarial Training
    Tan, Kai Liang
    Esfandiari, Yasaman
    Lee, Xian Yeow
    Aakanksha
    Sarkar, Soumik
    2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 3959 - 3964
  • [25] Synthesizing controlled microstructures of porous media using generative adversarial networks and reinforcement learning
    Nguyen, Phong C. H.
    Vlassis, Nikolaos N.
    Bahmani, Bahador
    Sun, WaiChing
    Udaykumar, H. S.
    Baek, Stephen S.
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [26] Synthesizing controlled microstructures of porous media using generative adversarial networks and reinforcement learning
    Phong C. H. Nguyen
    Nikolaos N. Vlassis
    Bahador Bahmani
    WaiChing Sun
    H. S. Udaykumar
    Stephen S. Baek
    Scientific Reports, 12
  • [27] A model-based reinforcement learning method based on conditional generative adversarial networks
    Zhao, Tingting
    Wang, Ying
    Li, Guixi
    Kong, Le
    Chen, Yarui
    Wang, Yuan
    Xie, Ning
    Yang, Jucheng
    PATTERN RECOGNITION LETTERS, 2021, 152 : 18 - 25
  • [28] Anomaly Detection in Industrial IoT Using Distributional Reinforcement Learning and Generative Adversarial Networks
    Benaddi, Hafsa
    Jouhari, Mohammed
    Ibrahimi, Khalil
    Ben Othman, Jalel
    Amhoud, El Mehdi
    SENSORS, 2022, 22 (21)
  • [29] CONTINUOUS ACTION REINFORCEMENT LEARNING AUTOMATA Performance and Convergence
    Rodriguez, Abdel
    Grau, Ricardo
    Nowe, Aim
    ICAART 2011: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2011, : 473 - 478
  • [30] Adversarial Attacks on Multiagent Deep Reinforcement Learning Models in Continuous Action Space
    Zhou, Ziyuan
    Liu, Guanjun
    Guo, Weiran
    Zhou, MengChu
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (12): : 7633 - 7646