Improving the Performance of Batch-Constrained Reinforcement Learning in Continuous Action Domains via Generative Adversarial Networks

被引：0

作者：

Saglam, Baturay ^{[1
]}

Dalmaz, Onat ^{[1
]}

Gonc, Kaan ^{[2
]}

Kozat, Suleyman S. ^{[1
]}

机构：

[1] Bilkent Univ, Elekt & Elekt Muhendisligi Bolumu, Ankara, Turkey

[2] Bilkent Univ, Bilgisayar Muhendisligi Bolumu, Ankara, Turkey

来源：

2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU | 2022年

关键词：

deep reinforcement learning; batch-constrained reinforcement learning; offline reinforcement learning;

D O I：

10.1109/SIU55565.2022.9864786

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

The Batch-Constrained Q-learning algorithm is shown to overcome the extrapolation error and enable deep reinforcement learning agents to learn from a previously collected fixed batch of transitions. However, due to conditional Variational Autoencoders (VAE) used in the data generation module, the BCQ algorithm optimizes a lower variational bound and hence, it is not generalizable to environments with large state and action spaces. In this paper, we show that the performance of the BCQ algorithm can be further improved with the employment of one of the recent advances in deep learning, Generative Adversarial Networks. Our extensive set of experiments shows that the introduced approach significantly improves BCQ in all of the control tasks tested. Moreover, the introduced approach demonstrates robust generalizability to environments with large state and action spaces in the OpenAI Gym control suite.

引用

页数：4

共 50 条

[21] Physically constrained generative adversarial networks for improving precipitation fields from Earth system models
Philipp Hess
Markus Drüke
Stefan Petri
Felix M. Strnad
Niklas Boers
Nature Machine Intelligence, 2022, 4 : 828 - 839
[22] RCFL-GAN: Resource-Constrained Federated Learning with Generative Adversarial Networks
Quan, Yuyan
Guo, Songtao
Qiao, Dewen
PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 513 - 518
[23] Using generative adversarial networks to evaluate robustness of reinforcement learning agents against uncertainties
Khayatian, Fazel
Nagy, Zoltan
Bollinger, Andrew
ENERGY AND BUILDINGS, 2021, 251
[24] Robustifying Reinforcement Learning Agents via Action Space Adversarial Training
Tan, Kai Liang
Esfandiari, Yasaman
Lee, Xian Yeow
Aakanksha
Sarkar, Soumik
2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 3959 - 3964
[25] Synthesizing controlled microstructures of porous media using generative adversarial networks and reinforcement learning
Nguyen, Phong C. H.
Vlassis, Nikolaos N.
Bahmani, Bahador
Sun, WaiChing
Udaykumar, H. S.
Baek, Stephen S.
SCIENTIFIC REPORTS, 2022, 12 (01)
[26] Synthesizing controlled microstructures of porous media using generative adversarial networks and reinforcement learning
Phong C. H. Nguyen
Nikolaos N. Vlassis
Bahador Bahmani
WaiChing Sun
H. S. Udaykumar
Stephen S. Baek
Scientific Reports, 12
[27] A model-based reinforcement learning method based on conditional generative adversarial networks
Zhao, Tingting
Wang, Ying
Li, Guixi
Kong, Le
Chen, Yarui
Wang, Yuan
Xie, Ning
Yang, Jucheng
PATTERN RECOGNITION LETTERS, 2021, 152 : 18 - 25
[28] Anomaly Detection in Industrial IoT Using Distributional Reinforcement Learning and Generative Adversarial Networks
Benaddi, Hafsa
Jouhari, Mohammed
Ibrahimi, Khalil
Ben Othman, Jalel
Amhoud, El Mehdi
SENSORS, 2022, 22 (21)
[29] CONTINUOUS ACTION REINFORCEMENT LEARNING AUTOMATA Performance and Convergence
Rodriguez, Abdel
Grau, Ricardo
Nowe, Aim
ICAART 2011: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2011, : 473 - 478
[30] Adversarial Attacks on Multiagent Deep Reinforcement Learning Models in Continuous Action Space
Zhou, Ziyuan
Liu, Guanjun
Guo, Weiran
Zhou, MengChu
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (12): : 7633 - 7646

← 1 2 3 4 5 →