Boosting Deep Reinforcement Learning Agents with Generative Data Augmentation

被引：0

作者：

Papagiannis, Tasos ^{[1
]}

Alexandridis, Georgios ^{[1
]}

Stafylopatis, Andreas ^{[1
]}

机构：

[1] Natl Tech Univ Athens, Sch Elect & Comp Engn, Zografou Campus, Athens 15780, Greece

来源：

APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 01期

关键词：

data augmentation; deep reinforcement learning; generative models; Arcade Learning Environment; diffusion models;

D O I：

10.3390/app14010330

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

Data augmentation is a promising technique in improving exploration and convergence speed in deep reinforcement learning methodologies. In this work, we propose a data augmentation framework based on generative models for creating completely novel states and increasing diversity. For this purpose, a diffusion model is used to generate artificial states (learning the distribution of original, collected states), while an additional model is trained to predict the action executed between two consecutive states. These models are combined to create synthetic data for cases of high and low immediate rewards, which are encountered less frequently during the agent's interaction with the environment. During the training process, the synthetic samples are mixed with the actually observed data in order to speed up agent learning. The proposed methodology is tested on the Atari 2600 framework, producing realistic and diverse synthetic data which improve training in most cases. Specifically, the agent is evaluated on three heterogeneous games, achieving a reward increase of up to 31%, although the results indicate performance variance among the different environments. The augmentation models are independent of the learning process and can be integrated to different algorithms, as well as different environments, with slight adaptations.

引用

页数：22

共 50 条

[1] Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning
Ko, Byungchan
Ok, Jungseul
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
[2] Counterfactual state explanations for reinforcement learning agents via generative deep learning
Olson, Matthew L.
Khanna, Roli
Neal, Lawrence
Li, Fuxin
Wong, Weng-Keen
ARTIFICIAL INTELLIGENCE, 2021, 295
[3] Deep Generative Models for Data Synthesis and Augmentation in Machine Learning
Adavala, Kiran Mayee
Vhatkar, Sangeeta
Ruprah, Taranpreet Singh
Bhatia, Sukhwinder Kaur
Kumar, Vipin
Sharma, Dharmendra
Praveen, B. Shyam
JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (03) : 1242 - 1249
[4] Synthetic Data Augmentation for Deep Reinforcement Learning in Financial Trading
Liu, Chunli
Ventre, Carmine
Polukarov, Maria
3RD ACM INTERNATIONAL CONFERENCE ON AI IN FINANCE, ICAIF 2022, 2022, : 343 - 351
[5] Data Augmentation for the Femoral Head Using Generative Deep Learning Models
Won, Joon Hee
Goh, Tae Sik
Lee, Jung Sub
Lim, Hee Chang
TRANSACTIONS OF THE KOREAN SOCIETY OF MECHANICAL ENGINEERS B, 2025, 49 (02) : 109 - 119
[6] Semantic Data Augmentation for Deep Learning Testing using Generative AI
Missaoui, Sondess
Gerasimou, Simos
Matragkas, Nicholas
2023 38TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE, 2023, : 1694 - 1698
[7] Boosting Offline Reinforcement Learning with Residual Generative Modeling
Wei, Hua
Ye, Deheng
Liu, Zhao
Wu, Hao
Yuan, Bo
Fu, Qiang
Yang, Wei
Li, Zhenhui
PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 3574 - 3580
[8] Automatic Data Augmentation by Upper Confidence Bounds for Deep Reinforcement Learning
Gil, Yoonhee
Baek, Jongchan
Park, Jonghyuk
Han, Soohee
2021 21ST INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2021), 2021, : 1199 - 1203
[9] Invariant Transform Experience Replay: Data Augmentation for Deep Reinforcement Learning
Lin, Yijiong
Huang, Jiancong
Zimmer, Matthieu
Guan, Yisheng
Rojas, Juan
Weng, Paul
IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (04) : 6615 - 6622
[10] Combined data augmentation framework for generalizing deep reinforcement learning from pixels
Xiong, Xi
Shen, Chun
Wu, Junhong
Lu, Shuai
Zhang, Xiaodan
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 264

← 1 2 3 4 5 →