Boosting Deep Reinforcement Learning Agents with Generative Data Augmentation

Cited by: 0
Authors
Papagiannis, Tasos [1]
Alexandridis, Georgios [1]
Stafylopatis, Andreas [1]
Affiliations
[1] Natl Tech Univ Athens, Sch Elect & Comp Engn, Zografou Campus, Athens 15780, Greece
Source
APPLIED SCIENCES-BASEL | 2024 / Vol. 14 / Issue 01
Keywords
data augmentation; deep reinforcement learning; generative models; Arcade Learning Environment; diffusion models
DOI
10.3390/app14010330
Chinese Library Classification (CLC) number
O6 [Chemistry]
Discipline classification code
0703
Abstract
Data augmentation is a promising technique for improving exploration and convergence speed in deep reinforcement learning. In this work, we propose a data augmentation framework based on generative models that creates completely novel states and increases diversity. A diffusion model learns the distribution of the original, collected states and generates artificial states, while an additional model is trained to predict the action executed between two consecutive states. These models are combined to create synthetic data for cases of high and low immediate rewards, which are encountered less frequently during the agent's interaction with the environment. During training, the synthetic samples are mixed with the actually observed data in order to speed up agent learning. The proposed methodology is tested on the Atari 2600 framework, producing realistic and diverse synthetic data that improve training in most cases. Specifically, the agent is evaluated on three heterogeneous games and achieves a reward increase of up to 31%, although the results indicate performance variance among the different environments. The augmentation models are independent of the learning process and can be integrated into different algorithms, as well as different environments, with slight adaptations.
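The abstract states that synthetic samples are mixed with observed data during training but does not give the mixing scheme. The Python sketch below illustrates one possible way to do this, under assumptions that are not taken from the paper: the `Transition` record, the `sample_mixed_batch` helper, and the `synthetic_fraction` parameter are all hypothetical names, and the fixed per-batch ratio is an illustrative choice rather than the authors' method.

```python
import random
from typing import List, NamedTuple, Sequence


class Transition(NamedTuple):
    """Hypothetical replay record; the paper's actual data layout is not given."""
    state: tuple
    action: int
    reward: float
    next_state: tuple
    synthetic: bool  # True if produced by the generative models


def sample_mixed_batch(
    real: Sequence[Transition],
    synthetic: Sequence[Transition],
    batch_size: int,
    synthetic_fraction: float,
    rng: random.Random,
) -> List[Transition]:
    """Draw a training batch containing a fixed share of synthetic samples.

    Assumption: a constant synthetic share per batch. If fewer synthetic
    samples are available than requested, real samples fill the remainder.
    """
    if not 0.0 <= synthetic_fraction <= 1.0:
        raise ValueError("synthetic_fraction must be in [0, 1]")
    n_synthetic = min(int(round(batch_size * synthetic_fraction)), len(synthetic))
    n_real = batch_size - n_synthetic
    if n_real > len(real):
        raise ValueError("not enough real transitions to fill the batch")
    batch = rng.sample(list(real), n_real) + rng.sample(list(synthetic), n_synthetic)
    rng.shuffle(batch)  # avoid a fixed real-then-synthetic ordering
    return batch


# Toy data: 20 observed transitions and 5 synthetic high-reward transitions.
real_data = [Transition((i,), 0, 0.0, (i + 1,), False) for i in range(20)]
synthetic_data = [Transition((100 + i,), 1, 1.0, (101 + i,), True) for i in range(5)]

batch = sample_mixed_batch(real_data, synthetic_data, batch_size=8,
                           synthetic_fraction=0.25, rng=random.Random(0))
```

In a setup like this, the learner would consume `batch` exactly as it would a batch of purely observed transitions, which is consistent with the abstract's point that the augmentation models are independent of the learning algorithm.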
Pages: 22