Boosting Deep Reinforcement Learning Agents with Generative Data Augmentation

Cited by: 0
Authors
Papagiannis, Tasos [1]
Alexandridis, Georgios [1]
Stafylopatis, Andreas [1]
Affiliations
[1] Natl Tech Univ Athens, Sch Elect & Comp Engn, Zografou Campus, Athens 15780, Greece
Source
APPLIED SCIENCES-BASEL | 2024 / Volume 14 / Issue 01
Keywords
data augmentation; deep reinforcement learning; generative models; Arcade Learning Environment; diffusion models
DOI
10.3390/app14010330
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
Data augmentation is a promising technique for improving exploration and convergence speed in deep reinforcement learning. In this work, we propose a data augmentation framework based on generative models that creates completely novel states and increases data diversity. A diffusion model learns the distribution of the originally collected states and generates artificial states, while an additional model is trained to predict the action executed between two consecutive states. These models are combined to create synthetic data for cases of high and low immediate reward, which are encountered less frequently during the agent's interaction with the environment. During training, the synthetic samples are mixed with the actually observed data in order to speed up agent learning. The proposed methodology is tested on the Atari 2600 framework, producing realistic and diverse synthetic data that improve training in most cases. Specifically, the agent is evaluated on three heterogeneous games, achieving a reward increase of up to 31%, although the results indicate performance variance among the different environments. The augmentation models are independent of the learning process and can be integrated into different algorithms, as well as different environments, with slight adaptations.
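The mixing step described in the abstract, where synthetic samples are combined with actually observed data during training, might look roughly like the sketch below. It is an illustration under stated assumptions, not the authors' implementation: the `Transition` record, the `sample_mixed_batch` function, and the `synthetic_fraction` parameter are hypothetical names, and the abstract does not state what mixing ratio or sampling scheme the paper uses. The diffusion model and the action-prediction model are not implemented here; the synthetic transitions are assumed to have been produced by them already.

```python
import random
from typing import List, NamedTuple, Sequence


class Transition(NamedTuple):
    """One (state, action, reward, next_state) sample; hypothetical layout."""
    state: Sequence[float]
    action: int
    reward: float
    next_state: Sequence[float]
    synthetic: bool  # True if produced by the generative models


def sample_mixed_batch(
    real: Sequence[Transition],
    synthetic: Sequence[Transition],
    batch_size: int,
    synthetic_fraction: float,
    rng: random.Random,
) -> List[Transition]:
    """Draw a training batch that mixes observed and synthetic transitions.

    synthetic_fraction is an assumed knob (not taken from the paper): the
    target share of the batch that comes from the synthetic pool.
    """
    if not 0.0 <= synthetic_fraction <= 1.0:
        raise ValueError("synthetic_fraction must be in [0, 1]")

    # Never request more synthetic samples than the pool holds.
    n_synth = min(int(round(batch_size * synthetic_fraction)), len(synthetic))
    n_real = batch_size - n_synth
    if n_real > len(real):
        raise ValueError("not enough real transitions to fill the batch")

    # Sample without replacement from each pool, then shuffle so the
    # learner does not see the two sources in a fixed order.
    batch = rng.sample(list(real), n_real) + rng.sample(list(synthetic), n_synth)
    rng.shuffle(batch)
    return batch
```

Keeping the synthetic share as an explicit parameter makes it easy to compare training with and without augmentation by setting it to zero, which matches the abstract's point that the augmentation models are independent of the learning algorithm.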
Pages: 22