Efficient Scheduling of Data Augmentation for Deep Reinforcement Learning

被引:0
|
作者
Ko, Byungchan [1 ]
Ok, Jungseul [2 ]
机构
[1] NALBI, Seoul, South Korea
[2] POSTECH, GSAI, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
GO;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In deep reinforcement learning (RL), data augmentation is widely considered as a tool to induce a set of useful priors about semantic consistency and to improve sample efficiency and generalization performance. However, even when the prior is useful for generalization, distilling it to RL agent often interferes with RL training and degenerates sample efficiency. Meanwhile, the agent is forgetful of the prior due to the non-stationary nature of RL. These observations suggest two extreme schedules of distillation: (i) over the entire training; or (ii) only at the end. Hence, we devise a stand-alone network distillation method to inject the consistency prior at any time (even after RL), and a simple yet efficient framework to automatically schedule the distillation. Specifically, the proposed framework first focuses on mastering train environments regardless of generalization by adaptively deciding which or no augmentation to be used for the training. After this, we add the distillation to extract the remaining benefits for generalization from all the augmentations, which requires no additional new samples. In our experiments, we demonstrate the utility of the proposed framework, in particular, that considers postponing the augmentation to the end of RL training. https://github.com/kbc6723/es-da
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Energy efficient task scheduling based on deep reinforcement learning in cloud environment: A specialized review
    Hou, Huanhuan
    Agos Jawaddi, Siti Nuraishah
    Ismail, Azlan
    [J]. Future Generation Computer Systems, 2024, 151 : 214 - 231
  • [42] Efficient Pump Scheduling for Large-Scale Multiproduct Pipelines Using Deep Reinforcement Learning
    Shao, Kai
    Wang, Xinmin
    Liu, Min
    Xu, Aobo
    Jian, Ling
    [J]. INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2024,
  • [43] H-SwarmLoc: Efficient Scheduling for Localization of Heterogeneous MAV Swarm with Deep Reinforcement Learning
    Wang, Haoyang
    Chen, Xuecheng
    Cheng, Yuhan
    Wu, Chenye
    Dang, Fan
    Chen, Xinlei
    [J]. PROCEEDINGS OF THE TWENTIETH ACM CONFERENCE ON EMBEDDED NETWORKED SENSOR SYSTEMS, SENSYS 2022, 2022, : 1148 - 1154
  • [44] A framework for scheduling in cloud manufacturing with deep reinforcement learning
    Liu, Yongkui
    Zhang, Lin
    Wang, Lihui
    Xiao, Yingying
    Xu, Xun
    Wang, Mei
    [J]. 2019 IEEE 17TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2019, : 1775 - 1780
  • [45] Adaptive DAG Tasks Scheduling with Deep Reinforcement Learning
    Wu, Qing
    Wu, Zhiwei
    Zhuang, Yuehui
    Cheng, Yuxia
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2018, PT II, 2018, 11335 : 477 - 490
  • [46] Cellular Network Traffic Scheduling with Deep Reinforcement Learning
    Chinchali, Sandeep
    Hu, Pan
    Chu, Tianshu
    Sharma, Manu
    Bansal, Manu
    Misra, Rakesh
    Pavone, Marco
    Katti, Sachin
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 766 - 774
  • [47] A deep reinforcement learning approach for chemical production scheduling
    Hubbs, Christian D.
    Li, Can
    Sahinidis, Nikolaos, V
    Grossmann, Ignacio E.
    Wassick, John M.
    [J]. COMPUTERS & CHEMICAL ENGINEERING, 2020, 141
  • [48] Optimization of global production scheduling with deep reinforcement learning
    Waschneck, Bernd
    Reichstaller, Andre
    Belzner, Lenz
    Altenmueller, Thomas
    Bauernhansl, Thomas
    Knapp, Alexander
    Kyek, Andreas
    [J]. 51ST CIRP CONFERENCE ON MANUFACTURING SYSTEMS, 2018, 72 : 1264 - 1269
  • [49] Beam Hopping Scheduling Based on Deep Reinforcement Learning
    Deng, Huimin
    Ying, Kai
    Gui, Lin
    [J]. 2023 INTERNATIONAL CONFERENCE ON FUTURE COMMUNICATIONS AND NETWORKS, FCN, 2023,
  • [50] Decentralized Scheduling for Cooperative Localization With Deep Reinforcement Learning
    Peng, Bile
    Seco-Granados, Gonzalo
    Steinmetz, Erik
    Frohle, Markus
    Wymeersch, Henk
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (05) : 4295 - 4305