Reinforcement learning in sparse-reward environments with hindsight policy gradients

被引:0
|
作者
Queen Mary University of London, London [1 ]
E1 4FZ, United Kingdom
不详 [2 ]
100-0004, Japan
不详 [3 ]
29056-264, Brazil
不详 [4 ]
6962, Switzerland
不详 [5 ]
6900, Switzerland
不详 [6 ]
6928, Switzerland
不详 [7 ]
6900, Switzerland
机构
来源
Neural Comp. | / 6卷 / 1498-1553期
关键词
Number:; -; Acronym:; IBM; Sponsor: International Business Machines Corporation; 742870; ERC; Sponsor: European Research Council; 200021_165675/1; SNF; Sponsor: Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung; 88881.133206/2016-01; CAPES; Sponsor: Coordenação de Aperfeiçoamento de Pessoal de Nível Superior;
D O I
暂无
中图分类号
学科分类号
摘要
Reinforcement learning
引用
收藏
相关论文
共 50 条
  • [31] Model-free Policy Learning with Reward Gradients
    Lan, Qingfong
    Tosatto, Samuele
    Farrahi, Homayoon
    Mahmood, A. Rupam
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [32] Efficient hindsight reinforcement learning using demonstrations for robotic tasks with sparse rewards
    Zuo, Guoyu
    Zhao, Qishen
    Lu, Jiahao
    Li, Jiangeng
    [J]. INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (01):
  • [33] Expansive Latent Planning for Sparse Reward Offline Reinforcement Learning
    Gieselmann, Robert
    Pokorny, Florian T.
    [J]. CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [34] A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning
    Fu, Qingxu
    Qiu, Tenghai
    Pu, Zhiqiang
    Yi, Jianqiang
    Yuan, Wanmai
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [35] Planning with Q-Values in Sparse Reward Reinforcement Learning
    Lei, Hejun
    Weng, Paul
    Rojas, Juan
    Guan, Yisheng
    [J]. INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2022), PT I, 2022, 13455 : 603 - 614
  • [36] Generalized Hindsight for Reinforcement Learning
    Li, Alexander C.
    Pinto, Lerrel
    Abbeel, Pieter
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [37] Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks
    Xu, Pei
    Zhang, Junge
    Yin, Qiyue
    Yu, Chao
    Yang, Yaodong
    Huang, Kaiqi
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10, 2023, : 11717 - 11725
  • [38] Structural and Compact Latent Representation Learning on Sparse Reward Environments
    Bang-Giang Le
    Thi-Linh Hoang
    Hai-Dang Kieu
    Viet-Cuong Ta
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2023, PT II, 2023, 13996 : 40 - 51
  • [39] HMRL: Hyper-Meta Learning for Sparse Reward Reinforcement Learning Problem
    Hua, Yun
    Wang, Xiangfeng
    Jin, Bo
    Li, Wenhao
    Yan, Junchi
    He, Xiaofeng
    Zha, Hongyuan
    [J]. KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 637 - 645
  • [40] Cliff Diving: Exploring Reward Surfaces in Reinforcement Learning Environments
    Sullivan, Ryan
    Terry, J. K.
    Black, Benjamin
    Dickerson, John P.
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,