Learning from Pixels with Expert Observations

被引:0
|
作者
Minh-Huy Hoang [1 ]
Long Dinh [2 ]
Hai Nguyen [3 ]
机构
[1] Univ Sci Ho Chi Minh City, Ho Chi Minh City, Vietnam
[2] Hanoi Univ Sci & Technol, Hanoi, Vietnam
[3] Northeastern Univ, Boston, MA 02115 USA
关键词
D O I
10.1109/IROS55552.2023.10342043
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In reinforcement learning (RL), sparse rewards can present a significant challenge. Fortunately, expert actions can be utilized to overcome this issue. However, acquiring explicit expert actions can be costly, and expert observations are often more readily available. This paper presents a new approach that uses expert observations for learning in robot manipulation tasks with sparse rewards from pixel observations. Specifically, our technique involves using expert observations as intermediate visual goals for a goal-conditioned RL agent, enabling it to complete a task by successively reaching a series of goals. We demonstrate the efficacy of our method in five challenging block construction tasks in simulation and show that when combined with two state-of-the-art agents, our approach can significantly improve their performance while requiring 4-20 times fewer expert actions during training. Moreover, our method is also superior to a hierarchical baseline.
引用
收藏
页码:1200 / 1206
页数:7
相关论文
共 50 条
  • [31] Shapes From Pixels
    Fatemi, Mitra
    Amini, Arash
    Baboulaz, Loic
    Vetterli, Martin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (03) : 1193 - 1206
  • [32] From paper to pixels
    van den Berg, Christelle
    Groot, Luc
    GIM INTERNATIONAL-THE WORLDWIDE MAGAZINE FOR GEOMATICS, 2023, 37 (06): : 10 - 13
  • [33] From pixels to physics
    de Groot, N
    NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 2003, 501 (01): : 229 - 232
  • [34] FROM PIXELS TO SEQUENCES
    ROBSON, S
    PHOTOGRAMMETRIC RECORD, 1995, 15 (86): : 327 - 330
  • [35] FROM PIXELS TO MICRODOTS
    TAZELAAR, JM
    BYTE, 1984, 9 (10): : 289 - &
  • [36] From print to pixels
    Edit Publ, Suppl (22):
  • [37] FROM PICASSO TO PIXELS
    MORRISON, GH
    ANALYTICAL CHEMISTRY, 1981, 53 (01) : 1 - 1
  • [38] Beyond pixels: Learning from multimodal hyperspectral superpixels for land cover classification
    Hong DanFeng
    Wu Xin
    Yao Jing
    Zhu XiaoXiang
    SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2022, 65 (04) : 802 - 808
  • [39] Combined data augmentation framework for generalizing deep reinforcement learning from pixels
    Xiong, Xi
    Shen, Chun
    Wu, Junhong
    Lu, Shuai
    Zhang, Xiaodan
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 264
  • [40] Beyond pixels: Learning from multimodal hyperspectral superpixels for land cover classification
    DanFeng Hong
    Xin Wu
    Jing Yao
    XiaoXiang Zhu
    Science China Technological Sciences, 2022, 65 : 802 - 808