Learning from Pixels with Expert Observations

被引:0
|
作者
Minh-Huy Hoang [1 ]
Long Dinh [2 ]
Hai Nguyen [3 ]
机构
[1] Univ Sci Ho Chi Minh City, Ho Chi Minh City, Vietnam
[2] Hanoi Univ Sci & Technol, Hanoi, Vietnam
[3] Northeastern Univ, Boston, MA 02115 USA
关键词
D O I
10.1109/IROS55552.2023.10342043
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In reinforcement learning (RL), sparse rewards can present a significant challenge. Fortunately, expert actions can be utilized to overcome this issue. However, acquiring explicit expert actions can be costly, and expert observations are often more readily available. This paper presents a new approach that uses expert observations for learning in robot manipulation tasks with sparse rewards from pixel observations. Specifically, our technique involves using expert observations as intermediate visual goals for a goal-conditioned RL agent, enabling it to complete a task by successively reaching a series of goals. We demonstrate the efficacy of our method in five challenging block construction tasks in simulation and show that when combined with two state-of-the-art agents, our approach can significantly improve their performance while requiring 4-20 times fewer expert actions during training. Moreover, our method is also superior to a hierarchical baseline.
引用
收藏
页码:1200 / 1206
页数:7
相关论文
共 50 条
  • [21] Learning from dependent observations
    Steinwart, Ingo
    Hush, Don
    Scovel, Clint
    JOURNAL OF MULTIVARIATE ANALYSIS, 2009, 100 (01) : 175 - 194
  • [22] Learning from multimodal observations
    Roy, D
    2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 579 - 582
  • [23] Learning from Partial Observations
    Michael, Loizos
    20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 968 - 974
  • [24] Learning from compressed observations
    Raginsky, Maxim
    2007 IEEE INFORMATION THEORY WORKSHOP, VOLS 1 AND 2, 2007, : 420 - 425
  • [25] Learning from Aggregate Observations
    Zhang, Yivan
    Charoenphakdee, Nontawat
    Wu, Zhenguo
    Sugiyama, Masashi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [26] Posthuman learning: AI from novice to expert?
    Hasse, Cathrine
    AI & SOCIETY, 2019, 34 (02) : 355 - 364
  • [27] Learning from expert hypotheses and training examples
    Kaneda, S
    Almuallim, H
    Akiba, Y
    Ishii, M
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1997, E80D (12) : 1205 - 1214
  • [28] Posthuman learning: AI from novice to expert?
    Cathrine Hasse
    AI & SOCIETY, 2019, 34 : 355 - 364
  • [29] Learning From Toys: Industry Expert Comment
    Hinrichs, Merle A.
    THUNDERBIRD INTERNATIONAL BUSINESS REVIEW, 2009, 51 (01) : 16 - 17
  • [30] Dementia: learning from the expert in our care
    Pearce, Shirley
    BRITISH JOURNAL OF OCCUPATIONAL THERAPY, 2024, 87 (01) : 24 - 25