Learning from Pixels with Expert Observations

被引:0
|
作者
Minh-Huy Hoang [1 ]
Long Dinh [2 ]
Hai Nguyen [3 ]
机构
[1] Univ Sci Ho Chi Minh City, Ho Chi Minh City, Vietnam
[2] Hanoi Univ Sci & Technol, Hanoi, Vietnam
[3] Northeastern Univ, Boston, MA 02115 USA
关键词
D O I
10.1109/IROS55552.2023.10342043
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In reinforcement learning (RL), sparse rewards can present a significant challenge. Fortunately, expert actions can be utilized to overcome this issue. However, acquiring explicit expert actions can be costly, and expert observations are often more readily available. This paper presents a new approach that uses expert observations for learning in robot manipulation tasks with sparse rewards from pixel observations. Specifically, our technique involves using expert observations as intermediate visual goals for a goal-conditioned RL agent, enabling it to complete a task by successively reaching a series of goals. We demonstrate the efficacy of our method in five challenging block construction tasks in simulation and show that when combined with two state-of-the-art agents, our approach can significantly improve their performance while requiring 4-20 times fewer expert actions during training. Moreover, our method is also superior to a hierarchical baseline.
引用
收藏
页码:1200 / 1206
页数:7
相关论文
共 50 条
  • [1] Learning goal hierarchies from structured observations and expert annotations
    Tolga Könik
    John E. Laird
    Machine Learning, 2006, 64 : 263 - 287
  • [2] Learning goal hierarchies from structured observations and expert annotations
    Koenik, Tolga
    Laird, John E.
    MACHINE LEARNING, 2006, 64 (1-3) : 263 - 287
  • [3] Learning goal hierarchies from structured observations and expert annotations
    Könik, T
    Laird, J
    INDUCTIVE LOGIC PROGRAMMING, PROCEEDINGS, 2004, 3194 : 198 - 215
  • [4] REINFORCEMENT LEARNING FROM PIXELS: WATERFLOODING OPTIMIZATION
    Miftakhov, Ruslan
    Efremov, Igor
    Al-Qasim, Abdulaziz S.
    PROCEEDINGS OF THE ASME 39TH INTERNATIONAL CONFERENCE ON OCEAN, OFFSHORE AND ARCTIC ENGINEERING, OMAE2020, VOL 11, 2020,
  • [5] Learning Latent Dynamics for Planning from Pixels
    Hafner, Danijar
    Lillicrap, Timothy
    Fischer, Ian
    Villegas, Ruben
    Ha, David
    Lee, Honglak
    Davidson, James
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [6] Learning deep dynamical models from image pixels
    Wahlstrom, Niklas
    Schon, Thomas B.
    Deisenroth, Marc Peter
    IFAC PAPERSONLINE, 2015, 48 (28): : 1059 - 1064
  • [7] From pixels to insights: Machine learning and deep learning for bioimage analysis
    Jan, Mahta
    Spangaro, Allie
    Lenartowicz, Michelle
    Usaj, Mojca Mattiazzi
    BIOESSAYS, 2024, 46 (02)
  • [8] GENERATING RULES FOR EXPERT SYSTEMS FROM OBSERVATIONS
    HO, TB
    DIDAY, E
    GETTLERSUMMA, M
    PATTERN RECOGNITION LETTERS, 1988, 7 (05) : 265 - 271
  • [9] Pacing, Pixels, and Paper: Flexibility in Learning Words from Flashcards
    Sage, Kara
    Rausch, Joseph
    Quirk, Abigail
    Halladay, Lauren
    JOURNAL OF INFORMATION TECHNOLOGY EDUCATION-RESEARCH, 2016, 15 : 431 - 456
  • [10] Expert patients: learning from HIV
    Cooke, Molly
    BMJ QUALITY & SAFETY, 2011, 20 : I67 - I68