Learning from Pixels with Expert Observations

Authors
Minh-Huy Hoang [1]
Long Dinh [2]
Hai Nguyen [3]
Affiliations
[1] Univ Sci Ho Chi Minh City, Ho Chi Minh City, Vietnam
[2] Hanoi Univ Sci & Technol, Hanoi, Vietnam
[3] Northeastern Univ, Boston, MA 02115 USA
DOI
10.1109/IROS55552.2023.10342043
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In reinforcement learning (RL), sparse rewards can present a significant challenge. Fortunately, expert actions can be utilized to overcome this issue. However, acquiring explicit expert actions can be costly, and expert observations are often more readily available. This paper presents a new approach that uses expert observations for learning in robot manipulation tasks with sparse rewards from pixel observations. Specifically, our technique involves using expert observations as intermediate visual goals for a goal-conditioned RL agent, enabling it to complete a task by successively reaching a series of goals. We demonstrate the efficacy of our method in five challenging block construction tasks in simulation and show that when combined with two state-of-the-art agents, our approach can significantly improve their performance while requiring 4-20 times fewer expert actions during training. Moreover, our method is also superior to a hierarchical baseline.
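
The idea of reaching a series of expert observations one after another can be illustrated with a toy sketch. This is not the authors' implementation: grid positions stand in for pixel observations, and `greedy_policy`, `step`, and `follow_expert_observations` are hypothetical names introduced here. A hand-written greedy rule replaces the learned goal-conditioned agent, and the sparse reward is granted only if every intermediate goal is reached.

```python
from typing import Callable, List, Sequence, Tuple

# Toy stand-in for an image observation (assumption: a 2-D grid position).
Obs = Tuple[int, int]
Action = Tuple[int, int]


def greedy_policy(obs: Obs, goal: Obs) -> Action:
    """Hypothetical goal-conditioned policy: move one cell toward the goal.

    A learned agent would map (image, goal image) to an action instead.
    """
    dx = (goal[0] > obs[0]) - (goal[0] < obs[0])
    if dx != 0:
        return (dx, 0)
    dy = (goal[1] > obs[1]) - (goal[1] < obs[1])
    return (0, dy)


def step(obs: Obs, action: Action) -> Obs:
    """Toy deterministic environment transition."""
    return (obs[0] + action[0], obs[1] + action[1])


def follow_expert_observations(
    start: Obs,
    expert_obs: Sequence[Obs],
    policy: Callable[[Obs, Obs], Action],
    max_steps_per_goal: int = 20,
) -> Tuple[List[Obs], float]:
    """Treat each expert observation as the next intermediate goal.

    Returns the visited observations and a sparse reward: 1.0 only when
    every goal, including the last one, was reached within its step budget.
    """
    obs = start
    trajectory = [obs]
    for goal in expert_obs:
        for _ in range(max_steps_per_goal):
            if obs == goal:
                break
            obs = step(obs, policy(obs, goal))
            trajectory.append(obs)
        if obs != goal:
            return trajectory, 0.0  # gave up on this intermediate goal
    return trajectory, 1.0


if __name__ == "__main__":
    path, reward = follow_expert_observations(
        (0, 0), [(2, 0), (2, 2), (0, 2)], greedy_policy
    )
    print(path, reward)
```

Note that the loop consumes only expert observations, never expert actions, which mirrors the setting the abstract describes.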
Pages: 1200-1206
Number of pages: 7