Critic Guided Segmentation of Rewarding Objects in First-Person Views

被引:3
|
作者
Melnik, Andrew [1 ]
Harter, Augustin [1 ]
Limberg, Christian [1 ]
Rana, Krishan [2 ]
Sunderhauf, Niko [2 ]
Ritter, Helge [1 ]
机构
[1] Univ Bielefeld, CITEC, Bielefeld, Germany
[2] Queensland Univ Technol QUT, Ctr Robot, Brisbane, Qld, Australia
关键词
Imitation learning; Reinforcement learning; Image segmentation; Reward-centric objects; First person point of view; MineRL; Minecraft;
D O I
10.1007/978-3-030-87626-5_25
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work discusses a learning approach to mask rewarding objects in images using sparse reward signals from an imitation learning dataset. For that we train an Hourglass network using only feedback from a critic model. The Hourglass network learns to produce a mask to decrease the critic's score of a high score image and increase the critic's score of a low score image by swapping the masked areas between these two images. We trained the model on an imitation learning dataset from the NeurIPS 2020 MineRL Competition Track, where our model learned to mask rewarding objects in a complex interactive 3D environment with a sparse reward signal. This approach was part of the 1st place winning solution in this competition. Video demonstration and code: https://rebrand.ly/critic-guided-segmentation.
引用
收藏
页码:338 / 348
页数:11
相关论文
共 50 条
  • [1] A PREDICTOR OF MOVING OBJECTS FOR FIRST-PERSON VISION
    Sanchez-Matilla, Ricardo
    Cavallaro, Andrea
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 2189 - 2193
  • [2] Head Pose Estimation in First-Person Camera Views
    Alletto, Stefano
    Serra, Giuseppe
    Calderara, Simone
    Cucchiara, Rita
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 4188 - 4193
  • [3] FIRST-PERSON AND THIRD-PERSON VIEWS IN ARABIC PHILOSOPHY OF MIND
    Benevich, Fedor
    RECHERCHES DE THEOLOGIE ET PHILOSOPHIE MEDIEVALES, 2023, 90 (01): : 1 - 47
  • [4] Dense Motion Segmentation for First-Person Activity Recognition
    Zhan, Kai
    Guizilini, Vitor
    Ramos, Fabio
    2014 13TH INTERNATIONAL CONFERENCE ON CONTROL AUTOMATION ROBOTICS & VISION (ICARCV), 2014, : 123 - 128
  • [5] Event segmentation during first-person continuous events
    Magliano, Joseph P.
    Radvansky, Gabriel A.
    Forsythe, J. Christopher
    Copeland, David E.
    JOURNAL OF COGNITIVE PSYCHOLOGY, 2014, 26 (06) : 649 - 661
  • [6] Linking global top-down views to first-person views in the brain
    Xing, Jinwei
    Chrastil, Elizabeth R.
    Nitz, Douglas A.
    Krichmar, Jeffrey L.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2022, 119 (45)
  • [7] Detecting Activities of Daily Living in First-person Camera Views
    Pirsiavash, Hamed
    Ramanan, Deva
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 2847 - 2854
  • [8] Unsupervised Learning of Important Objects from First-Person Videos
    Bertasius, Gedas
    Park, Hyun Soo
    Yu, Stella X.
    Shi, Jianbo
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1974 - 1982
  • [9] Discovering Objects of Joint Attention via First-Person Sensing
    Kera, Hiroshi
    Yonetani, Ryo
    Higuchi, Keita
    Sato, Yoichi
    PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, : 361 - 369
  • [10] Temporal Segmentation and Activity Classification from First-person Sensing
    Spriggs, Ekaterina H.
    De La Torre, Fernando
    Hebert, Martial
    2009 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPR WORKSHOPS 2009), VOLS 1 AND 2, 2009, : 17 - 24