Critic Guided Segmentation of Rewarding Objects in First-Person Views

Cited by: 3

Authors:

Melnik, Andrew [1]

Harter, Augustin [1]

Limberg, Christian [1]

Rana, Krishan [2]

Sunderhauf, Niko [2]

Ritter, Helge [1]

Affiliations:

[1] Univ Bielefeld, CITEC, Bielefeld, Germany

[2] Queensland Univ Technol QUT, Ctr Robot, Brisbane, Qld, Australia

Source:

ADVANCES IN ARTIFICIAL INTELLIGENCE, KI 2021 | 2021 / Vol. 12873

Keywords:

Imitation learning; Reinforcement learning; Image segmentation; Reward-centric objects; First person point of view; MineRL; Minecraft;

DOI:

10.1007/978-3-030-87626-5_25

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This work discusses a learning approach to mask rewarding objects in images using sparse reward signals from an imitation learning dataset. For that we train an Hourglass network using only feedback from a critic model. The Hourglass network learns to produce a mask to decrease the critic's score of a high score image and increase the critic's score of a low score image by swapping the masked areas between these two images. We trained the model on an imitation learning dataset from the NeurIPS 2020 MineRL Competition Track, where our model learned to mask rewarding objects in a complex interactive 3D environment with a sparse reward signal. This approach was part of the 1st place winning solution in this competition. Video demonstration and code: https://rebrand.ly/critic-guided-segmentation.
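The mask-swapping idea in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the toy critic (mean brightness), and the toy 8x8 images are all assumptions made for illustration, and the mask is fixed by hand rather than produced by a trained Hourglass network. The sketch only shows how exchanging masked regions between a high-score and a low-score image gives a signal, computed purely from a critic, that favours masks covering the rewarding object.

```python
import numpy as np

def swap_masked_regions(img_high, img_low, mask):
    """Exchange the masked pixels between two images of equal shape.

    mask holds values in [0, 1]; 1 means "take this pixel from the other image".
    """
    swapped_high = (1.0 - mask) * img_high + mask * img_low
    swapped_low = (1.0 - mask) * img_low + mask * img_high
    return swapped_high, swapped_low

def swap_objective(critic, img_high, img_low, mask):
    """Hypothetical objective (lower is better).

    A useful mask lowers the critic's score of the high-score image
    and raises the critic's score of the low-score image after the swap.
    """
    swapped_high, swapped_low = swap_masked_regions(img_high, img_low, mask)
    return critic(swapped_high) - critic(swapped_low)

def toy_critic(img):
    # Assumed stand-in for a learned critic: score = mean brightness.
    return float(img.mean())

# Toy data: the "rewarding object" is a bright 2x2 patch in the high-score image.
img_low = np.zeros((8, 8))
img_high = np.zeros((8, 8))
img_high[2:4, 2:4] = 1.0

object_mask = np.zeros((8, 8))
object_mask[2:4, 2:4] = 1.0      # covers the object
empty_mask = np.zeros((8, 8))    # covers nothing

loss_object = swap_objective(toy_critic, img_high, img_low, object_mask)
loss_empty = swap_objective(toy_critic, img_high, img_low, empty_mask)
print(loss_object < loss_empty)
```

In this toy setting a mask covering the whole image would score as well as the object mask, so a practical training setup would need something that discourages overly large masks; the abstract does not say how that is handled.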


Pages: 338-348

Page count: 11

