3D Estimation of Visual Focus of Attention

被引:0
|
作者
Antunes Simoes, Carlos Miguel [1 ]
Moreno, Plinio [2 ]
机构
[1] Inst Super Tecn, Lisbon, Portugal
[2] Inst Super Tecn, Inst Sistemas & Robot, Lisbon, Portugal
关键词
VFOA; Eye Gaze; Head Pose; Object Detection; Human-Robot Interaction;
D O I
10.1109/ICARSC55462.2022.9784797
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Humanoid and social robots may provide valuable resources to society in the most diverse and complex activities and challenges, thanks to their increasing mechanical and decisionmaking abilities. However, robots must comprehend and acquire information about their surroundings for proper interaction with humans. The VFOA can be used as the primary conversational cue. To tackle these challenges, we develop a novel approach that estimates and tracks the VFOA. The proposed model stems from the consideration that the eye gaze and head pose carry information about actions and interactions. The proposed formulation leads to a 3D algorithm that considers: (i) A bounding box of every object in the field of view of the robot's camera, and (ii) a ray casting algorithm that considers head and gaze directions. A Kalman filter performs the tracking of the gaze. Finally, the VFOA algorithm estimates the object of attention based on a weighted sum of gaze and head pose information. We study the parameters of 3D VFOA algorithm, running simulated scenarios for a selection of the most adequate parameters. The novel approach is validated, tested and benchmarked on the public MPI Sintel dataset containing animated real-world interactions.
引用
收藏
页码:211 / 217
页数:7
相关论文
共 50 条
  • [1] Visual Focus of Attention Estimation in 3D Scene with an Arbitrary Number of Targets
    Siegfried, Remy
    Odobez, Jean-Marc
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 3147 - 3155
  • [2] Visual Attention for Rendered 3D Shapes
    Lavoue, Guillaume
    Cordier, Frederic
    Seo, Hyewon
    Larabi, Mohamed-Chaker
    [J]. COMPUTER GRAPHICS FORUM, 2018, 37 (02) : 191 - 203
  • [3] PRECUING OF VISUAL-ATTENTION IN A 3D DISPLAY
    ZIMBA, L
    BRITO, CF
    GULLEY, J
    THOMAS, K
    [J]. INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 1993, 34 (04) : 1232 - 1232
  • [4] Stereoscopic Visual Attention Model for 3D Video
    Zhang, Yun
    Jiang, Gangyi
    Yu, Mei
    Chen, Ken
    [J]. ADVANCES IN MULTIMEDIA MODELING, PROCEEDINGS, 2010, 5916 : 314 - 324
  • [5] 3D Mapping of Visual Attention for Smart Rehabilitation
    McMurrough, Christopher D.
    Lioulemes, Alexandros
    Phan, Scott
    Makedon, Fillia
    [J]. 8TH ACM INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS (PETRA 2015), 2015,
  • [6] JOINT ESTIMATION OF HEAD POSE AND VISUAL FOCUS OF ATTENTION
    Huang, Yingning
    Duan, Dingrui
    Cui, Jinshi
    Davoine, Franck
    Wang, Li
    Zha, Hongbin
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 3332 - 3336
  • [7] Visual Focus of Attention Estimation With Unsupervised Incremental Learning
    Duffner, Stefan
    Garcia, Christophe
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (12) : 2264 - 2272
  • [8] Survey of recent advances in 3D visual attention for robotics
    Potapova, Ekaterina
    Zillich, Michael
    Vincze, Markus
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2017, 36 (11): : 1159 - 1176
  • [9] Learning Stereoscopic Visual Attention Model for 3D Video
    Huang, Gang-jian
    Du, Xin
    Zhu, Yun-fang
    [J]. 2015 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATIONS (CSA), 2015, : 6 - 9
  • [10] Wearable Gaze Trackers: Mapping Visual Attention in 3D
    Jensen, Rasmus R.
    Stets, Jonathan D.
    Suurmets, Seidi
    Clement, Jesper
    Aanaes, Henrik
    [J]. IMAGE ANALYSIS, SCIA 2017, PT I, 2017, 10269 : 66 - 76