Multimodal Across Domains Gaze Target Detection

被引:2
|
作者
Tonini, Francesco [1 ]
Beyan, Cigdem [1 ]
Ricci, Elisa [1 ,2 ]
机构
[1] Univ Trento, Dept Informat Engn & Comp Sci, Trento, Italy
[2] Fdn Bruno Kessler, Deep Visual Learning Res Grp, Trento, Italy
关键词
Gaze target detection; gaze following; domain adaptation; RGB image; depth map; multimodal data;
D O I
10.1145/3536221.3556624
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the gaze target detection problem in single images captured from the third-person perspective. We present a multimodal deep architecture to infer where a person in a scene is looking. This spatial model is trained on the head images of the person-of-interest, scene and depth maps representing rich context information. Our model, unlike several prior art, do not require supervision of the gaze angles, do not rely on head orientation information and/or location of the eyes of person-of-interest. Extensive experiments demonstrate the stronger performance of our method on multiple benchmark datasets. We also investigated several variations of our method by altering joint-learning of multimodal data. Some variations outperform a few prior art as well. First time in this paper, we inspect domain adaptation for gaze target detection, and we empower our multimodal network to efectively handle the domain gap across datasets. The code of the proposed method is available at https://github.com/francescotonini/multimodal- across-domains-gaze-target-detection.
引用
收藏
页码:420 / 431
页数:12
相关论文
共 50 条
  • [21] End-to-End Human-Gaze-Target Detection with Transformers
    Tu, Danyang
    Min, Xiongkuo
    Duan, Huiyu
    Guo, Guodong
    Zhai, Guangtao
    Shen, Wei
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2192 - 2200
  • [22] Everyday Eye Contact Detection Using Unsupervised Gaze Target Discovery
    Zhang, Xucong
    Sugano, Yusuke
    Bulling, Andreas
    [J]. UIST'17: PROCEEDINGS OF THE 30TH ANNUAL ACM SYMPOSIUM ON USER INTERFACE SOFTWARE AND TECHNOLOGY, 2017, : 193 - 203
  • [23] Multimodal Gaze-based Interaction
    Pfeiffer, Thies
    Wachsmuth, Ipke
    [J]. AT-AUTOMATISIERUNGSTECHNIK, 2013, 61 (11) : 770 - 776
  • [24] Multimodal Depression Detection: Fusion Analysis of Paralinguistic, Head Pose and Eye Gaze Behaviors
    Alghowinem, Sharifa
    Goecke, Roland
    Wagner, Michael
    Epps, Julien
    Hyett, Matthew
    Parker, Gordon
    Breakspear, Michael
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2018, 9 (04) : 478 - 490
  • [25] Multimodal Target Detection by Sparse Coding: Application to Paint Loss Detection in Paintings
    Huang, Shaoguang
    Cornelis, Bruno
    Devolder, Bart
    Martens, Maximiliaan
    Pizurica, Aleksandra
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 7681 - 7696
  • [26] Multimodal Interaction with Gaze and Controller Gesture
    Chia, Wen Han
    Cai, Yiyu
    Ho, Andrew
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY ADJUNCT (ISMAR-ADJUNCT 2022), 2022, : 518 - 523
  • [27] Multimodal coordination of vocal and gaze behavior in mother-infant dyads across the first year of life
    Northrup, Jessie B.
    Iverson, Jana M.
    [J]. INFANCY, 2020, 25 (06) : 952 - 972
  • [28] Multimodal Gaze Interaction for Creative Design
    Creed, Chris
    Frutos-Pascual, Maite
    Williams, Ian
    [J]. PROCEEDINGS OF THE 2020 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI'20), 2020,
  • [29] Infrared Target Detection Algorithm Based on Pseudo Multimodal Images
    An Hao-nan
    Zhao Ming
    Pan Sheng-da
    Lin Chang-qing
    [J]. ACTA PHOTONICA SINICA, 2020, 49 (08)
  • [30] Transfer Learning from Minimal Target Data by Mapping across Relational Domains
    Mihalkova, Lilyana
    Mooney, Raymond J.
    [J]. 21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, 2009, : 1163 - 1168