Dual Attention Guided Gaze Target Detection in the Wild

被引:39
|
作者
Fang, Yi [1 ]
Tang, Jiapeng [1 ]
Shen, Wang [1 ]
Shen, Wei [2 ]
Gu, Xiao [1 ]
Song, Li [1 ]
Zhai, Guangtao [1 ]
机构
[1] Shanghai Jiao Tong Univ, Inst Image Commun & Network Engn, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR46437.2021.01123
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Gaze target detection aims to infer where each person in a scene is looking. Existing works focus on 2D gaze and 2D saliency, but fail to exploit 3D contexts. In this work, we propose a three-stage method to simulate the human gaze inference behavior in 3D space. In the first stage, we introduce a coarse-to-fine strategy to robustly estimate a 3D gaze orientation from the head. The predicted gaze is decomposed into a planar gaze on the image plane and a depth-channel gaze. In the second stage, we develop a Dual Attention Module (DAM), which takes the planar gaze to produce the filed of view and masks interfering objects regulated by depth information according to the depth-channel gaze. In the third stage, we use the generated dual attention as guidance to perform two sub-tasks: (1) identifying whether the gaze target is inside or out of the image; (2) locating the target if inside. Extensive experiments demonstrate that our approach performs favorably against state-of-the-art methods on GazeFollow and VideoAttentionTarget datasets.
引用
收藏
页码:11385 / 11394
页数:10
相关论文
共 50 条
  • [1] Dual Regression-Enhanced Gaze Target Detection in the Wild
    Wang, Xinming
    Zhang, Hanlin
    Wang, Zhiyong
    Nie, Wei
    Yang, Zhihao
    Ren, Weihong
    Xu, Qiong
    Xu, Xiu
    Liu, Honghai
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (01) : 219 - 229
  • [2] Region-Guided and Dual Attention Discriminative Learning Network for Hyperspectral Target Detection
    Zhong, Jia-Ping
    Li, Yun-Song
    Xie, Wei-Ying
    Lei, Jie
    Paolo, Gamba
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (05): : 1716 - 1729
  • [3] Gaze Target Detection by Merging Human Attention and Activity Cues
    Yang, Yaokun
    Yin, Yihan
    Lu, Feng
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 6585 - 6593
  • [4] Gaze Target Detection Network Based on Attention Mechanism and Depth Prior
    Zhu, Yun
    Zhu, Dongchen
    Zhang, Guanghui
    Sun, Yanzan
    Zhang, Xiaolin
    [J]. Computer Engineering and Applications, 2024, 60 (14) : 240 - 249
  • [5] Object Detection Algorithm Guided by Dual Attention Models
    Ji Zhong
    Kong Qiankun
    Wang Jian
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (06)
  • [6] Gaze Target Estimation Inspired by Interactive Attention
    Hu, Zhengxi
    Zhao, Kunxu
    Zhou, Bohan
    Guo, Hang
    Wu, Shichao
    Yang, Yuxue
    Liu, Jingtai
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (12) : 8524 - 8536
  • [7] AgeDETR: Attention-Guided Efficient DETR for Space Target Detection
    Wang, Xiaojuan
    Xi, Bobo
    Xu, Haitao
    Zheng, Tie
    Xue, Changbin
    [J]. Remote Sensing, 2024, 16 (18)
  • [8] Guided Attention and Joint Loss for Infrared Dim Small Target Detection
    Tong, Yunfei
    Liu, Jing
    Fu, Zhiling
    Wang, Zhe
    Yang, Hai
    Niu, Saisai
    Tan, Qinyan
    [J]. IEEE Transactions on Geoscience and Remote Sensing, 2024, 62
  • [9] Learning Gaze Transition for Gaze Target Detection in Video
    Yang, Xingming
    Shi, Junbiao
    Li, Ziqiang
    Wu, Kewei
    Xie, Zhao
    [J]. Computer Engineering and Applications, 2024, 60 (20) : 293 - 301
  • [10] Dual-attention guided network for facial action unit detection
    Song, Wenyu
    Shi, Shuze
    Wu, Yuxuan
    An, Gaoyun
    [J]. IET IMAGE PROCESSING, 2022, 16 (08) : 2157 - 2170