Attention Flow: End-to-End Joint Attention Estimation

被引:0
|
作者
Sumer, Omer [1 ]
Gerjets, Peter [2 ]
Trautwein, Ulrich [1 ]
Kasneci, Enkelejda [1 ]
机构
[1] Univ Tubingen, Tubingen, Germany
[2] Leibniz Inst Wissensmedien, Tubingen, Germany
关键词
CHILDREN; MODEL;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the problem of understanding joint attention in third-person social scene videos. Joint attention is the shared gaze behaviour of two or more individuals on an object or an area of interest and has a wide range of applications such as human-computer interaction, educational assessment, treatment of patients with attention disorders, and many more. Our method, Attention Flow, learns joint attention in an end-to-end fashion by using saliency-augmented attention maps and two novel convolutional attention mechanisms that determine to select relevant features and improve joint attention localization. We compare the effect of saliency maps and attention mechanisms and report quantitative and qualitative results on the detection and localization of joint attention in the VideoCoAtt dataset, which contains complex social scenes.
引用
收藏
页码:3316 / 3325
页数:10
相关论文
共 50 条
  • [41] LCANet: End-to-End Lipreading with Cascaded Attention-CTC
    Xu, Kai
    Li, Dawei
    Cassimatis, Nick
    Wang, Xiaolong
    PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, : 548 - 555
  • [42] An End-to-End Formula Recognition Method Integrated Attention Mechanism
    Zhou, Mingle
    Cai, Ming
    Li, Gang
    Li, Min
    MATHEMATICS, 2023, 11 (01)
  • [43] End-to-End ASR with Adaptive Span Self-Attention
    Chang, Xuankai
    Subramanian, Aswin Shanmugam
    Guo, Pengcheng
    Watanabe, Shinji
    Fujita, Yuya
    Omachi, Motoi
    INTERSPEECH 2020, 2020, : 3595 - 3599
  • [44] An End-to-End Lane Detection Model with Attention and Residual Block
    Wang, Bo
    Yan, Xiaoting
    Li, Deguang
    Computational Intelligence and Neuroscience, 2022, 2022
  • [45] Dynamic DETR: End-to-End Object Detection with Dynamic Attention
    Dai, Xiyang
    Chen, Yinpeng
    Yang, Jianwei
    Zhang, Pengchuan
    Yuan, Lu
    Zhang, Lei
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 2968 - 2977
  • [46] End-to-end temporal attention extraction and human action recognition
    Hong Zhang
    Miao Xin
    Shuhang Wang
    Yifan Yang
    Lei Zhang
    Helong Wang
    Machine Vision and Applications, 2018, 29 : 1127 - 1142
  • [47] Attention Based End-to-End Network for Short Video Classification
    Zhu, Hui
    Zou, Chao
    Wang, Zhenyu
    Xu, Kai
    Huang, Zihao
    2022 18TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING, MSN, 2022, : 490 - 494
  • [48] End-to-end temporal attention extraction and human action recognition
    Zhang, Hong
    Xin, Miao
    Wang, Shuhang
    Yang, Yifan
    Zhang, Lei
    Wang, Helong
    MACHINE VISION AND APPLICATIONS, 2018, 29 (07) : 1127 - 1142
  • [49] Explaining Autonomous Driving by Learning End-to-End Visual Attention
    Cultrera, Luca
    Seidenari, Lorenzo
    Becattini, Federico
    Pala, Pietro
    Del Bimbo, Alberto
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 1389 - 1398
  • [50] Joint CTC-Attention End-to-End Speech Recognition with a Triangle Recurrent Neural Network Encoder
    Zhu T.
    Cheng C.
    Journal of Shanghai Jiaotong University (Science), 2020, 25 (01) : 70 - 75