Asymmetry-aware bilinear pooling in multi-modal data for head pose estimation

Cited by: 2
Authors
Chen, Jiazhong [1]
Li, Qingqing [1]
Ren, Dakai [2]
Cao, Hua [1]
Ling, Hefei [1]
Affiliations
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Head pose estimation; Asymmetry-aware; Bilinear pooling; ATTENTION; REPRESENTATION; NETWORK;
DOI
10.1016/j.image.2022.116895
Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification codes
0808 ; 0809 ;
Abstract
The head pose in the roll and yaw directions is determined by the asymmetric appearance of the human face, and the contextual information of this asymmetric appearance is encoded in a head-pose-related neighborhood. However, the CNNs used in existing head pose estimation methods often operate evenly on the features of the full image, so it is hard for those methods to collect the contextual information of such asymmetric appearance. To address this issue, this paper proposes a novel head pose estimation method that can perceive the asymmetric appearance in human faces. Specifically, awareness of this asymmetry is achieved through local pairwise feature interaction in the head-pose-related neighborhood via bilinear pooling. Evaluations on two public datasets demonstrate that the method can achieve promising results.
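To make "local pairwise feature interaction via bilinear pooling" concrete, the following is a minimal NumPy sketch of generic bilinear pooling restricted to a region of a feature map. It is not the authors' implementation: the function name `bilinear_pool`, the boolean `mask` standing in for a pose-related neighborhood, the signed square-root and L2 normalization steps, and the feature map sizes are all assumptions made for illustration, since the abstract does not specify them.

```python
import numpy as np


def bilinear_pool(feat_a, feat_b, mask=None):
    """Pool pairwise channel interactions of two feature maps.

    feat_a: array of shape (C1, H, W)
    feat_b: array of shape (C2, H, W)
    mask:   optional boolean array of shape (H, W) selecting the spatial
            locations to pool over (a hypothetical stand-in for a
            pose-related neighborhood; not taken from the paper).
    Returns a normalized descriptor of length C1 * C2.
    """
    c1, h, w = feat_a.shape
    c2 = feat_b.shape[0]
    a = feat_a.reshape(c1, h * w)
    b = feat_b.reshape(c2, h * w)
    if mask is not None:
        keep = mask.reshape(h * w)
        a, b = a[:, keep], b[:, keep]
    n = a.shape[1]
    # a @ b.T equals the sum over locations of the outer products a_l b_l^T.
    pooled = (a @ b.T) / max(n, 1)
    vec = pooled.reshape(-1)
    # Signed square root followed by L2 normalization (a common, assumed choice).
    vec = np.sign(vec) * np.sqrt(np.abs(vec))
    norm = np.linalg.norm(vec)
    return vec / norm if norm > 0 else vec


# Usage with random features and an arbitrary 3x3 region (illustrative only).
rng = np.random.default_rng(0)
fa = rng.standard_normal((4, 6, 6))
fb = rng.standard_normal((3, 6, 6))
mask = np.zeros((6, 6), dtype=bool)
mask[2:5, 1:4] = True
desc = bilinear_pool(fa, fb, mask)
print(desc.shape)  # (12,)
```

Pooling over a masked region rather than the full map is one simple way to express the abstract's contrast between operating evenly on the whole image and gathering context from a neighborhood; how the paper actually selects that neighborhood is not stated here.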
Pages: 10