Asymmetry-aware bilinear pooling in multi-modal data for head pose estimation

被引:2
|
作者
Chen, Jiazhong [1 ]
Li, Qingqing [1 ]
Ren, Dakai [2 ]
Cao, Hua [1 ]
Ling, Hefei [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Head pose estimation; Asymmetry-aware; Bilinear pooling; ATTENTION; REPRESENTATION; NETWORK;
D O I
10.1016/j.image.2022.116895
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The head pose on roll and yaw directions is decided by the asymmetric appearance in human faces, and the contextual information of asymmetric appearance is encoded in a head pose related neighborhood. However, CNNs used in existing head pose estimation methods often evenly performs on the features of full image. Thus it is hard to collect the contextual information of such asymmetric appearance by those methods. To address this issue, this paper proposes a novel head pose estimation method that could perceive the asymmetric appearance in human faces. Specifically, the awareness of such asymmetry is undertaken by the local pairwise feature interaction in head pose related neighborhood via bilinear pooling. Evaluations on two public datasets demonstrate that our method could achieve promising results.
引用
下载
收藏
页数:10
相关论文
共 50 条
  • [21] Unified losses for multi-modal pose coding and regression
    Johnson, Leif
    Cooper, Joseph
    Ballard, Dana
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [22] Multi-modal background-aware for defect semantic segmentation with limited data
    Shan, Dexing
    Zhang, Yunzhou
    Liu, Shitong
    JOURNAL OF INTELLIGENT MANUFACTURING, 2024,
  • [23] Multi-Modal Depression Detection and Estimation
    Yang, Le
    2019 8TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS (ACIIW), 2019, : 26 - 30
  • [24] A Transformer-based multi-modal fusion network for 6D pose estimation
    Hong, Jia-Xin
    Zhang, Hong-Bo
    Liu, Jing-Hua
    Lei, Qing
    Yang, Li-Jie
    Du, Ji-Xiang
    INFORMATION FUSION, 2024, 105
  • [25] Perceiver Hopfield Pooling for Dynamic Multi-modal and Multi-instance Fusion
    Roessle, Dominik
    Cremers, Daniel
    Schoen, Torsten
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I, 2022, 13529 : 599 - 610
  • [26] Skeleton aware multi-modal sign language recognition
    Jiang, Songyao
    Sun, Bin
    Wang, Lichen
    Bai, Yue
    Li, Kunpeng
    Fu, Yun
    IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2021, : 3408 - 3418
  • [27] Skeleton Aware Multi-modal Sign Language Recognition
    Jiang, Songyao
    Sun, Bin
    Wang, Lichen
    Bai, Yue
    Li, Kunpeng
    Fu, Yun
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 3408 - 3418
  • [28] Skeleton aware multi-modal sign language recognition
    Jiang, Songyao
    Sun, Bin
    Wang, Lichen
    Bai, Yue
    Li, Kunpeng
    Fu, Yun
    arXiv, 2021,
  • [29] Intention aware interactive multi-modal robot programming
    Iba, S
    Paredis, CJJ
    Khosla, PK
    IROS 2003: PROCEEDINGS OF THE 2003 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, 2003, : 3479 - 3484
  • [30] SAWS: Selective Asymmetry-aware Work-Stealing for Asymmetric Multi-Core Architectures
    Guo, Haodong
    Chen, Quan
    Guo, Minyi
    Xu, Liting
    PROCEEDINGS OF 2016 IEEE 18TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS; IEEE 14TH INTERNATIONAL CONFERENCE ON SMART CITY; IEEE 2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2016, : 116 - 123