PointFaceFormer: local and global attention based transformer for 3D point cloud face recognition

Cited by: 0
Authors
Gao, Ziqi [1, 2]
Li, Qiufu [1, 2]
Wang, Gui [1, 2, 3]
Shen, Linlin [1, 2, 3]
Affiliations
[1] Shenzhen Univ, Comp Vis Inst, Shenzhen, Peoples R China
[2] Shenzhen Univ, Natl Engn Lab Big Data Syst Comp Technol, Shenzhen, Peoples R China
[3] Univ Nottingham, Dept Comp Sci, Ningbo, Peoples R China
Funding
National Natural Science Foundation of China
DOI
10.1109/FG59268.2024.10581966
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Existing 3D point cloud-based face recognition methods struggle to fully exploit both the global and the local information inherent in point cloud data. In this paper, we introduce PointFaceFormer, the first Transformer model designed for 3D point cloud face recognition. It incorporates an attention mechanism based on dot-product and cosine functions to construct a similarity Transformer architecture that extracts both local and global features from the point cloud data. Experimental results show that PointFaceFormer achieves a recognition accuracy of 89.08% and a verification accuracy of 76.93% on the large-scale facial point cloud dataset Lock3DFace, setting a new state of the art in 3D face recognition. PointFaceFormer also generalizes well to cross-quality datasets, and ablation experiments confirm the effectiveness of the attention mechanism and the proposed modules.
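The abstract names two similarity functions for attention, dot product and cosine, without giving their exact form or how they are combined. The following is a minimal sketch of the two generic scoring functions only, not the authors' architecture: the function names, the 1/sqrt(d) scaling, the `eps` term, and the toy features are all illustrative assumptions.

```python
import math


def dot(a, b):
    """Inner product of two equal-length feature vectors."""
    return sum(x * y for x, y in zip(a, b))


def softmax(row):
    """Numerically stable softmax over one row of scores."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def dot_product_scores(queries, keys):
    """Scaled dot-product similarity (the scaling is an assumption)."""
    scale = math.sqrt(len(keys[0]))
    return [[dot(q, k) / scale for k in keys] for q in queries]


def cosine_scores(queries, keys, eps=1e-8):
    """Cosine similarity: dot product of the vectors divided by their norms."""
    def norm(v):
        return math.sqrt(dot(v, v))
    return [[dot(q, k) / (norm(q) * norm(k) + eps) for k in keys]
            for q in queries]


def attend(scores, values):
    """Softmax each score row, then take the weighted sum of value vectors."""
    dim = len(values[0])
    weights = [softmax(row) for row in scores]
    return [[sum(w * v[d] for w, v in zip(row, values)) for d in range(dim)]
            for row in weights]


# Toy per-point features (hypothetical): three points, two channels each.
features = [[1.0, 0.0], [0.0, 1.0], [2.0, 0.0]]
out_dot = attend(dot_product_scores(features, features), features)
out_cos = attend(cosine_scores(features, features), features)
```

Cosine scores are bounded in [-1, 1] and ignore feature magnitude, whereas dot-product scores grow with it, so the two can weight the same pair of points quite differently.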
Pages: 8