Towards End-to-End Face Recognition Method through Spatial Transformer

被引:0
|
作者
Wu, Yanfang [1 ,2 ]
Lu, Xiaobo [1 ,2 ]
Qin, Chen [1 ,2 ]
机构
[1] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China
[2] Southeast Univ, Minist Educ, Key Lab Measurement & Control Complex Syst Engn, Nanjing 210096, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1088/1742-6596/1575/1/012090
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Face recognition technology is widely applied in daily life, but in most methods, similarity or affine transformation is employed to align face images according to five facial landmarks. The face alignment module is implemented independently, thus it's difficult in end-to-end training. In this paper, the main purpose is to design a towards end-to-end trainable face recognition method based on indoor scenes. Due to that spatial transformer can implement any parametrizable transformation, we joint it with recognition network, making end-to-end training possible. Simultaneously, any prior knowledge on facial landmarks isn't required. The model jointly with spatial transformer can achieve 0.3% higher accuracy than similarity transformation. Most downsampling methods ignore the sampling theorem, making convolutional networks not shift-invariant. We replace max-pooling by MaxBlurPool in spatial transformer network, and the accuracy is improved by 0.25%.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] End-to-End Spatial Transform Face Detection and Recognition
    Zhang, Hongxin
    Chi, Liying
    [J]. Virtual Reality and Intelligent Hardware, 2020, 2 (02): : 119 - 131
  • [2] Spatial–temporal transformer for end-to-end sign language recognition
    Zhenchao Cui
    Wenbo Zhang
    Zhaoxin Li
    Zhaoqi Wang
    [J]. Complex & Intelligent Systems, 2023, 9 : 4645 - 4656
  • [3] An end-to-end face recognition method with alignment learning
    Tang, Fenggao
    Wu, Xuedong
    Zhu, Zhiyu
    Wan, Zhengang
    Chang, Yanchao
    Du, Zhaoping
    Gu, Lili
    [J]. OPTIK, 2020, 205
  • [4] An End-to-End Air Writing Recognition Method Based on Transformer
    Tan, Xuhang
    Tong, Jicheng
    Matsumaru, Takafumi
    Dutta, Vibekananda
    He, Xin
    [J]. IEEE ACCESS, 2023, 11 : 109885 - 109898
  • [5] Spatial-temporal transformer for end-to-end sign language recognition
    Cui, Zhenchao
    Zhang, Wenbo
    Li, Zhaoxin
    Wang, Zhaoqi
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (04) : 4645 - 4656
  • [6] Toward End-to-End Face Recognition Through Alignment Learning
    Zhong, Yuanyi
    Chen, Jiansheng
    Huang, Bo
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (08) : 1213 - 1217
  • [7] Dynamic deformable transformer for end-to-end face alignment
    Han, Liming
    Yang, Chi
    Li, Qing
    Yao, Bin
    Jiao, Zixian
    Xie, Qianyang
    [J]. IET COMPUTER VISION, 2023, 17 (08) : 948 - 961
  • [8] An End-to-End Spatial-Temporal Transformer Model for Surgical Action Triplet Recognition
    Zou, Xiaoyang
    Yu, Derong
    Tao, Rong
    Zheng, Guoyan
    [J]. 12TH ASIAN-PACIFIC CONFERENCE ON MEDICAL AND BIOLOGICAL ENGINEERING, VOL 2, APCMBE 2023, 2024, 104 : 114 - 120
  • [9] Online Compressive Transformer for End-to-End Speech Recognition
    Leong, Chi-Hang
    Huang, Yu-Han
    Chien, Jen-Tzung
    [J]. INTERSPEECH 2021, 2021, : 2082 - 2086
  • [10] Towards end-to-end container code recognition
    Li, Yanchao
    Li, Hao
    Gao, Guangwei
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (11) : 15901 - 15918