Accurate Facial Landmark Detector via Multi-scale Transformer

被引:2
|
作者
Sha, Yuyang [1 ]
Meng, Weiyu [1 ]
Zhai, Xiaobing [1 ]
Xie, Can [1 ]
Li, Kefeng [1 ]
机构
[1] Macao Polytech Univ, Fac Appl Sci, Taipa, Macao, Peoples R China
关键词
Facial landmark detection; Vision transformer; Multi-scale feature; Global information;
D O I
10.1007/978-981-99-8469-5_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Facial landmark detection is an essential prerequisite for many face applications, which has attracted much attention and made remarkable progress in recent years. However, some problems still need to be solved urgently, including improving the accuracy of facial landmark detectors in complex scenes, encoding long-range relationships between keypoints and facial components, and optimizing the robustness of methods in unconstrained environments. To address these problems, we propose a novel facial landmark detector via multi-scale transformer (MTLD), which contains three modules: Multi-scale Transformer, Joint Regression, and Structure Loss. The proposed Multi-scale Transformer focuses on capturing long-range information and cross-scale representations from multi-scale feature maps. The Joint Regression takes advantage of both coordinate and heatmap regression, which could boost the inference speed without sacrificing model accuracy. Furthermore, in order to explore the structural dependency between facial landmarks, we design the Structure Loss to fully utilize the geometric information in face images. We evaluate the proposed method through extensive experiments on four benchmark datasets. The results demonstrate that our method outperforms state-of-the-art approaches both in accuracy and efficiency.
引用
收藏
页码:278 / 290
页数:13
相关论文
共 50 条
  • [1] Fatigue Driving Recognition Method Based on Multi-Scale Facial Landmark Detector
    Xiao, Weichu
    Liu, Hongli
    Ma, Ziji
    Chen, Weihong
    Sun, Changliang
    Shi, Bo
    ELECTRONICS, 2022, 11 (24)
  • [2] TALKINGFLOW: TALKING FACIAL LANDMARK GENERATION WITH MULTI-SCALE NORMALIZING FLOW NETWORK
    Liang, Sen
    Zhou, Zhize
    Li, Rong
    Zhang, Juyong
    Bao, Hujun
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4628 - 4632
  • [3] Progressive Multi-Scale Vision Transformer for Facial Action Unit Detection
    Wang, Chongwen
    Wang, Zicheng
    FRONTIERS IN NEUROROBOTICS, 2022, 15 (15):
  • [4] Multi-Scale Detector for Accurate Vehicle Detection in Traffic Surveillance Data
    Kim, Kwang-Ju
    Kim, Pyong-Kun
    Chung, Yun-Su
    Choi, Doo-Hyun
    IEEE ACCESS, 2019, 7 : 78311 - 78319
  • [5] LMTformer: facial depression recognition with lightweight multi-scale transformer from videos
    He, Lang
    Zhao, Junnan
    Zhang, Jie
    Jiang, Jiewei
    Qi, Senqing
    Wang, Zhongmin
    Wu, Di
    APPLIED INTELLIGENCE, 2025, 55 (02)
  • [6] Multi-scale Hybrid Transformer Network with Grouped Convolutional Embedding for Automatic Cephalometric Landmark Detection
    Wu, Fuli
    Chen, Lijie
    Feng, Bin
    Hao, Pengyi
    COMPUTER-AIDED DESIGN AND COMPUTER GRAPHICS, CAD/GRAPHICS 2023, 2024, 14250 : 250 - 265
  • [7] Frequency Learning via Multi-Scale Fourier Transformer for MRI Reconstruction
    Yi, Qiaosi
    Fang, Faming
    Zhang, Guixu
    Zeng, Tieyong
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (11) : 5506 - 5517
  • [8] Multi-scale facial scanning via spatial LSTM for latent facial feature representation
    Kim, Seong Tae
    Choi, Yeoreum
    Ro, Yong Man
    2017 INTERNATIONAL CONFERENCE OF THE BIOMETRICS SPECIAL INTEREST GROUP (BIOSIG), 2017,
  • [9] Towards Accurate Facial Landmark Detection via Cascaded Transformers
    Li, Hui
    Guo, Zidong
    Rhee, Seon-Min
    Han, Seungju
    Han, Jae-Joon
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4166 - 4175
  • [10] Accurate Multi-Scale License Plate Localization Via Image Saliency
    He, Tong
    Yao, Jian
    Zhang, Kao
    Hou, Yaolin
    Han, Shiyao
    2014 IEEE 17TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2014, : 1567 - 1572