TransMarker: A Pure Vision Transformer for Facial Landmark Detection

被引:2
|
作者
Wu, Wenyan [1 ]
Cai, Yici [1 ]
Zhou, Qiang [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing Natl Res Ctr Informat Sci & Technol BNRis, Beijing, Peoples R China
关键词
D O I
10.1109/ICPR56361.2022.9956248
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent years, Convolution Neural Networks (CNNs) have achieved impressive results in facial landmark detection task. Especially, the u-shaped architecture, also known as Unet, has become the de-facto standard and achieved tremendous success. However, due to the locality property of convolution operation, it has a limitation in modeling global and long-range semantic information interaction, which is essential in localization tasks. In this work, we propose a Unet-like pure transformer method TransMarker, in which we give a new perspective to tackle facial landmark detection task in a sequence-to-sequence manner. We first split the input image into non-overlapping patches, which are seen as tokens in NLP tasks. Then, we feed the image patches into a symmetric u-shaped Encoder-Decoder architecture for local-global semantic feature learning. In addition, we introduce a Dense Skip-Connection schema to leverage the multi-level information within different resolutions. Note that, unlike conventional U-net architecture, we design the network with pure Transformer blocks, without any conventional operations. Extensive experiments demonstrate the state-of-the-art performance of our method on several standard datasets, i.e., WFLW, COFW and 300W, which remarkably outperform previous convolutional-based methods.
引用
收藏
页码:3580 / 3587
页数:8
相关论文
共 50 条
  • [41] Facial Landmark Detection with Tweaked Convolutional Neural Networks
    Wu, Yue
    Hassner, Tal
    Kim, Kanggeon
    Medioni, Gerard
    Natarajan, Prem
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (12) : 3067 - 3074
  • [42] Robust facial landmark detection for intelligent vehicle system
    Wu, JW
    Trivedi, MM
    ANALYSIS AND MODELLING OF FACES AND GESTURES, PROCEEDINGS, 2005, 3723 : 213 - 228
  • [43] Driver Facial Landmark Detection in Real Driving Situations
    Jeong, Mira
    Ko, Byoung Chul
    Kwak, Sooyeong
    Nam, Jae-Yeal
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (10) : 2753 - 2767
  • [44] Simultaneous Facial Landmark Detection, Pose and Deformation Estimation under Facial Occlusion
    Wu, Yue
    Gou, Chao
    Ji, Qiang
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5719 - 5728
  • [45] Morphological independence for landmark detection in vision based SLAM
    Villaverde, Ivan
    Grana, Manuel
    d'Anjou, Alicia
    COMPUTATIONAL AND AMBIENT INTELLIGENCE, 2007, 4507 : 847 - +
  • [46] Analysis of surgical outcome after upper eyelid surgery by computer vision algorithm using face and facial landmark detection
    İlke Bahçeci Şimşek
    Can Şirolu
    Graefe's Archive for Clinical and Experimental Ophthalmology, 2021, 259 : 3119 - 3125
  • [47] Analysis of surgical outcome after upper eyelid surgery by computer vision algorithm using face and facial landmark detection
    Bahceci Simsek, Ilke
    Sirolu, Can
    GRAEFES ARCHIVE FOR CLINICAL AND EXPERIMENTAL OPHTHALMOLOGY, 2021, 259 (10) : 3119 - 3125
  • [48] Vision Transformer With Attentive Pooling for Robust Facial Expression Recognition
    Xue, Fanglei
    Wang, Qiangchang
    Tan, Zichang
    Ma, Zhongsong
    Guo, Guodong
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (04) : 3244 - 3256
  • [49] Research on facial recognition of sika deer based on vision transformer
    Gong, He
    Luo, Tianye
    Ni, Lingyun
    Li, Ji
    Guo, Jie
    Liu, Tonghe
    Feng, Ruilong
    Mu, Ye
    Hu, Tianli
    Sun, Yu
    Guo, Ying
    Li, Shijun
    ECOLOGICAL INFORMATICS, 2023, 78
  • [50] Enhanced Facial Emotion Recognition Using Vision Transformer Models
    Fatima, N. Sabiyath
    Deepika, G.
    Anthonisamy, Arun
    Chitra, R. Jothi
    Muralidharan, J.
    Alagarsamy, Manjunathan
    Ramyasree, Kummari
    JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2025, 20 (02) : 1143 - 1152