TransMarker: A Pure Vision Transformer for Facial Landmark Detection

被引:2
|
作者
Wu, Wenyan [1 ]
Cai, Yici [1 ]
Zhou, Qiang [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing Natl Res Ctr Informat Sci & Technol BNRis, Beijing, Peoples R China
关键词
D O I
10.1109/ICPR56361.2022.9956248
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent years, Convolution Neural Networks (CNNs) have achieved impressive results in facial landmark detection task. Especially, the u-shaped architecture, also known as Unet, has become the de-facto standard and achieved tremendous success. However, due to the locality property of convolution operation, it has a limitation in modeling global and long-range semantic information interaction, which is essential in localization tasks. In this work, we propose a Unet-like pure transformer method TransMarker, in which we give a new perspective to tackle facial landmark detection task in a sequence-to-sequence manner. We first split the input image into non-overlapping patches, which are seen as tokens in NLP tasks. Then, we feed the image patches into a symmetric u-shaped Encoder-Decoder architecture for local-global semantic feature learning. In addition, we introduce a Dense Skip-Connection schema to leverage the multi-level information within different resolutions. Note that, unlike conventional U-net architecture, we design the network with pure Transformer blocks, without any conventional operations. Extensive experiments demonstrate the state-of-the-art performance of our method on several standard datasets, i.e., WFLW, COFW and 300W, which remarkably outperform previous convolutional-based methods.
引用
收藏
页码:3580 / 3587
页数:8
相关论文
共 50 条
  • [21] Face Recognition Based on Facial Landmark Detection
    Juhong, Aniwat
    Pintavirooj, C.
    2017 10TH BIOMEDICAL ENGINEERING INTERNATIONAL CONFERENCE (BMEICON), 2017,
  • [22] Facial Landmark Detection Algorithm in Complex Scenes
    Gao, Haoqi
    Yang, Xing
    Hu, Yihua
    Xu, Haoli
    Liang, Zhenyu
    Wang, Bingwen
    Xiang, Huiqing
    Hu, Zhiyang
    Hu, Shulong
    2024 9TH INTERNATIONAL CONFERENCE ON CONTROL AND ROBOTICS ENGINEERING, ICCRE 2024, 2024, : 352 - 358
  • [23] Deep Structured Prediction for Facial Landmark Detection
    Chen, Lisha
    Su, Hui
    Ji, Qiang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [24] Dilated Skip Convolution for Facial Landmark Detection
    Chim, Seyha
    Lee, Jin-Gu
    Park, Ho-Hyun
    SENSORS, 2019, 19 (24)
  • [25] Multi-spectral Facial Landmark Detection
    Keong, Jin
    Dong, Xingbo
    Jin, Zhe
    Mallat, Khawla
    Dugelay, Jean-Luc
    2020 IEEE INTERNATIONAL WORKSHOP ON INFORMATION FORENSICS AND SECURITY (WIFS), 2020,
  • [26] Fast Facial Landmark Detection and Applications: A Survey
    Khabarlak, Kostiantyn
    Koriashkina, Larysa
    JOURNAL OF COMPUTER SCIENCE & TECHNOLOGY, 2022, 22 (01): : 12 - 41
  • [27] Recurrent neural network for facial landmark detection
    Chen, Yu
    Yang, Jian
    Qian, Jianjun
    NEUROCOMPUTING, 2017, 219 : 26 - 38
  • [28] Face recognition based on facial landmark detection
    1600, Institute of Electrical and Electronics Engineers Inc., United States (2017-January):
  • [29] Style Aggregated Network for Facial Landmark Detection
    Dong, Xuanyi
    Yan, Yan
    Ouyang, Wanli
    Yang, Yi
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 379 - 388
  • [30] Facial Landmark Detection Under Large Pose
    Hao, Yangyang
    Zhu, Hengliang
    Shao, Zhiwen
    Tan, Xin
    Ma, Lizhuang
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT IV, 2018, 11304 : 684 - 696