3D-aware Facial Landmark Detection via Multi-view Consistent Training on Synthetic Data

Cited by: 4
Authors
Zeng, Libing [1]
Chen, Lele [2]
Bao, Wentao [3]
Li, Zhong [2]
Xu, Yi [2]
Yuan, Junsong [4]
Kalantari, Nima K. [1]
Affiliations
[1] Texas A&M Univ, College Stn, TX 77843 USA
[2] InnoPeak Technol Inc, OPPO US Res Ctr, Palo Alto, CA USA
[3] Michigan State Univ, E Lansing, MI 48824 USA
[4] SUNY Buffalo, Buffalo, NY USA
DOI
10.1109/CVPR52729.2023.01226
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Accurate facial landmark detection on wild images plays an essential role in human-computer interaction, entertainment, and medical applications. Existing approaches have limitations in enforcing 3D consistency while detecting 3D/2D facial landmarks due to the lack of multi-view in-the-wild training data. Fortunately, with the recent advances in generative visual models and neural rendering, we have witnessed rapid progress towards high-quality 3D image synthesis. In this work, we leverage such approaches to construct a synthetic dataset and propose a novel multi-view consistent learning strategy to improve 3D facial landmark detection accuracy on in-the-wild images. The proposed 3D-aware module can be plugged into any learning-based landmark detection algorithm to enhance its accuracy. We demonstrate the superiority of the proposed plug-in module with extensive comparisons against state-of-the-art methods on several real and synthetic datasets.
Pages: 12747-12758
Number of pages: 12