Learning spatial-temporal deformable networks for unconstrained face alignment and tracking in videos

Cited by: 11
Authors
Zhu, Hongyu [1]
Liu, Hao [1,2]
Zhu, Congcong [1,3]
Deng, Zongyong [1]
Sun, Xuehong [1,2]
Affiliations
[1] Ningxia Univ, Sch Informat Engn, Yinchuan 750021, Ningxia, Peoples R China
[2] Collaborat Innovat Ctr Ningxia Big Data & Artific, Yinchuan 750021, Ningxia, Peoples R China
[3] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
Funding
National Science Foundation (USA);
Keywords
Face alignment; Face tracking; Spatial transformer; Relational reasoning; Video analysis; Biometrics; IMAGE;
DOI
10.1016/j.patcog.2020.107354
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we propose a spatial-temporal deformable networks approach that addresses both face alignment in static images and face tracking in videos under unconstrained environments. Unlike conventional feature extraction, which cannot explicitly exploit augmented spatial geometry for various facial shapes, our approach introduces a deformable hourglass networks (DHGN) method, which learns a deformable mask to reduce the variance of facial deformation and to extract attentional facial regions for robust feature representation. However, DHGN is limited to extracting spatial appearance features from static facial images and cannot explicitly exploit the temporal consistency information across consecutive frames in videos. For efficient temporal modeling, we further extend DHGN to a temporal DHGN (T-DHGN) paradigm designed for video-based face alignment. To this end, T-DHGN incorporates a temporal relational reasoning module, so that the temporal order relationship among frames is encoded in the relational feature. In this way, T-DHGN reasons about the temporal offsets to select a subset of discriminative frames over time steps, allowing memorized temporal consistency information to flow across frames for stable landmark tracking in videos. Compared with most state-of-the-art methods, our approach achieves superior performance on several widely evaluated benchmark datasets. Code will be made publicly available upon publication. © 2020 Elsevier Ltd. All rights reserved.
Pages: 12
Related papers
50 in total
  • [1] LEARNING DEFORMABLE HOURGLASS NETWORKS (DHGN) FOR UNCONSTRAINED FACE ALIGNMENT
    Zhang, Jiaqiang
    Zhu, Congcong
    Wu, Suping
    Yu, Zhenhua
    Sun, Xuehong
    Liu, Hao
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 1960 - 1964
  • [2] Tracking the Evolving Spatial-Temporal Gene Networks
    Gong, Weikang
    Wan, Lin
    IFAC PAPERSONLINE, 2015, 48 (28): : 1365 - 1368
  • [3] A Spatial-Temporal Deformable Attention Based Framework for Breast Lesion Detection in Videos
    Qin, Chao
    Cao, Jiale
    Fu, Huazhu
    Anwer, Rao Muhammad
    Khan, Fahad Shahbaz
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT II, 2023, 14221 : 479 - 488
  • [4] Offline Deformable Face Tracking in Arbitrary Videos
    Chrysos, Grigorios G.
    Antonakos, Epameinondas
    Zafeiriou, Stefanos
    Snape, Patrick
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOP (ICCVW), 2015, : 954 - 962
  • [5] Object tracking in surveillance videos using spatial-temporal correlation graph model
    Zhang, Cheng
    Ma, Huadong
    Fu, Huiyuan
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2015, 41 (04): : 713 - 720
  • [6] Spatial-Temporal Relation Networks for Multi-Object Tracking
    Xu, Jiarui
    Cao, Yue
    Zhang, Zheng
    Hu, Han
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3987 - 3997
  • [7] Learning Spatial-Temporal Regularized Correlation Filters for Visual Tracking
    Li, Feng
    Tian, Cheng
    Zuo, Wangmeng
    Zhang, Lei
    Yang, Ming-Hsuan
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4904 - 4913
  • [8] STGL: Spatial-Temporal Graph Representation and Learning for Visual Tracking
    Jiang, Bo
    Zhang, Yuan
    Luo, Bin
    Cao, Xiaochun
    Tang, Jin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 2162 - 2171
  • [9] Learning Dynamic Spatial-Temporal Regularization for UAV Object Tracking
    Deng, Chenwei
    He, Shuangcheng
    Han, Yuqi
    Zhao, Boya
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1230 - 1234
  • [10] LEARNING SPATIAL-TEMPORAL CONSISTENT CORRELATION FILTER FOR VISUAL TRACKING
    Lou, Han
    Wang, Dongfei
    Jiang, Zhuqing
    Men, Aidong
    Zhou, Yun
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2017,