Learning spatial-temporal deformable networks for unconstrained face alignment and tracking in videos

被引:11
|
作者
Zhu, Hongyu [1 ]
Liu, Hao [1 ,2 ]
Zhu, Congcong [1 ,3 ]
Deng, Zongyong [1 ]
Sun, Xuehong [1 ,2 ]
机构
[1] Ningxia Univ, Sch Informat Engn, Yinchuan 750021, Ningxia, Peoples R China
[2] Collaborat Innovat Ctr Ningxia Big Data & Artific, Yinchuan 750021, Ningxia, Peoples R China
[3] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
基金
美国国家科学基金会;
关键词
Face alignment; Face tracking; Spatial transformer; Relational reasoning; Video analysis; Biometrics; IMAGE;
D O I
10.1016/j.patcog.2020.107354
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a spatial-temporal deformable networks approach to investigate both problems of face alignment in static images and face tracking in videos under unconstrained environments. Unlike conventional feature extractions which cannot explicitly exploit augmented spatial geometry for various facial shapes, in our approach, we propose a deformable hourglass networks (DHGN) method, which aims to learn a deformable mask to reduce the variances of facial deformation and extract attentional facial regions for robust feature representation. However, our DHGN is limited to extract only spatial appearance features from static facial images, which cannot explicitly exploit the temporal consistency information across consecutive frames in videos. For efficient temporal modeling, we further extend our DHGN to a temporal DHGN (T-DHGN) paradigm particularly for video-based face alignment. To this end, our T-DHGN principally incorporates with a temporal relational reasoning module, so that the temporal order relationship among frames is encoded in the relational feature. By doing this, our T-DHGN reasons about the temporal offsets to select a subset of discriminative frames over time steps, thus allowing temporal consistency information memorized to flow across frames for stable landmark tracking in videos. Compared with most state-of-the-art methods, our approach achieves superior performance on folds of widely-evaluated benchmarking datasets. Code will be made publicly available upon publication. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] High Speed Implementation of the Deformable Shape Tracking Face Alignment Algorithm
    Petrellis, Nikos
    Zogas, Stavros
    Christakos, Panagiotis
    Keramidas, Georgios
    Mousouliotis, Panagiotis
    Voros, Nikolaos
    Antonopoulos, Christos
    2021 24TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN (DSD 2021), 2021, : 174 - 177
  • [42] INVESTIGATION IN SPATIAL-TEMPORAL DOMAIN FOR FACE SPOOF DETECTION
    Sun, Zhonglin
    Sun, Li
    Li, Qingli
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 1538 - 1542
  • [43] Spatial-Temporal Graph Attention Networks: A Deep Learning Approach for Traffic Forecasting
    Zhang, Chenhan
    Yu, James J. Q.
    Liu, Yi
    IEEE ACCESS, 2019, 7 : 166246 - 166256
  • [44] Learning Adaptive Spatial-Temporal Context-Aware Correlation Filters for UAV Tracking
    Yuan, Di
    Chang, Xiaojun
    Li, Zhihui
    He, Zhenyu
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (03)
  • [45] Learning background-aware and spatial-temporal regularized correlation filters for visual tracking
    Jianming Zhang
    Yaoqi He
    Wenjun Feng
    Jin Wang
    Neal N. Xiong
    Applied Intelligence, 2023, 53 : 7697 - 7712
  • [46] Attentive Spatial-Temporal Summary Networks for Feature Learning in Irregular Gait Recognition
    Li, Shuangqun
    Liu, Wu
    Ma, Huadong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (09) : 2361 - 2375
  • [47] Learning background-aware and spatial-temporal regularized correlation filters for visual tracking
    Zhang, Jianming
    He, Yaoqi
    Feng, Wenjun
    Wang, Jin
    Xiong, Neal N.
    APPLIED INTELLIGENCE, 2023, 53 (07) : 7697 - 7712
  • [48] Online learning and joint optimization of combined spatial-temporal models for robust visual tracking
    Zhou, Tao
    Bhaskar, Harish
    Liu, Fanghui
    Yang, Jie
    Cai, Ping
    NEUROCOMPUTING, 2017, 226 : 221 - 237
  • [49] Online Learning of Spatial-Temporal Convolution Response for Robust Real-Time Tracking
    Zhou, Jinglin
    Wang, Rong
    Ding, Jianwei
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1821 - 1826
  • [50] LEARNING A MULTI-CENTER CONVOLUTIONAL NETWORK FOR UNCONSTRAINED FACE ALIGNMENT
    Shao, Zhiwen
    Zhu, Hengliang
    Hao, Yangyang
    Wang, Min
    Ma, Lizhuang
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 109 - 114