Efficient face detection and tracking in video sequences based on deep learning

被引:20
|
作者
Zheng, Guangyong [1 ,2 ]
Xu, Yuming [3 ]
机构
[1] Hengyang Normal Univ, Coll Comp Sci & Technol, Hengyang 421002, Hunan, Peoples R China
[2] Hunan Prov Key Lab Intelligent Informat Proc & Ap, Hengyang 421002, Hunan, Peoples R China
[3] Changsha Normal Univ, Coll Informat Sci & Engn, Changsha 410100, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; Face detection; Face tracking; Regression network; Correction network; MEAN SHIFT;
D O I
10.1016/j.ins.2021.03.027
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video-based face detection and tracking technology has been widely used in video surveillance, safe driving, and medical diagnosis. In video sequences, most existing face detection and tracking methods face interference caused by occlusion, ambient illumination, and changes in human posture. To accurately track human faces in video sequences, we propose an efficient face detection and tracking framework based on deep learning, which includes a SENResNet face detection model and a Regression Network-based Face Tracking (RNFT) model. Firstly, the SENResNet model integrates the Squeeze and Excitation Network (SEN) with the Residual Neural Network (ResNet). To solve the problem that deep neural networks are difficult to train, we use ResNet to overcome the problem of gradient disappearance in deep network training. To fuse the features of each channel during the convolution operation, we further integrate the SEN module into the SENResNet model. SENResNet accurately detects facial information in each frame and extracts the position of the target face, thereby providing an initialization window for face tracking. Then, the RNFT model extracts facial features from adjacent frames and predict the position of the target face in the next frame. To address the problem of feature scaling, we add a correction network to the RNFT model. The improved RNFT model extracts the rectangular frame of the target face in the previous frame and strengthens the perception of feature scaling, thereby improving its accuracy. Extensive experimental results on public facial and video datasets show that the proposed SENResNet and RNFT models are superior to the state-of-the-art comparison methods in terms of accuracy and performance. (c) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页码:265 / 285
页数:21
相关论文
共 50 条
  • [1] Efficient object detection and tracking in video sequences
    Dornaika, Fadi
    Chakik, Fadi
    [J]. JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2012, 29 (06) : 928 - 935
  • [2] Enhanced Deep Learning Architectures for Face Liveness Detection for Static and Video Sequences
    Koshy, Ranjana
    Mahmood, Ausif
    [J]. ENTROPY, 2020, 22 (10) : 1 - 27
  • [3] RETRACTED ARTICLE: Video Face Detection Based on Deep Learning
    Weiwei Liu
    [J]. Wireless Personal Communications, 2018, 102 : 2853 - 2868
  • [4] Retraction Note: Video Face Detection Based on Deep Learning
    Weiwei Liu
    [J]. Wireless Personal Communications, 2023, 128 : 1497 - 1497
  • [5] Detection and Tracking of Moving Target Based on Deep Learning for Video SAR
    Lin, Jie
    Cheng, Li
    Wu, Fuwei
    Yang, Yuhao
    Li, Pin
    Jin, Lin
    [J]. 2022 INTERNATIONAL CONFERENCE ON MICROWAVE AND MILLIMETER WAVE TECHNOLOGY (ICMMT), 2022,
  • [6] Face detection and tracking in video sequences using the modified census transformation
    Kueblbeck, Christian
    Ernst, Andreas
    [J]. IMAGE AND VISION COMPUTING, 2006, 24 (06) : 564 - 572
  • [7] Apperance-based tracking and face identification in video sequences
    Buenaposada, Jose Miguel
    Bekios, Juan
    Baumela, Luis
    [J]. ARTICULATED MOTION AND DEFORMABLE OBJECTS, PROCEEDINGS, 2008, 5098 : 349 - +
  • [8] Face Detection in Video Sequences
    Malach, Tobias
    Bambuch, Petr
    Malach, Jindrich
    [J]. PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE - RADIOELEKTRONIKA 2012, 2012, : 289 - 292
  • [9] LEARNING DEEP FEATURES FOR EFFICIENT FACE DETECTION
    Hbali, Youssef
    Ballihi, Lahoucine
    Ed-doughmi, Younes
    Sadgal, Mohammed
    [J]. 2019 THIRD INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS 2019), 2019,
  • [10] Video Frame-Based Deep Learning Face Detection-A Review
    Krishnaraj, M.
    Raj, R. Jeberson Retna
    [J]. ICSPC'21: 2021 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION (ICPSC), 2021, : 207 - 213