Efficient face detection and tracking in video sequences based on deep learning

被引：20

作者：

Zheng, Guangyong ^{[1
,2
]}

Xu, Yuming ^{[3
]}

机构：

[1] Hengyang Normal Univ, Coll Comp Sci & Technol, Hengyang 421002, Hunan, Peoples R China

[2] Hunan Prov Key Lab Intelligent Informat Proc & Ap, Hengyang 421002, Hunan, Peoples R China

[3] Changsha Normal Univ, Coll Informat Sci & Engn, Changsha 410100, Hunan, Peoples R China

来源：

INFORMATION SCIENCES | 2021年 / 568卷

基金：

中国国家自然科学基金;

关键词：

Deep learning; Face detection; Face tracking; Regression network; Correction network; MEAN SHIFT;

D O I：

10.1016/j.ins.2021.03.027

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Video-based face detection and tracking technology has been widely used in video surveillance, safe driving, and medical diagnosis. In video sequences, most existing face detection and tracking methods face interference caused by occlusion, ambient illumination, and changes in human posture. To accurately track human faces in video sequences, we propose an efficient face detection and tracking framework based on deep learning, which includes a SENResNet face detection model and a Regression Network-based Face Tracking (RNFT) model. Firstly, the SENResNet model integrates the Squeeze and Excitation Network (SEN) with the Residual Neural Network (ResNet). To solve the problem that deep neural networks are difficult to train, we use ResNet to overcome the problem of gradient disappearance in deep network training. To fuse the features of each channel during the convolution operation, we further integrate the SEN module into the SENResNet model. SENResNet accurately detects facial information in each frame and extracts the position of the target face, thereby providing an initialization window for face tracking. Then, the RNFT model extracts facial features from adjacent frames and predict the position of the target face in the next frame. To address the problem of feature scaling, we add a correction network to the RNFT model. The improved RNFT model extracts the rectangular frame of the target face in the previous frame and strengthens the perception of feature scaling, thereby improving its accuracy. Extensive experimental results on public facial and video datasets show that the proposed SENResNet and RNFT models are superior to the state-of-the-art comparison methods in terms of accuracy and performance. (c) 2021 Elsevier Inc. All rights reserved.

引用

页码：265 / 285

页数：21

共 50 条

[1] Efficient object detection and tracking in video sequences
Dornaika, Fadi
Chakik, Fadi
[J]. JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2012, 29 (06) : 928 - 935
[2] Enhanced Deep Learning Architectures for Face Liveness Detection for Static and Video Sequences
Koshy, Ranjana
Mahmood, Ausif
[J]. ENTROPY, 2020, 22 (10) : 1 - 27
[3] RETRACTED ARTICLE: Video Face Detection Based on Deep Learning
Weiwei Liu
[J]. Wireless Personal Communications, 2018, 102 : 2853 - 2868
[4] Retraction Note: Video Face Detection Based on Deep Learning
Weiwei Liu
[J]. Wireless Personal Communications, 2023, 128 : 1497 - 1497
[5] Detection and Tracking of Moving Target Based on Deep Learning for Video SAR
Lin, Jie
Cheng, Li
Wu, Fuwei
Yang, Yuhao
Li, Pin
Jin, Lin
[J]. 2022 INTERNATIONAL CONFERENCE ON MICROWAVE AND MILLIMETER WAVE TECHNOLOGY (ICMMT), 2022,
[6] Face detection and tracking in video sequences using the modified census transformation
Kueblbeck, Christian
Ernst, Andreas
[J]. IMAGE AND VISION COMPUTING, 2006, 24 (06) : 564 - 572
[7] Apperance-based tracking and face identification in video sequences
Buenaposada, Jose Miguel
Bekios, Juan
Baumela, Luis
[J]. ARTICULATED MOTION AND DEFORMABLE OBJECTS, PROCEEDINGS, 2008, 5098 : 349 - +
[8] Face Detection in Video Sequences
Malach, Tobias
Bambuch, Petr
Malach, Jindrich
[J]. PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE - RADIOELEKTRONIKA 2012, 2012, : 289 - 292
[9] LEARNING DEEP FEATURES FOR EFFICIENT FACE DETECTION
Hbali, Youssef
Ballihi, Lahoucine
Ed-doughmi, Younes
Sadgal, Mohammed
[J]. 2019 THIRD INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS 2019), 2019,
[10] Video Frame-Based Deep Learning Face Detection-A Review
Krishnaraj, M.
Raj, R. Jeberson Retna
[J]. ICSPC'21: 2021 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION (ICPSC), 2021, : 207 - 213

← 1 2 3 4 5 →