Face Detection Algorithm Based on a Lightweight Attention Mechanism Network

被引：2

作者：

Gao Liuya ^{[1
]}

Sun Dong ^{[1
]}

Lu Yixiang ^{[1
]}

机构：

[1] Anhui Univ, Coll Elect Engn & Automat, Hefei 230601, Anhui, Peoples R China

来源：

LASER & OPTOELECTRONICS PROGRESS | 2021年 / 58卷 / 02期

关键词：

image processing; face detection; deep learning; lightweight network; attention mechanism; K-means++;

D O I：

10.3788/LOP202158.0210010

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This study proposes a new network framework based on a lightweight attention mechanism and the YOLOv3 backbone network. When designing the feature extraction network, the standard convolutions of the YOLOv3 backbone network are replaced using depthwise and pointwise convolutions, thereby accelerating the model training and increasing the detection speed. Next, the speed and accuracy of the model are weighted using an attention mechanism module. Finally, multiple-scale prediction layers are added to extract more feature information; simultaneously, the network parameters are optimized using the K-means+ clustering algorithm. In an experimental evaluation on face-detection performance, this method considerably improved the face-detection performance, achieving 94.08 Yo precision and 83.97 Yo recall on the Wider Face dataset. The average detection time is 0.022 s, which is 4.45 times higher than that of the original YOLOv3 algorithm.

引用

下载

页数：9

共 29 条

[1] CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[2] Histograms of oriented gradients for human detection
Dalal, N
Triggs, B
[J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
[3] Review of Deep Learning Based Object Detection Methods and Their Mainstream Frameworks
Duan Zhongjing
Li Shaobo
Hu Jianjun
Yang Jing
Wang Zheng
[J]. LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (12)
[4] Farhadi A., 2018, P IEEE C COMP VIS PA
[5] Cascade Object Detection with Deformable Part Models
Felzenszwalb, Pedro F.
Girshick, Ross B.
McAllester, David
[J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 2241 - 2248
[6] Fast R-CNN
Girshick, Ross
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448
[7] He KM, 2014, LECT NOTES COMPUT SC, V8691, P346, DOI [arXiv:1406.4729, 10.1007/978-3-319-10578-9_23]
[8] He Kaiming, 2015, C COMP VIS PATT REC
[9] Howard A, MOBILENETS EFFICIENT
[10] Searching for MobileNetV3
Howard, Andrew
Sandler, Mark
Chu, Grace
Chen, Liang-Chieh
Chen, Bo
Tan, Mingxing
Wang, Weijun
Zhu, Yukun
Pang, Ruoming
Vasudevan, Vijay
Le, Quoc V.
Adam, Hartwig
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1314 - 1324

← 1 2 3 →