Face Detection Algorithm Based on a Lightweight Attention Mechanism Network

被引:2
|
作者
Gao Liuya [1 ]
Sun Dong [1 ]
Lu Yixiang [1 ]
机构
[1] Anhui Univ, Coll Elect Engn & Automat, Hefei 230601, Anhui, Peoples R China
关键词
image processing; face detection; deep learning; lightweight network; attention mechanism; K-means++;
D O I
10.3788/LOP202158.0210010
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This study proposes a new network framework based on a lightweight attention mechanism and the YOLOv3 backbone network. When designing the feature extraction network, the standard convolutions of the YOLOv3 backbone network are replaced using depthwise and pointwise convolutions, thereby accelerating the model training and increasing the detection speed. Next, the speed and accuracy of the model are weighted using an attention mechanism module. Finally, multiple-scale prediction layers are added to extract more feature information; simultaneously, the network parameters are optimized using the K-means+ clustering algorithm. In an experimental evaluation on face-detection performance, this method considerably improved the face-detection performance, achieving 94.08 Yo precision and 83.97 Yo recall on the Wider Face dataset. The average detection time is 0.022 s, which is 4.45 times higher than that of the original YOLOv3 algorithm.
引用
下载
收藏
页数:9
相关论文
共 29 条
  • [1] CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
  • [2] Histograms of oriented gradients for human detection
    Dalal, N
    Triggs, B
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
  • [3] Review of Deep Learning Based Object Detection Methods and Their Mainstream Frameworks
    Duan Zhongjing
    Li Shaobo
    Hu Jianjun
    Yang Jing
    Wang Zheng
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (12)
  • [4] Farhadi A., 2018, P IEEE C COMP VIS PA
  • [5] Cascade Object Detection with Deformable Part Models
    Felzenszwalb, Pedro F.
    Girshick, Ross B.
    McAllester, David
    [J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 2241 - 2248
  • [6] Fast R-CNN
    Girshick, Ross
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448
  • [7] He KM, 2014, LECT NOTES COMPUT SC, V8691, P346, DOI [arXiv:1406.4729, 10.1007/978-3-319-10578-9_23]
  • [8] He Kaiming, 2015, C COMP VIS PATT REC
  • [9] Howard A, MOBILENETS EFFICIENT
  • [10] Searching for MobileNetV3
    Howard, Andrew
    Sandler, Mark
    Chu, Grace
    Chen, Liang-Chieh
    Chen, Bo
    Tan, Mingxing
    Wang, Weijun
    Zhu, Yukun
    Pang, Ruoming
    Vasudevan, Vijay
    Le, Quoc V.
    Adam, Hartwig
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1314 - 1324