Lightweight mask detection algorithm based on improved YOLOv4-tiny

被引:9
|
作者
Zhu Jie [1 ,2 ]
Wang Jian-li [1 ]
Wang Bin [1 ]
机构
[1] Chinese Acad Sci, Changchun Inst Opt Fine Mech & Phys, Changchun 130033, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
mask detection; YOLOv4-tiny; Spatial Pyramid Pooling; feature fusion;
D O I
10.37188/CJLCD.2021-0059
中图分类号
O7 [晶体学];
学科分类号
0702 ; 070205 ; 0703 ; 080501 ;
摘要
During the period of 2019-nCoV controlling, to prevent the spread of the virus, it is necessary to regulate the coverage of mask wearing in densely populated places such as airports and stations. In order to effectively monitor the coverage of mask wearing of crowd, this paper proposes a lightweight mask detection algorithm based on improved YOLOv4-tiny. Following the backbone network of YOLOv4-tiny, a spatial pyramid pooling structure is introduced to pool and fuse the input features at multi-scale, which makes the receptive field of the network enhanced. Then, combined with the path aggregation network, multi-scale features are fused and enhanced repeatedly in two paths to improve the expressive ability of feature maps. Finally, label smoothing is utilized to optimize the loss function for modifying the over-fitting problem in the training process. The experimental results show that the proposed algorithm achieves 94.7% AP and 85.7% AP on mask target and face target respectively (at real-time speed of 76.8 FPS on GeForce GTX 1050ti), which is 4.3% and 7.1% higher than that of YOLOv4-tiny. The proposed algorithm meets the accuracy and real-time requirements of mask detection tasks in various scenes.
引用
下载
收藏
页码:1525 / 1534
页数:11
相关论文
共 22 条
  • [1] [Anonymous], P 3 INT C LEARNING R
  • [2] Bochkovskiy A., 2004, arXiv preprint arXiv, V10934, P2020
  • [3] CABANIA HAMMOUDIK, 2021, SMART HLTH, V19
  • [4] GIRSHICKR, 2015, FASTRGCNN P IEEE INT
  • [5] GIRSHICKR DONAHUEJ, 2016, IEEE T PATTERN ANAL, V38
  • [6] HEK M, 2015, IEEE T PATTERN ANAL, V37, P1904
  • [7] IU ZD, 2020, COMPUTER ENG APPL, V56, P1
  • [8] Microsoft COCO: Common Objects in Context
    Lin, Tsung-Yi
    Maire, Michael
    Belongie, Serge
    Hays, James
    Perona, Pietro
    Ramanan, Deva
    Dollar, Piotr
    Zitnick, C. Lawrence
    [J]. COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 : 740 - 755
  • [9] Feature Pyramid Networks for Object Detection
    Lin, Tsung-Yi
    Dollar, Piotr
    Girshick, Ross
    He, Kaiming
    Hariharan, Bharath
    Belongie, Serge
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 936 - 944
  • [10] Path Aggregation Network for Instance Segmentation
    Liu, Shu
    Qi, Lu
    Qin, Haifang
    Shi, Jianping
    Jia, Jiaya
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8759 - 8768