Lightweight and high-precision object detection algorithm based on YOLO framework

被引:2
|
作者
Fan Xin-chuan [1 ]
Chen Chun-mei [1 ]
机构
[1] Southwest Univ Sci & Technol, Sch Informat Engn, Mianyang 621010, Sichuan, Peoples R China
关键词
object detection; YOLOXs; mechanism of attention; lightweight; SIOU; Soft-NMS;
D O I
10.37188/CJLCD.2022-0328
中图分类号
O7 [晶体学];
学科分类号
0702 ; 070205 ; 0703 ; 080501 ;
摘要
Image-oriented multi-scale object detection algorithms often have the problem of mutual restriction between detection accuracy and system cost. Therefore,a lightweight and high-precision object detection algorithm based on YOLO framework is proposed. Under the YOLO framework,the mechanism of down-sampling and channel attention based on MobileNetv3 network is improved to accurately extract target features and reduce unnecessary overhead. The feature pyramid and single-stage headless fusion structure are designed,and different receptive fields are constructed to obtain different scale information,so as to enhance the adaptability of the algorithm for multi-scale targets. At the same time,SIOU is used as regression loss and Soft-NMS is used for redundant frame processing to improve the accuracy of the algorithm. Experiments are conducted on the MS COCO and UA-DETRAC. Compared with the original YOLOXs,the results show that the proposed improved algorithm reduces the number of model parameters and the computational cost reduced by 64. 98% and 57. 14% without reducing the accuracy. On the UADETRAC, mAP@0. 5 reaches 70. 5% which is improved by 3. 52%,and FPS increases by 14. 4%. The experimental results show that our algorithm greatly reduces the system overhead,improves the accuracy, and effectively guarantees the dual performance of detection.
引用
收藏
页码:945 / 954
页数:10
相关论文
共 27 条
  • [1] Bochkovskiy A., 2020, ARXIV, DOI DOI 10.48550/ARXIV.2004.10934
  • [2] Soft-NMS - Improving Object Detection With One Line of Code
    Bodla, Navaneeth
    Singh, Bharat
    Chellappa, Rama
    Davis, Larry S.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5562 - 5570
  • [3] RetinaFace: Single-shot Multi-level Face Localisation in the Wild
    Deng, Jiankang
    Guo, Jia
    Ververas, Evangelos
    Kotsia, Irene
    Zafeiriou, Stefanos
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5202 - 5211
  • [4] Howard AG, 2017, Arxiv, DOI arXiv:1704.04861
  • [5] Ge Z, 2021, Arxiv, DOI [arXiv:2107.08430, DOI 10.48550/ARXIV.2107.08430]
  • [6] Gevorgyan Z, 2022, Arxiv, DOI [arXiv:2205.12740, DOI 10.48550/ARXIV.2205.12740, 10.48550/arxiv.2205.12740]
  • [7] GhostNet: More Features from Cheap Operations
    Han, Kai
    Wang, Yunhe
    Tian, Qi
    Guo, Jianyuan
    Xu, Chunjing
    Xu, Chang
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1577 - 1586
  • [8] He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
  • [9] Pedestrian multi-target tracking method based on YOLOv5 and person re-identification
    He Yu-ting
    Che Jin
    Wu Jin-man
    [J]. CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2022, 37 (07) : 880 - 890
  • [10] Searching for MobileNetV3
    Howard, Andrew
    Sandler, Mark
    Chu, Grace
    Chen, Liang-Chieh
    Chen, Bo
    Tan, Mingxing
    Wang, Weijun
    Zhu, Yukun
    Pang, Ruoming
    Vasudevan, Vijay
    Le, Quoc V.
    Adam, Hartwig
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1314 - 1324