Lightweight and high-precision object detection algorithm based on YOLO framework

被引：2

作者：

Fan Xin-chuan ^{[1
]}

Chen Chun-mei ^{[1
]}

机构：

[1] Southwest Univ Sci & Technol, Sch Informat Engn, Mianyang 621010, Sichuan, Peoples R China

来源：

CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS | 2023年 / 38卷 / 07期

关键词：

object detection; YOLOXs; mechanism of attention; lightweight; SIOU; Soft-NMS;

D O I：

10.37188/CJLCD.2022-0328

中图分类号：

O7 [晶体学];

学科分类号：

0702 ; 070205 ; 0703 ; 080501 ;

摘要：

Image-oriented multi-scale object detection algorithms often have the problem of mutual restriction between detection accuracy and system cost. Therefore,a lightweight and high-precision object detection algorithm based on YOLO framework is proposed. Under the YOLO framework,the mechanism of down-sampling and channel attention based on MobileNetv3 network is improved to accurately extract target features and reduce unnecessary overhead. The feature pyramid and single-stage headless fusion structure are designed,and different receptive fields are constructed to obtain different scale information,so as to enhance the adaptability of the algorithm for multi-scale targets. At the same time,SIOU is used as regression loss and Soft-NMS is used for redundant frame processing to improve the accuracy of the algorithm. Experiments are conducted on the MS COCO and UA-DETRAC. Compared with the original YOLOXs,the results show that the proposed improved algorithm reduces the number of model parameters and the computational cost reduced by 64. 98% and 57. 14% without reducing the accuracy. On the UADETRAC, mAP@0. 5 reaches 70. 5% which is improved by 3. 52%,and FPS increases by 14. 4%. The experimental results show that our algorithm greatly reduces the system overhead,improves the accuracy, and effectively guarantees the dual performance of detection.

引用

页码：945 / 954

页数：10

共 27 条

[1] Bochkovskiy A., 2020, ARXIV, DOI DOI 10.48550/ARXIV.2004.10934
[2] Soft-NMS - Improving Object Detection With One Line of Code
Bodla, Navaneeth
Singh, Bharat
Chellappa, Rama
Davis, Larry S.
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5562 - 5570
[3] RetinaFace: Single-shot Multi-level Face Localisation in the Wild
Deng, Jiankang
Guo, Jia
Ververas, Evangelos
Kotsia, Irene
Zafeiriou, Stefanos
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5202 - 5211
[4] Howard AG, 2017, Arxiv, DOI arXiv:1704.04861
[5] Ge Z, 2021, Arxiv, DOI [arXiv:2107.08430, DOI 10.48550/ARXIV.2107.08430]
[6] Gevorgyan Z, 2022, Arxiv, DOI [arXiv:2205.12740, DOI 10.48550/ARXIV.2205.12740, 10.48550/arxiv.2205.12740]
[7] GhostNet: More Features from Cheap Operations
Han, Kai
Wang, Yunhe
Tian, Qi
Guo, Jianyuan
Xu, Chunjing
Xu, Chang
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1577 - 1586
[8] He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
[9] Pedestrian multi-target tracking method based on YOLOv5 and person re-identification
He Yu-ting
Che Jin
Wu Jin-man
[J]. CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2022, 37 (07) : 880 - 890
[10] Searching for MobileNetV3
Howard, Andrew
Sandler, Mark
Chu, Grace
Chen, Liang-Chieh
Chen, Bo
Tan, Mingxing
Wang, Weijun
Zhu, Yukun
Pang, Ruoming
Vasudevan, Vijay
Le, Quoc V.
Adam, Hartwig
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1314 - 1324

← 1 2 3 →