A Lightweight YOLO Object Detection Algorithm Based on Bidirectional Multi-Scale Feature Enhancement

Cited by: 1
Authors
Liu, Qunpo [1, 2]
Zhang, Jingwen [1]
Zhang, Zhuoran [1]
Bu, Xuhui [1, 2]
Hanajima, Naohiko [2, 3]
Affiliations
[1] Henan Polytech Univ, Sch Elect Engn & Automat, Jiaozuo 454000, Henan, Peoples R China
[2] Henan Intelligent Equipment, Int Joint Lab Direct Drive & Control, Zhengzhou 454000, Henan, Peoples R China
[3] Muroran Inst Technol, Coll Informat & Syst, Muroran, Hokkaido 0508585, Japan
Funding
National Natural Science Foundation of China
Keywords
attention modules; bidirectional multiscale feature enhancements; lightweight models; object detections; weighted fusions; MODEL;
DOI
10.1002/adts.202301025
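Chinese Library Classification (CLC)
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes
07; 0710; 09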
Abstract
This paper proposes a lightweight YOLO object detection algorithm based on bidirectional multi-scale feature enhancement. It addresses two weaknesses of the original YOLOv5 algorithm: it does not make full use of the relationships between feature layers, which loses target semantic information, and it has a large number of parameters. First, a bidirectional multi-scale feature-enhanced weighted fusion backbone network is constructed to extract target features repeatedly. It strengthens the fusion of shallow detail features with high-level semantic information and so captures richer multi-scale semantic information. Second, the NCA attention module is built and integrated into the feature fusion network to enhance the critical features of the target region. Finally, the Ghost module replaces the convolutional blocks of the original network, which lightens the model and reduces network complexity and training difficulty. Experimental results show that the improved YOLOv5 algorithm achieves 78.8% mAP@0.5 on the PASCAL VOC2012 dataset at 62.5 FPS, which is 1.5% higher than the original algorithm, and that the number of parameters is reduced by 43.6%. On the self-made metal foreign object dataset, mAP@0.5 reaches 98.4% at 58.8 FPS, which meets the requirements of end-device deployment and real-time detection.
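To give a rough sense of why replacing ordinary convolutions with Ghost modules reduces the parameter count, the sketch below compares the weights of one standard convolution layer with those of one Ghost-style layer. It assumes the commonly described Ghost design (a primary convolution that produces a fraction of the output channels, plus cheap depthwise operations that generate the rest). The function names, channel sizes, kernel sizes, and ratio are hypothetical and are not taken from the paper, so the result illustrates a single layer and does not reproduce the 43.6% network-level figure reported in the abstract.

```python
import math


def conv_params(c_in: int, c_out: int, k: int) -> int:
    """Weights in a standard k x k convolution (bias and BatchNorm ignored)."""
    return c_in * c_out * k * k


def ghost_params(c_in: int, c_out: int, k: int = 1, ratio: int = 2, dw_k: int = 3) -> int:
    """Weights in a Ghost-style layer (assumed design, not the paper's exact module).

    A primary k x k convolution produces ceil(c_out / ratio) intrinsic channels;
    depthwise dw_k x dw_k filters then derive the remaining channels from them.
    """
    intrinsic = math.ceil(c_out / ratio)
    primary = c_in * intrinsic * k * k
    # Each depthwise filter reads a single intrinsic channel.
    cheap = intrinsic * (ratio - 1) * dw_k * dw_k
    return primary + cheap


# Hypothetical layer sizes chosen only for illustration.
standard = conv_params(c_in=128, c_out=256, k=3)
ghost = ghost_params(c_in=128, c_out=256, k=3, ratio=2, dw_k=3)
reduction = 1 - ghost / standard

print(f"standard conv: {standard} weights")
print(f"ghost layer:   {ghost} weights")
print(f"reduction:     {reduction:.1%}")
```

With a ratio of 2, the layer-level saving approaches one half. A whole-network saving will generally be smaller, because not every layer is replaced and other components keep their parameters.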
Pages: 13