Small Object Detection using Multi-scale Feature Fusion and Attention

Cited by: 0
Authors
Liu, Baokai [1]
Du, Shiqiang [2]
Li, Jiacheng [1]
Wang, Jianhua [1]
Liu, Wenjie [1]
Affiliations
[1] Northwest Minzu Univ, Minist Educ, Chinese Natl Informat Technol Res Inst, Key Lab Chinas Ethn Languages & Informat Technol, Lanzhou 730030, Gansu, Peoples R China
[2] Northwest Minzu Univ, Coll Math & Comp Sci, Lanzhou 730030, Gansu, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Attention Mechanism; Dilated Convolution; Multi-scale Feature Fusion; Channel Attention; Spatial Attention
DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Deep learning has greatly improved object detection performance. Small object detection nevertheless remains unsatisfactory, owing to the few features available for small objects, limitations of the network structure, sample imbalance, and other factors. To address this problem, this paper proposes a method that combines multi-scale feature fusion with dilated convolution. Dilated convolution is used to expand the receptive field of feature maps at different scales, and both high-level and low-level semantic information is then extracted from the backbone network. The resulting feature maps, which have different receptive fields, are fused to produce the final feature map used for prediction. In addition, a series of channel attention and spatial attention mechanisms is added to the network to better capture the context of objects in the image. Experiments show that the method achieves higher accuracy than the traditional YOLOv3 network on small objects. With 640×640 input images, it reaches 31.5% accuracy on small objects in MS COCO 2017, an improvement of 4 points over YOLOv5.
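The abstract's claim that dilated convolution expands the receptive field can be illustrated with simple arithmetic. The sketch below is not the authors' code: the function names and the dilation rates (1, 2, 4) are assumptions chosen for illustration, since the abstract does not state which rates or layer counts the network uses. It relies only on the standard relation that a kernel of size k with dilation d spans k + (k - 1)(d - 1) input positions.

```python
def effective_kernel_size(kernel_size: int, dilation: int) -> int:
    """Span (in input positions) covered by one dilated kernel."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def receptive_field(layers):
    """Receptive field of a stack of convolution layers.

    layers: list of (kernel_size, stride, dilation) tuples,
            ordered from the input towards the output.
    """
    rf = 1    # one output position initially sees one input position
    jump = 1  # distance in input positions between adjacent outputs
    for kernel_size, stride, dilation in layers:
        rf += (effective_kernel_size(kernel_size, dilation) - 1) * jump
        jump *= stride
    return rf


# Hypothetical comparison: three plain 3x3 layers versus three 3x3
# layers with assumed dilation rates 1, 2 and 4 (stride 1 throughout).
plain = receptive_field([(3, 1, 1), (3, 1, 1), (3, 1, 1)])
dilated = receptive_field([(3, 1, 1), (3, 1, 2), (3, 1, 4)])
print(plain, dilated)
```

Under these assumed settings the dilated stack covers a wider input region with the same number of weights, which is the property the abstract relies on when fusing feature maps of different receptive fields.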
Pages: 7246-7251
Number of pages: 6