MCANet: multi-scale contextual feature fusion network based on Atrous convolution

Cited by: 7
Authors
Li, Ke [1]
Liu, ZhanDong [1]
Affiliations
[1] Xinjiang Normal Univ, Dept Comp Sci & Technol, 102 New Med Rd, Urumqi 830054, Xinjiang, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Object detection; Atrous convolution; YOLOv5; VisDrone; VOC;
DOI
10.1007/s11042-023-14800-8
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Past studies have shown that atrous convolution effectively enlarges the receptive field in both segmentation and detection tasks, and that attention modules are effective for feature extraction and enhancement. In this paper, we use atrous convolution to design a plug-and-play feature enhancement module, the (AFE) module. The module fuses features from multiple atrous convolution layers and, together with an added detection head, copes with the problem of varying object scales. It extracts multi-scale contextual feature information while using an attention mechanism to enhance the features, which improves the overall multi-scale detection performance of the model. It can be added to a well-established backbone or neck network. Building on the AFE module, we design the C3 based on atrous convolution (C3AT) module, use it to replace the C3 module in YOLOv5, and propose the Multi-Scale Contextual Feature Enhancement Network (MCANet) as the neck network to obtain the final network structure. Experimental results indicate that the proposed method significantly improves inference speed and AP compared with the baseline model. Single-model object detection on the VisDrone2021 test-dev set achieves 32.7% AP and 52.2% AP(50), improvements of 8.1% AP and 11.4% AP(50) over the baseline model. Single-model object detection on the VOC2007 test set reaches 89.6% mAP.
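The abstract's central idea, that atrous convolution widens the receptive field and that several atrous branches can be fused, can be illustrated with a minimal 1-D sketch in plain Python. This is not the authors' AFE or C3AT module: the function names, the 3-tap kernel, the dilation rates, and the averaging fusion are all assumptions chosen for illustration, and the attention step is omitted.

```python
def effective_kernel_size(kernel_size: int, dilation: int) -> int:
    """Input span covered by one dilated kernel: k + (k - 1) * (d - 1)."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def atrous_conv1d_same(signal, kernel, dilation=1):
    """1-D atrous (dilated) cross-correlation, stride 1, zero 'same' padding.

    Assumes an odd-length kernel so the output aligns with the input.
    """
    k = len(kernel)
    if k % 2 == 0:
        raise ValueError("kernel length must be odd")
    pad = (k - 1) // 2 * dilation
    padded = [0.0] * pad + list(signal) + [0.0] * pad
    return [
        sum(w * padded[pos + i * dilation] for i, w in enumerate(kernel))
        for pos in range(len(signal))
    ]


def fuse_multi_scale(signal, kernel, dilations):
    """Hypothetical fusion: element-wise mean of branches with different dilations."""
    branches = [atrous_conv1d_same(signal, kernel, d) for d in dilations]
    return [sum(values) / len(branches) for values in zip(*branches)]


if __name__ == "__main__":
    impulse = [0, 0, 0, 0, 1, 0, 0, 0, 0]
    kernel = [1, 1, 1]
    for d in (1, 2, 4):
        # Same three weights, wider span as the dilation grows.
        print(d, effective_kernel_size(len(kernel), d))
    print(fuse_multi_scale(impulse, kernel, dilations=(1, 2)))
```

With the same three weights, a larger dilation spreads the taps further apart, so each output position sees a wider context at no extra parameter cost. Averaging the branches is only one possible fusion choice; concatenation followed by a learned projection is another common one.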
Pages: 34679-34702
Number of pages: 24