MCANet: multi-scale contextual feature fusion network based on Atrous convolution

被引:7
|
作者
Li, Ke [1 ]
Liu, ZhanDong [1 ]
机构
[1] Xinjiang Normal Univ, Dept Comp Sci & Technol, 102 New Med Rd, Urumqi 830054, Xinjiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Object detection; Atrous convolution; YOLOv5; VisDrone; VOC;
D O I
10.1007/s11042-023-14800-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In past studies, atrous convolution is efficient in segmentation tasks to reinforce the receptive field and detection tasks. In addition, the attention module is efficient for feature extraction and enhancement. In this paper, we introduce atrous convolution, design a feature enhancement module, and utilize a plug-and-play technique, i.e., (AFE) module. Atrous convolution has been shown to be essential for expanding the perceptual field in past studies. We achieve this by fusing multiple layers of features of atrous convolution and adding a detection head to cope with the problem of varying object size scales. We achieve the purpose of extracting multi-scale contextual feature information while using an attention mechanism to effectively enhance the features and improve the overall multi-scale detection performance of the model. It can be added to a well-established backbone network or neck network. Therefore, based on this, we designed the C3 based on the atrous convolution (C3AT) module on the AFE module, replaced the C3 module in YOLOv5, and proposed the Multi-Scale Contextual Feature Enhancement Network (MCANet) as the neck network to obtain the final network structure. Experimental results indicate that the proposed method significantly improves inference speed and AP compared to the benchmark model. Single-model object detection results on the VisDrone2021 test set-dev dataset achieved 32.7% AP and 52.2%AP(50), a significant improvement of 8.1% AP and 11.4%AP(50) compared with the baseline model. The single-model object detection results on the VOC2007 test dataset reached 89.6% mAP.
引用
收藏
页码:34679 / 34702
页数:24
相关论文
共 50 条
  • [41] An Alzheimer's Disease classification network based on MRI utilizing diffusion maps for multi-scale feature fusion in graph convolution
    Yang, Zhi
    Li, Kang
    Gan, Haitao
    Huang, Zhongwei
    Shi, Ming
    Zhou, Ran
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2024, 21 (01) : 1554 - 1572
  • [42] Multi-Scale Feature Fusion for Coal-Rock Recognition Based on Completed Local Binary Pattern and Convolution Neural Network
    Liu, Xiaoyang
    Jing, Wei
    Zhou, Mingxuan
    Li, Yuxing
    ENTROPY, 2019, 21 (06)
  • [43] Multi-scale hierarchical feature fusion network for change detection
    Zheng, Hanhong
    Zhang, Mingyang
    Gong, Maoguo
    Qin, A. K.
    Liu, Tongfei
    Jiang, Fenlong
    PATTERN RECOGNITION, 2025, 161
  • [44] Double multi-scale feature fusion network for crowd counting
    Liu, Qian
    Fang, Jiongtao
    Zhong, Yixiong
    Wang, Cunbao
    Qi, Youwei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (34) : 81831 - 81855
  • [45] MFANet: Multi-scale feature fusion network with attention mechanism
    Wang, Gaihua
    Gan, Xin
    Cao, Qingcheng
    Zhai, Qianyu
    VISUAL COMPUTER, 2023, 39 (07): : 2969 - 2980
  • [46] MFANet: Multi-scale feature fusion network with attention mechanism
    Gaihua Wang
    Xin Gan
    Qingcheng Cao
    Qianyu Zhai
    The Visual Computer, 2023, 39 : 2969 - 2980
  • [47] Multi-Scale Feature Interactive Fusion Network for RGBT Tracking
    Xiao, Xianbing
    Xiong, Xingzhong
    Meng, Fanqin
    Chen, Zhen
    SENSORS, 2023, 23 (07)
  • [48] Multi-Scale Boosted Dehazing Network with Dense Feature Fusion
    Dong, Hang
    Pan, Jinshan
    Xiang, Lei
    Hu, Zhe
    Zhang, Xinyi
    Wang, Fei
    Yang, Ming-Hsuan
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2154 - 2164
  • [49] Kinship verification based on multi-scale feature fusion
    Yan C.
    Liu Y.
    Multimedia Tools and Applications, 2024, 83 (40) : 88069 - 88090
  • [50] Drone Detection Based on Multi-scale Feature Fusion
    Zeng, Zhenni
    Wang, Zhenning
    Qin, Lang
    Li, Hui
    2021 6TH INTERNATIONAL CONFERENCE ON UK-CHINA EMERGING TECHNOLOGIES (UCET 2021), 2021, : 194 - 198