MCANet: multi-scale contextual feature fusion network based on Atrous convolution

Cited by: 7

Authors:

Li, Ke [1]

Liu, ZhanDong [1]

Affiliations:

[1] Xinjiang Normal Univ, Dept Comp Sci & Technol, 102 New Med Rd, Urumqi 830054, Xinjiang, Peoples R China

Source:

MULTIMEDIA TOOLS AND APPLICATIONS | 2023 / Vol. 82 / Issue 22

Funding:

National Natural Science Foundation of China;

Keywords:

Object detection; Atrous convolution; YOLOv5; VisDrone; VOC;

DOI:

10.1007/s11042-023-14800-8

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Past studies have shown that atrous convolution effectively enlarges the receptive field in both segmentation and detection tasks, and that attention modules are effective for feature extraction and enhancement. In this paper, we use atrous convolution to design a plug-and-play feature enhancement module, the AFE module. We fuse multiple layers of atrous-convolution features and add a detection head to cope with varying object scales. This extracts multi-scale contextual feature information, while an attention mechanism enhances the features and improves the model's overall multi-scale detection performance. The module can be added to a well-established backbone or neck network. Building on the AFE module, we design a C3 module based on atrous convolution (C3AT) to replace the C3 module in YOLOv5, and propose the Multi-Scale Contextual Feature Enhancement Network (MCANet) as the neck network to obtain the final network structure. Experimental results indicate that the proposed method significantly improves inference speed and AP compared with the baseline model. On the VisDrone2021 test-dev set, single-model object detection achieves 32.7% AP and 52.2% AP50, an improvement of 8.1% AP and 11.4% AP50 over the baseline model. On the VOC2007 test set, single-model object detection reaches 89.6% mAP.
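As a rough illustration of why atrous (dilated) convolution enlarges the receptive field, the sketch below computes the span covered by a dilated kernel, k + (k - 1)(d - 1), and applies a 1-D atrous filter in plain Python. It is not the paper's AFE or C3AT module; the function names `effective_kernel_size` and `atrous_conv1d` are hypothetical and chosen only for this example.

```python
def effective_kernel_size(kernel_size: int, dilation: int) -> int:
    """Span covered by a dilated kernel: k + (k - 1) * (d - 1)."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def atrous_conv1d(signal, kernel, dilation=1):
    """1-D atrous cross-correlation, stride 1, no padding.

    Kernel taps are spaced `dilation` samples apart, so the same
    number of weights covers a wider stretch of the input.
    """
    span = effective_kernel_size(len(kernel), dilation)
    out = []
    for start in range(len(signal) - span + 1):
        out.append(
            sum(w * signal[start + i * dilation] for i, w in enumerate(kernel))
        )
    return out


if __name__ == "__main__":
    # A 3-tap kernel covers 3, 5, and 9 input samples at dilation 1, 2, and 4.
    for d in (1, 2, 4):
        print(d, effective_kernel_size(3, d))
```

With the same three weights, a larger dilation lets each output see a wider context at no extra parameter cost, which is the property the abstract relies on when it fuses features from several atrous layers.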


Pages: 34679-34702

Page count: 24

