MCANet: multi-scale contextual feature fusion network based on Atrous convolution

被引：7

作者：

Li, Ke ^{[1
]}

Liu, ZhanDong ^{[1
]}

机构：

[1] Xinjiang Normal Univ, Dept Comp Sci & Technol, 102 New Med Rd, Urumqi 830054, Xinjiang, Peoples R China

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2023年 / 82卷 / 22期

基金：

中国国家自然科学基金;

关键词：

Object detection; Atrous convolution; YOLOv5; VisDrone; VOC;

D O I：

10.1007/s11042-023-14800-8

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In past studies, atrous convolution is efficient in segmentation tasks to reinforce the receptive field and detection tasks. In addition, the attention module is efficient for feature extraction and enhancement. In this paper, we introduce atrous convolution, design a feature enhancement module, and utilize a plug-and-play technique, i.e., (AFE) module. Atrous convolution has been shown to be essential for expanding the perceptual field in past studies. We achieve this by fusing multiple layers of features of atrous convolution and adding a detection head to cope with the problem of varying object size scales. We achieve the purpose of extracting multi-scale contextual feature information while using an attention mechanism to effectively enhance the features and improve the overall multi-scale detection performance of the model. It can be added to a well-established backbone network or neck network. Therefore, based on this, we designed the C3 based on the atrous convolution (C3AT) module on the AFE module, replaced the C3 module in YOLOv5, and proposed the Multi-Scale Contextual Feature Enhancement Network (MCANet) as the neck network to obtain the final network structure. Experimental results indicate that the proposed method significantly improves inference speed and AP compared to the benchmark model. Single-model object detection results on the VisDrone2021 test set-dev dataset achieved 32.7% AP and 52.2%AP(50), a significant improvement of 8.1% AP and 11.4%AP(50) compared with the baseline model. The single-model object detection results on the VOC2007 test dataset reached 89.6% mAP.

引用

页码：34679 / 34702

页数：24

共 50 条

[1] MCANet: multi-scale contextual feature fusion network based on Atrous convolution
Ke Li
ZhanDong Liu
Multimedia Tools and Applications, 2023, 82 : 34679 - 34702
[2] Multi-scale dilated convolution of feature Fusion Network for Crowd counting
Donghua Liu
Guodong Wang
Guangtao Zhai
Multimedia Tools and Applications, 2022, 81 : 37939 - 37952
[3] Multi-scale dilated convolution of feature Fusion Network for Crowd counting
Liu, Donghua
Wang, Guodong
Zhai, Guangtao
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (26) : 37939 - 37952
[4] Contextual and Multi-Scale Feature Fusion Network for Traffic Sign Detection
Zhang, Wei
Wang, Qiang
Fan, Huijie
Tang, Yandong
2020 10TH INSTITUTE OF ELECTRICAL AND ELECTRONICS ENGINEERS INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (IEEE-CYBER 2020), 2020, : 13 - 17
[5] Multi-Scale Aggregation Stereo Matching Network Based on Dense Grouping Atrous Convolution
Zou, Qijie
Zhang, Jie
Chen, Shuang
Gao, Bing
Qin, Jing
Dong, Aotian
APPLIED SCIENCES-BASEL, 2023, 13 (12):
[6] Multi-layer Feature Fusion Network with Atrous Convolution for Pedestrian Detection
Li, You
Zhang, Qingxuan
Zhang, Yulei
2019 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, AUTOMATION AND CONTROL TECHNOLOGIES (AIACT 2019), 2019, 1267
[7] Typhoon Classification Model Based on Multi-Scale Convolution Feature Fusion
Lu Peng
Zou Peiqi
Zou Guoliang
LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (16)
[8] Historical Document Text Binarization using Atrous Convolution and Multi-scale Feature Decoder
Rasyidi, Hanif
Khan, Salman
2019 DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2019, : 537 - 544
[9] Iris recognition based on local circular Gabor filters and multi-scale convolution feature fusion network
Sun, Jie
Zhao, Shipeng
Yu, Yanan
Wang, Xuan
Zhou, Lijian
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (23) : 33051 - 33065
[10] MLFNet- Point Cloud Semantic Segmentation Convolution Network Based on Multi-Scale Feature Fusion
Yang, Jingfang
Zou, Bochang
Qiu, Huadong
Li, Zhi
IEEE ACCESS, 2021, 9 : 44950 - 44962

← 1 2 3 4 5 →