MCANet: multi-scale contextual feature fusion network based on Atrous convolution

被引:0
|
作者
Ke Li
ZhanDong Liu
机构
[1] Xinjiang Normal University,Department of Computer Science and Technology
来源
关键词
Object detection; Atrous convolution; YOLOv5; VisDrone; VOC;
D O I
暂无
中图分类号
学科分类号
摘要
In past studies, atrous convolution is efficient in segmentation tasks to reinforce the receptive field and detection tasks. In addition, the attention module is efficient for feature extraction and enhancement. In this paper, we introduce atrous convolution, design a feature enhancement module, and utilize a plug-and-play technique, i.e., (AFE) module. Atrous convolution has been shown to be essential for expanding the perceptual field in past studies. We achieve this by fusing multiple layers of features of atrous convolution and adding a detection head to cope with the problem of varying object size scales. We achieve the purpose of extracting multi-scale contextual feature information while using an attention mechanism to effectively enhance the features and improve the overall multi-scale detection performance of the model. It can be added to a well-established backbone network or neck network. Therefore, based on this, we designed the C3 based on the atrous convolution (C3AT) module on the AFE module, replaced the C3 module in YOLOv5, and proposed the Multi-Scale Contextual Feature Enhancement Network (MCANet) as the neck network to obtain the final network structure. Experimental results indicate that the proposed method significantly improves inference speed and AP compared to the benchmark model. Single-model object detection results on the VisDrone2021 test set-dev dataset achieved 32.7% AP and 52.2%AP50, a significant improvement of 8.1% AP and 11.4%AP50 compared with the baseline model. The single-model object detection results on the VOC2007 test dataset reached 89.6% mAP.
引用
收藏
页码:34679 / 34702
页数:23
相关论文
共 50 条
  • [1] MCANet: multi-scale contextual feature fusion network based on Atrous convolution
    Li, Ke
    Liu, ZhanDong
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (22) : 34679 - 34702
  • [2] Multi-scale dilated convolution of feature Fusion Network for Crowd counting
    Liu, Donghua
    Wang, Guodong
    Zhai, Guangtao
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (26) : 37939 - 37952
  • [3] Multi-scale dilated convolution of feature Fusion Network for Crowd counting
    Donghua Liu
    Guodong Wang
    Guangtao Zhai
    [J]. Multimedia Tools and Applications, 2022, 81 : 37939 - 37952
  • [4] Contextual and Multi-Scale Feature Fusion Network for Traffic Sign Detection
    Zhang, Wei
    Wang, Qiang
    Fan, Huijie
    Tang, Yandong
    [J]. 2020 10TH INSTITUTE OF ELECTRICAL AND ELECTRONICS ENGINEERS INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (IEEE-CYBER 2020), 2020, : 13 - 17
  • [5] Multi-Scale Aggregation Stereo Matching Network Based on Dense Grouping Atrous Convolution
    Zou, Qijie
    Zhang, Jie
    Chen, Shuang
    Gao, Bing
    Qin, Jing
    Dong, Aotian
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (12):
  • [6] Multi-layer Feature Fusion Network with Atrous Convolution for Pedestrian Detection
    Li, You
    Zhang, Qingxuan
    Zhang, Yulei
    [J]. 2019 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, AUTOMATION AND CONTROL TECHNOLOGIES (AIACT 2019), 2019, 1267
  • [7] Typhoon Classification Model Based on Multi-Scale Convolution Feature Fusion
    Lu Peng
    Zou Peiqi
    Zou Guoliang
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (16)
  • [8] Historical Document Text Binarization using Atrous Convolution and Multi-scale Feature Decoder
    Rasyidi, Hanif
    Khan, Salman
    [J]. 2019 DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2019, : 537 - 544
  • [9] Iris recognition based on local circular Gabor filters and multi-scale convolution feature fusion network
    Sun, Jie
    Zhao, Shipeng
    Yu, Yanan
    Wang, Xuan
    Zhou, Lijian
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (23) : 33051 - 33065
  • [10] MLFNet- Point Cloud Semantic Segmentation Convolution Network Based on Multi-Scale Feature Fusion
    Yang, Jingfang
    Zou, Bochang
    Qiu, Huadong
    Li, Zhi
    [J]. IEEE ACCESS, 2021, 9 : 44950 - 44962