MCANet: multi-scale contextual feature fusion network based on Atrous convolution

被引:7
|
作者
Li, Ke [1 ]
Liu, ZhanDong [1 ]
机构
[1] Xinjiang Normal Univ, Dept Comp Sci & Technol, 102 New Med Rd, Urumqi 830054, Xinjiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Object detection; Atrous convolution; YOLOv5; VisDrone; VOC;
D O I
10.1007/s11042-023-14800-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In past studies, atrous convolution is efficient in segmentation tasks to reinforce the receptive field and detection tasks. In addition, the attention module is efficient for feature extraction and enhancement. In this paper, we introduce atrous convolution, design a feature enhancement module, and utilize a plug-and-play technique, i.e., (AFE) module. Atrous convolution has been shown to be essential for expanding the perceptual field in past studies. We achieve this by fusing multiple layers of features of atrous convolution and adding a detection head to cope with the problem of varying object size scales. We achieve the purpose of extracting multi-scale contextual feature information while using an attention mechanism to effectively enhance the features and improve the overall multi-scale detection performance of the model. It can be added to a well-established backbone network or neck network. Therefore, based on this, we designed the C3 based on the atrous convolution (C3AT) module on the AFE module, replaced the C3 module in YOLOv5, and proposed the Multi-Scale Contextual Feature Enhancement Network (MCANet) as the neck network to obtain the final network structure. Experimental results indicate that the proposed method significantly improves inference speed and AP compared to the benchmark model. Single-model object detection results on the VisDrone2021 test set-dev dataset achieved 32.7% AP and 52.2%AP(50), a significant improvement of 8.1% AP and 11.4%AP(50) compared with the baseline model. The single-model object detection results on the VOC2007 test dataset reached 89.6% mAP.
引用
收藏
页码:34679 / 34702
页数:24
相关论文
共 50 条
  • [1] MCANet: multi-scale contextual feature fusion network based on Atrous convolution
    Ke Li
    ZhanDong Liu
    Multimedia Tools and Applications, 2023, 82 : 34679 - 34702
  • [2] Multi-scale dilated convolution of feature Fusion Network for Crowd counting
    Donghua Liu
    Guodong Wang
    Guangtao Zhai
    Multimedia Tools and Applications, 2022, 81 : 37939 - 37952
  • [3] Multi-scale dilated convolution of feature Fusion Network for Crowd counting
    Liu, Donghua
    Wang, Guodong
    Zhai, Guangtao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (26) : 37939 - 37952
  • [4] Contextual and Multi-Scale Feature Fusion Network for Traffic Sign Detection
    Zhang, Wei
    Wang, Qiang
    Fan, Huijie
    Tang, Yandong
    2020 10TH INSTITUTE OF ELECTRICAL AND ELECTRONICS ENGINEERS INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (IEEE-CYBER 2020), 2020, : 13 - 17
  • [5] Multi-Scale Aggregation Stereo Matching Network Based on Dense Grouping Atrous Convolution
    Zou, Qijie
    Zhang, Jie
    Chen, Shuang
    Gao, Bing
    Qin, Jing
    Dong, Aotian
    APPLIED SCIENCES-BASEL, 2023, 13 (12):
  • [6] Multi-layer Feature Fusion Network with Atrous Convolution for Pedestrian Detection
    Li, You
    Zhang, Qingxuan
    Zhang, Yulei
    2019 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, AUTOMATION AND CONTROL TECHNOLOGIES (AIACT 2019), 2019, 1267
  • [7] Typhoon Classification Model Based on Multi-Scale Convolution Feature Fusion
    Lu Peng
    Zou Peiqi
    Zou Guoliang
    LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (16)
  • [8] Historical Document Text Binarization using Atrous Convolution and Multi-scale Feature Decoder
    Rasyidi, Hanif
    Khan, Salman
    2019 DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2019, : 537 - 544
  • [9] Iris recognition based on local circular Gabor filters and multi-scale convolution feature fusion network
    Sun, Jie
    Zhao, Shipeng
    Yu, Yanan
    Wang, Xuan
    Zhou, Lijian
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (23) : 33051 - 33065
  • [10] MLFNet- Point Cloud Semantic Segmentation Convolution Network Based on Multi-Scale Feature Fusion
    Yang, Jingfang
    Zou, Bochang
    Qiu, Huadong
    Li, Zhi
    IEEE ACCESS, 2021, 9 : 44950 - 44962