Adaptive feature fusion with attention mechanism for multi-scale target detection

被引:28
|
作者
Ju, Moran [1 ,2 ,3 ,4 ,5 ]
Luo, Jiangning [6 ]
Wang, Zhongbo [1 ,2 ,3 ,4 ,5 ]
Luo, Haibo [1 ,2 ,4 ,5 ]
机构
[1] Chinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Liaoning, Peoples R China
[2] Chinese Acad Sci, Inst Robot & Intelligent Mfg, Shenyang 110016, Liaoning, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[4] Chinese Acad Sci, Key Lab Opt Elect Informat Proc, Shenyang 110016, Liaoning, Peoples R China
[5] Key Lab Image Understanding & Comp Vis, Shenyang 110016, Liaoning, Peoples R China
[6] McGill Univ, Montreal, PQ H3A 0G4, Canada
来源
NEURAL COMPUTING & APPLICATIONS | 2021年 / 33卷 / 07期
关键词
Deep learning; Target detection; Adaptive feature fusion; Attention mechanism; RECOGNITION;
D O I
10.1007/s00521-020-05150-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To detect the targets of different sizes, multi-scale output is used by target detectors such as YOLO V3 and DSSD. To improve the detection performance, YOLO V3 and DSSD perform feature fusion by combining two adjacent scales. However, the feature fusion only between the adjacent scales is not sufficient. It hasn't made advantage of the features at other scales. What is more, as a common operation for feature fusion, concatenating can't provide a mechanism to learn the importance and correlation of the features at different scales. In this paper, we propose adaptive feature fusion with attention mechanism (AFFAM) for multi-scale target detection. AFFAM utilizes pathway layer and subpixel convolution layer to resize the feature maps, which is helpful to learn better and complex feature mapping. In addition, AFFAM utilizes global attention mechanism and spatial position attention mechanism, respectively, to learn the correlation of the channel features and the importance of the spatial features at different scales adaptively. Finally, we combine AFFAM with YOLO V3 to build an efficient multi-scale target detector. The comparative experiments are conducted on PASCAL VOC dataset, KITTI dataset and Smart UVM dataset. Compared with the state-of-the-art target detectors, YOLO V3 with AFFAM achieved 84.34% mean average precision (mAP) at 19.9 FPS on PASCAL VOC dataset, 87.2% mAP at 21 FPS on KITTI dataset and 99.22% mAP at 20.6 FPS on Smart UVM dataset which outperforms other advanced target detectors.
引用
收藏
页码:2769 / 2781
页数:13
相关论文
共 50 条
  • [41] Vehicle detection method based on adaptive multi-scale feature fusion network
    Shen, Xuanjing
    Li, Hanyu
    Huang, Yongping
    Wang, Yu
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (04)
  • [42] Detection of X-Ray Contraband by Adaptive and Multi-Scale Feature Fusion
    Sun, Jia'ao
    Dong, Yishan
    Guo, Jingyuan
    Li, Mingze
    Li, Shuaichao
    Lu, Shuhua
    [J]. Computer Engineering and Applications, 2024, 60 (02) : 96 - 102
  • [43] Flower bud detection model for hydroponic Chinese kale based on the fusion of attention mechanism and multi-scale feature
    注意力与多尺度特征融合的水培芥蓝花蕾检测
    [J]. Zhen, Wenbin (wenbinzhen@163.com), 1600, Chinese Society of Agricultural Engineering (37): : 161 - 168
  • [44] Peach Flower Density Detection Based on an Improved CNN Incorporating Attention Mechanism and Multi-Scale Feature Fusion
    Tao, Kun
    Wang, Aichen
    Shen, Yidie
    Lu, Zemin
    Peng, Futian
    Wei, Xinhua
    [J]. HORTICULTURAE, 2022, 8 (10)
  • [45] Multi-scale object detection in UAV images based on adaptive feature fusion
    Tan, Siqi
    Duan, Zhijian
    Pu, Longzhong
    [J]. PLOS ONE, 2024, 19 (03):
  • [46] Fine-Grained Detection Model Based on Attention Mechanism and Multi-Scale Feature Fusion for Cocoon Sorting
    Zheng, Han
    Guo, Xueqiang
    Ma, Yuejia
    Zeng, Xiaoxi
    Chen, Jun
    Zhang, Taohong
    [J]. AGRICULTURE-BASEL, 2024, 14 (05):
  • [47] Siamese Network with Multi-scale Feature Fusion and Dual Attention Mechanism for Template Matching
    Zhao, Kai
    He, Binbing
    Pan, Shiju
    Zhu, Yuan
    [J]. 2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6588 - 6592
  • [48] A method of single-shot target detection with multi-scale feature fusion and feature enhancement
    Qu, Zhong
    Shang, Xue
    Xia, Shu-Fang
    Yi, Tu-Ming
    Zhou, Dong-Yang
    [J]. IET IMAGE PROCESSING, 2022, 16 (06) : 1752 - 1763
  • [49] Small Object Detection in UAV Remote Sensing Images Based on Intra-Group Multi-Scale Fusion Attention and Adaptive Weighted Feature Fusion Mechanism
    Yuan, Zhe
    Gong, Jianglei
    Guo, Baolong
    Wang, Chao
    Liao, Nannan
    Song, Jiawei
    Wu, Qiming
    [J]. Remote Sensing, 2024, 16 (22)
  • [50] Ship Detection in SAR Images Based on Multi-Scale Feature Extraction and Adaptive Feature Fusion
    Zhou, Kexue
    Zhang, Min
    Wang, Hai
    Tan, Jinlin
    [J]. REMOTE SENSING, 2022, 14 (03)