Joint-attention feature fusion network and dual-adaptive NMS for object detection

Cited by: 35
Authors
Ma, Wentao [1]
Zhou, Tongqing [1]
Qin, Jiaohua [2]
Zhou, Qingyang [2]
Cai, Zhiping [1]
Affiliations
[1] Natl Univ Def Technol, Coll Comp, Changsha 410073, Hunan, Peoples R China
[2] Cent South Univ Forestry & Technol, Coll Comp Sci & Informat Technol, Changsha 410004, Hunan, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Object detection; Joint-attention; Adaptive NMS;
DOI
10.1016/j.knosys.2022.108213
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Attention mechanisms and Non-Maximum Suppression (NMS) have proven to be effective components in object detection. However, fusing features across scales and layers with a single attention mechanism does not always yield satisfactory performance and may introduce redundant information that degrades the results. NMS methods, in turn, generally face a single-constant-threshold dilemma: a lower threshold misses highly overlapped object instances, while a higher one admits more false positives. How to optimize the different dimensions of correlation in feature mapping and how to set the NMS threshold adaptively therefore remain obstacles to effective object detection. Because addressing each problem independently leads to suboptimal detection, this paper proposes feeding the informative feature representation from a joint-attention feature fusion network into adaptive NMS for a comprehensive performance improvement. Specifically, we embed two types of attention modules in a three-level Feature Pyramid Network (FPN): the channel-attention module enhances feature representation by re-evaluating the relationships between channels from a global perspective, and the position-attention module exploits the correlation between features to discover rich contextual information. Furthermore, we develop dual-adaptive NMS to dynamically adjust the suppression threshold according to object-instance density: the threshold rises where instances cluster and decays where they are sparse. The proposed method is evaluated on the COCO dataset, and extensive experimental results demonstrate its superior performance compared with existing methods. © 2022 Elsevier B.V. All rights reserved.
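The threshold behaviour described in the abstract (higher in crowded regions, lower in sparse ones) can be illustrated with a minimal sketch. This is not the paper's dual-adaptive NMS formulation, which the abstract does not spell out. It assumes a per-box density estimate in [0, 1] is already available from some upstream predictor, and simply clips that estimate into a threshold range. The names `density_adaptive_nms`, `iou_one_to_many`, `density`, `t_min`, and `t_max` are hypothetical and chosen for illustration only.

```python
import numpy as np


def iou_one_to_many(box, boxes):
    """IoU between one [x1, y1, x2, y2] box and an (N, 4) array of boxes."""
    x1 = np.maximum(box[0], boxes[:, 0])
    y1 = np.maximum(box[1], boxes[:, 1])
    x2 = np.minimum(box[2], boxes[:, 2])
    y2 = np.minimum(box[3], boxes[:, 3])
    inter = np.clip(x2 - x1, 0, None) * np.clip(y2 - y1, 0, None)
    area = (box[2] - box[0]) * (box[3] - box[1])
    areas = (boxes[:, 2] - boxes[:, 0]) * (boxes[:, 3] - boxes[:, 1])
    return inter / (area + areas - inter)


def density_adaptive_nms(boxes, scores, density, t_min=0.3, t_max=0.7):
    """Greedy NMS whose IoU threshold follows the kept box's density.

    Assumption (not from the paper): the threshold for a kept box is its
    density estimate clipped to [t_min, t_max], so it rises in crowded
    regions and decays in sparse ones.
    """
    order = np.argsort(-scores)  # indices sorted by descending score
    keep = []
    while order.size > 0:
        m = order[0]
        keep.append(int(m))
        rest = order[1:]
        if rest.size == 0:
            break
        thresh = float(np.clip(density[m], t_min, t_max))
        ious = iou_one_to_many(boxes[m], boxes[rest])
        # Keep only candidates that overlap the kept box by at most thresh.
        order = rest[ious <= thresh]
    return keep


# Boxes 0 and 1 overlap with IoU of about 0.43; box 2 is isolated.
boxes = np.array([[0, 0, 10, 10], [4, 0, 14, 10], [50, 50, 60, 60]], dtype=float)
scores = np.array([0.9, 0.8, 0.7])

sparse = density_adaptive_nms(boxes, scores, np.array([0.0, 0.0, 0.0]))
crowded = density_adaptive_nms(boxes, scores, np.array([0.6, 0.6, 0.0]))
print(sparse, crowded)
```

With a low density estimate the threshold falls to `t_min` and box 1 is suppressed as a likely duplicate; with a high estimate the threshold rises and box 1 survives as a plausible separate instance. A fixed-threshold NMS would have to choose one of these outcomes for every image.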
Pages: 10