Small Object Detection Method Based on Global Multi-Level Perception and Dynamic Region Aggregation

被引:6
|
作者
Zhu, Zhiqin [1 ]
Zheng, Renzhong [1 ]
Qi, Guanqiu [2 ]
Li, Shuang [1 ]
Li, Yuanyuan [1 ]
Gao, Xinbo [1 ]
机构
[1] Chongqing Univ Posts & Telecommun, Coll Automat, Chongqing 400065, Peoples R China
[2] State Univ New York Buffalo State, Comp Informat Syst Dept, Buffalo, NY 14222 USA
基金
中国国家自然科学基金;
关键词
Feature extraction; Object detection; Noise; Data augmentation; Detectors; Task analysis; Semantics; Small object; sparse strategy; global multi-level perception; dynamic region aggregation; CONVOLUTIONAL NEURAL-NETWORK; AWARE;
D O I
10.1109/TCSVT.2024.3402097
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the field of object detection, detecting small objects is an important and challenging task. However, most existing methods tend to focus on designing complex network structures, lack attention to global representation, and ignore redundant noise and dense distribution of small objects in complex networks. To address the above problems, this paper proposes a small object detection method based on global multi-level perception and dynamic region aggregation. The method achieves accurate detection by dynamically aggregating effective features within a region while fully perceiving the features. This method mainly consists of two modules: global multi-level perception module and dynamic region aggregation module. In the global multi-level perception module, self-attention is used to perceive the global region, and its linear transformation is mapped through a convolutional network to increase the local details of global perception, thereby obtaining more refined global information. The dynamic region aggregation module, devised with a sparse strategy in mind, selectively interacts with relevant features. This design allows aggregation of key features of individual instances, effectively mitigating noise interference. Consequently, this approach addresses the challenges associated with densely distributed targets and enhances the model's ability to discriminate on a fine-grained level. This proposed method was evaluated on two popular datasets. Experimental results show that this method outperforms state-of-the-art methods in small object detection tasks, demonstrating good performance and potential applications.
引用
收藏
页码:10011 / 10022
页数:12
相关论文
共 50 条
  • [1] Multi-level Proposal Relations Aggregation for Video Object Detection
    Yu, Chongkai
    Chen, Wenjie
    Wu, Bing
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I, 2022, 13529 : 734 - 745
  • [2] Multi-Level Fusion Supervised Saliency Object Detection Based on Global Response
    Chen, Hui
    Peng, Li
    Computer Engineering and Applications, 2023, 59 (24) : 238 - 247
  • [3] An Object Detection Method Combining Multi-Level Feature Fusion and Region Channel Attention
    Zhu, Ge
    Wei, Zizun
    Lin, Feng
    IEEE ACCESS, 2021, 9 : 25101 - 25109
  • [4] An Object Detection Method Combining Multi-Level Feature Fusion and Region Channel Attention
    Zhu, Ge
    Wei, Zizun
    Lin, Feng
    IEEE Access, 2021, 9 : 25101 - 25109
  • [5] Multi-level object detection by multi-sensor perception of traffic scenes
    Yuan, Sheng
    Zhang, Qi
    Zhu, Li
    Wang, Su
    Zang, Yujie
    Zhao, Xi
    NEUROCOMPUTING, 2022, 514 : 486 - 499
  • [6] Multi-level loss object tracking adversarial attack method based on spatial perception
    Cheng X.
    Wang Y.
    Zhang N.
    Fu Z.
    Chen B.
    Zhao G.
    Tongxin Xuebao/Journal on Communications, 2021, 42 (11): : 242 - 254
  • [7] Video object detection algorithm based on multi-level feature aggregation under mixed sampler
    Qin S.
    Gai S.
    Da F.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (01): : 10 - 19
  • [8] MAMBA: Multi-level Aggregation via Memory Bank for Video Object Detection
    Sun, Guanxiong
    Hua, Yang
    Hu, Guosheng
    Robertson, Neil
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 2620 - 2627
  • [9] Attention guided multi-level feature aggregation network for camouflaged object detection
    Wang, Anzhi
    Ren, Chunhong
    Zhao, Shuang
    Mu, Shibiao
    IMAGE AND VISION COMPUTING, 2024, 144
  • [10] Vehicle object detection method based on candidate region aggregation
    Zhang, Luyang
    Wang, Haitao
    Wang, Xinyao
    Liu, Qiang
    Wang, Huaibin
    Wang, Hailong
    PATTERN ANALYSIS AND APPLICATIONS, 2021, 24 (04) : 1635 - 1647