Small Object Detection Method Based on Global Multi-Level Perception and Dynamic Region Aggregation

被引:6
|
作者
Zhu, Zhiqin [1 ]
Zheng, Renzhong [1 ]
Qi, Guanqiu [2 ]
Li, Shuang [1 ]
Li, Yuanyuan [1 ]
Gao, Xinbo [1 ]
机构
[1] Chongqing Univ Posts & Telecommun, Coll Automat, Chongqing 400065, Peoples R China
[2] State Univ New York Buffalo State, Comp Informat Syst Dept, Buffalo, NY 14222 USA
基金
中国国家自然科学基金;
关键词
Feature extraction; Object detection; Noise; Data augmentation; Detectors; Task analysis; Semantics; Small object; sparse strategy; global multi-level perception; dynamic region aggregation; CONVOLUTIONAL NEURAL-NETWORK; AWARE;
D O I
10.1109/TCSVT.2024.3402097
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the field of object detection, detecting small objects is an important and challenging task. However, most existing methods tend to focus on designing complex network structures, lack attention to global representation, and ignore redundant noise and dense distribution of small objects in complex networks. To address the above problems, this paper proposes a small object detection method based on global multi-level perception and dynamic region aggregation. The method achieves accurate detection by dynamically aggregating effective features within a region while fully perceiving the features. This method mainly consists of two modules: global multi-level perception module and dynamic region aggregation module. In the global multi-level perception module, self-attention is used to perceive the global region, and its linear transformation is mapped through a convolutional network to increase the local details of global perception, thereby obtaining more refined global information. The dynamic region aggregation module, devised with a sparse strategy in mind, selectively interacts with relevant features. This design allows aggregation of key features of individual instances, effectively mitigating noise interference. Consequently, this approach addresses the challenges associated with densely distributed targets and enhances the model's ability to discriminate on a fine-grained level. This proposed method was evaluated on two popular datasets. Experimental results show that this method outperforms state-of-the-art methods in small object detection tasks, demonstrating good performance and potential applications.
引用
收藏
页码:10011 / 10022
页数:12
相关论文
共 50 条
  • [21] Multi-Level Foreground Prompt for Incremental Object Detection
    Mo, Jianwen
    Zou, Ronghua
    Yuan, Hua
    IEEE ACCESS, 2025, 13 : 4048 - 4066
  • [22] Multi-Level Feature Aggregation-Based Joint Keypoint Detection and Description
    Li, Jun
    Li, Xiang
    Wei, Yifei
    Song, Mei
    Wang, Xiaojun
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (02): : 2529 - 2540
  • [23] A multi-level depiction method for painterly rendering based on visual perception cue
    Lee, Hochang
    Seo, Sanghyun
    Ryoo, Seungtaek
    Ahn, Keejoo
    Yoon, Kyunghyun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2013, 64 (02) : 277 - 292
  • [24] A multi-level depiction method for painterly rendering based on visual perception cue
    Hochang Lee
    Sanghyun Seo
    Seungtaek Ryoo
    Keejoo Ahn
    Kyunghyun Yoon
    Multimedia Tools and Applications, 2013, 64 : 277 - 292
  • [25] A dynamic admission control method based on the multi-level collaboration in DVE
    Shi, Huaji
    Zhou, Qi
    Ni, Dan
    Journal of Computational Information Systems, 2012, 8 (18): : 7783 - 7790
  • [26] Multi-level feature fusion pyramid network for object detection
    Guo, Zebin
    Shuai, Hui
    Liu, Guangcan
    Zhu, Yisheng
    Wang, Wenqing
    VISUAL COMPUTER, 2023, 39 (09): : 4267 - 4277
  • [27] Unsupervised Salient Object Detection by Aggregating Multi-Level Cues
    Xia, Chenxing
    Zhang, Hanling
    IEEE PHOTONICS JOURNAL, 2018, 10 (06):
  • [28] Multi-level consistency regularization for domain adaptive object detection
    Tian, Kun
    Zhang, Chenghao
    Wang, Ying
    Xiang, Shiming
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (24): : 18003 - 18018
  • [29] Multi-level feature fusion pyramid network for object detection
    Zebin Guo
    Hui Shuai
    Guangcan Liu
    Yisheng Zhu
    Wenqing Wang
    The Visual Computer, 2023, 39 : 4267 - 4277
  • [30] Deep Salient Object Detection by Integrating Multi-level Cues
    Zhang, Jing
    Dai, Yuchao
    Porikli, Fatih
    2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017, : 1 - 10