DIMD-DETR: DDQ-DETR With Improved Metric Space for End-to-End Object Detector on Remote Sensing Aircrafts

被引:0
|
作者
Liu, Huan [1 ]
Ren, Xuefeng [1 ]
Gan, Yang [1 ]
Chen, Yongming [1 ]
Lin, Ping [1 ]
机构
[1] Hubei Normal Univ, Sch Elect Engn & Automat, Huangshi 435000, Peoples R China
基金
中国国家自然科学基金;
关键词
Aircraft; Remote sensing; Feature extraction; Atmospheric modeling; Accuracy; Adaptation models; Transformers; Object detection; Image resolution; Dynamic scheduling; Aircraft detection; end-to-end; metric space; remote sensing; transformer; IMAGES; CLASSIFICATION;
D O I
10.1109/JSTARS.2025.3530141
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Aircraft target detection in remote sensing images faces numerous challenges, including target size variations, low resolution, and complex backgrounds. To address these challenges, an enhanced end-to-end aircraft detection framework (DIMD-DETR) is developed based on an improved metric space. Initially, a bilayer targeted prediction method is proposed to strengthen gradient interaction across decoder layers, thereby enhancing detection accuracy and sensitivity in complex scenarios. The pyramid structure and self-attention mechanism from pyramid vision transformer V2 are incorporated to enable effective joint learning of both global and local features, which significantly boosts performance for low-resolution targets. To further enhance the model's generalization capabilities, an aircraft-specific data augmentation strategy is meticulously devised, thereby improving the model's adaptability to variations in scale and appearance. In addition, a metric-space-based loss function is developed to optimize the collaborative effects of the modular architecture, enhancing detection performance in complex backgrounds and under varying target conditions. Finally, a dynamic learning rate scheduling strategy is proposed to balance rapid convergence with global exploration, thereby elevating the model's robustness in challenging environments. Compared to current popular networks, our model demonstrated superior detection performance with fewer parameters.
引用
收藏
页码:4498 / 4509
页数:12
相关论文
共 28 条
  • [1] Dynamic DETR: End-to-End Object Detection with Dynamic Attention
    Dai, Xiyang
    Chen, Yinpeng
    Yang, Jianwei
    Zhang, Pengchuan
    Yuan, Lu
    Zhang, Lei
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 2968 - 2977
  • [2] Decoupled DETR: Spatially Disentangling Localization and Classification for Improved End-to-End Object Detection
    Zhang, Manyuan
    Song, Guanglu
    Liu, Yu
    Li, Hongsheng
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6578 - 6587
  • [3] L-DETR: A Light-Weight Detector for End-to-End Object Detection With Transformers
    Li, Tianyang
    Wang, Jian
    Zhang, Tibing
    IEEE ACCESS, 2022, 10 : 105685 - 105692
  • [4] V-DETR: Pure Transformer for End-to-End Object Detection
    Dung Nguyen
    Van-Dung Hoang
    Van-Tuong-Lan Le
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, PT II, ACIIDS 2024, 2024, 14796 : 120 - 131
  • [5] DETR-ORD: An Improved DETR Detector for Oriented Remote Sensing Object Detection with Feature Reconstruction and Dynamic Query
    He, Xiaohai
    Liang, Kaiwen
    Zhang, Weimin
    Li, Fangxing
    Jiang, Zhou
    Zuo, Zhengqing
    Tan, Xinyan
    REMOTE SENSING, 2024, 16 (18)
  • [6] CPH DETR: Comprehensive Regression Loss for End-to-End Object Detection
    Wu, Jihao
    Li, Shufang
    Kang, Guxia
    Yang, Yuqing
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT II, 2024, 15017 : 93 - 107
  • [7] Rotated-DETR: an End-to-End Transformer-based Oriented Object Detector for Aerial Images
    Kim, Jinbeom
    Lee, Giljun
    Kim, Taejune
    Woo, Simon S.
    38TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2023, 2023, : 1248 - 1255
  • [8] DITA: DETR with improved queries for end-to-end temporal action detection
    Lu, Chongkai
    Mak, Man-Wai
    NEUROCOMPUTING, 2024, 596
  • [9] Pruning DETR: efficient end-to-end object detection with sparse structured pruning
    Huaiyuan Sun
    Shuili Zhang
    Xve Tian
    Yuanyuan Zou
    Signal, Image and Video Processing, 2024, 18 : 129 - 135
  • [10] Pruning DETR: efficient end-to-end object detection with sparse structured pruning
    Sun, Huaiyuan
    Zhang, Shuili
    Tian, Xve
    Zou, Yuanyuan
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (01) : 129 - 135