Target Detection in Remote Sensing Image Based on Deformable Transformer and Adaptive Detection Head

被引:0
|
作者
Peng Haokang [1 ]
Ge Yun [1 ,2 ]
Yang Xiaoyu [1 ]
Hu Changquan [1 ]
机构
[1] Nanchang Hangkong Univ, Sch Software, Nanchang 330063, Jiangxi, Peoples R China
[2] Jiangxi Huihang Engn Consulting Co Ltd, Nanchang 330038, Jiangxi, Peoples R China
关键词
remote sensing image; target detection; Deformable Transformer; task learning module; adaptive detection head; L1-IoU loss; OBJECT DETECTION;
D O I
10.3788/LOP231702
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
To address the challenges of precise localization of targets in optical remote sensing images and conflict between classification and localization features in the detection head, a remote sensing image target detection method based on Deformable Transformer and adaptive detection head is proposed. First, we design a feature extraction network based on feature fusion and Deformable Transformer. The feature fusion module enriches the semantic information of shallow convolution neural network features, and the Deformable Transformer establishes dependencies on distant features. This in turn effectively captures global semantic information and improves feature representation capability. Second, an adaptive detection head based on task learning module is constructed to enhance task awareness within the detection head. It automatically learns and adjusts the feature representation for classification and localization tasks, and thereby, mitigates feature conflicts. Finally, the L1-IoU loss is proposed as a localization loss function to provide a more accurate assessment of localization error between candidate boxes and ground truth boxes during training, thereby improving the accuracy of object localization. The effectiveness of the proposed method is evaluated on high-resolution remote sensing datasets, NWPU VHR-10 and RSOD. The results show significant improvements when compared to other methods.
引用
收藏
页数:12
相关论文
共 30 条
  • [1] Single Shot Anchor Refinement Network for Oriented Object Detection in Optical Remote Sensing Imagery
    Bao, Songze
    Zhong, Xing
    Zhu, Ruifei
    Zhang, Xiaonan
    Li, Zhuqiang
    Li, Mengyang
    [J]. IEEE ACCESS, 2019, 7 : 87150 - 87161
  • [2] Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
  • [3] Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images
    Cheng, Gong
    Zhou, Peicheng
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (12): : 7405 - 7415
  • [4] CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
  • [5] Harris C, 1988, P ALV VIS C 1988 SEP, P1
  • [6] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [7] Focal Loss for Dense Object Detection
    Lin, Tsung-Yi
    Goyal, Priya
    Girshick, Ross
    He, Kaiming
    Dollar, Piotr
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (02) : 318 - 327
  • [8] Microsoft COCO: Common Objects in Context
    Lin, Tsung-Yi
    Maire, Michael
    Belongie, Serge
    Hays, James
    Perona, Pietro
    Ramanan, Deva
    Dollar, Piotr
    Zitnick, C. Lawrence
    [J]. COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 : 740 - 755
  • [9] Feature Pyramid Networks for Object Detection
    Lin, Tsung-Yi
    Dollar, Piotr
    Girshick, Ross
    He, Kaiming
    Hariharan, Bharath
    Belongie, Serge
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 936 - 944
  • [10] DCL-Net: Augmenting the Capability of Classification and Localization for Remote Sensing Object Detection
    Liu, Enhai
    Zheng, Yu
    Pan, Bin
    Xu, Xia
    Shi, Zhenwei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (09): : 7933 - 7944