OrtDet: An Orientation Robust Detector via Transformer for Object Detection in Aerial Images

被引:3
|
作者
Zhao, Ling [1 ]
Liu, Tianhua [1 ]
Xie, Shuchun [2 ]
Huang, Haoze [1 ]
Qi, Ji [1 ]
机构
[1] Cent South Univ, Sch Geosci & Info Phys, South Lushan Rd, Changsha 410083, Peoples R China
[2] Changsha Univ Sci & Technol, Sch Traff & Transportat Engn, Changsha 410114, Peoples R China
基金
中国国家自然科学基金;
关键词
object detection; rotation-equivariant; self-attention;
D O I
10.3390/rs14246329
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The detection of arbitrarily rotated objects in aerial images is challenging due to the highly complex backgrounds and the multiple angles of objects. Existing detectors are not robust relative to the varying angle of objects because the CNNs do not explicitly model the orientation's variation. In this paper, we propose an Orientation Robust Detector (OrtDet) to solve this problem, which aims to learn features that change accordingly with the object's rotation (i.e., rotation-equivariant features). Specifically, we introduce a vision transformer as the backbone to capture its remote contextual associations via the degree of feature similarities. By capturing the features of each part of the object and their relative spatial distribution, OrtDet can learn features that have a complete response to any direction of the object. In addition, we use the tokens concatenation layer (TCL) strategy, which generates a pyramidal feature hierarchy for addressing vastly different scales of objects. To avoid the confusion of angle regression, we predict the relative gliding offsets of the vertices in each corresponding side of the horizontal bounding boxes (HBBs) to represent the oriented bounding boxes (OBBs). To intuitively reflect the robustness of the detector, a new metric, the mean rotation precision (mRP), is proposed to quantitatively measure the model's learning ability for a rotation-equivariant feature. Experiments on the DOTA-v1.0, DOTA-v1.5, and HRSC2016 datasets show that our method improves the mAP by 0.5, 1.1, and 2.2 and reduces mRP detection fluctuations by 0.74, 0.56, and 0.52, respectively.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Dense-and-Similar Object detection in aerial images
    Wang, Xiaobin
    Yan, Ye
    Sun, Haohui
    Zhu, Dekang
    PATTERN RECOGNITION LETTERS, 2023, 176 : 153 - 159
  • [42] Rethinking Classification of Oriented Object Detection in Aerial Images
    Nguyen, Phuc
    Truong, Thang
    Vo, Nguyen D.
    Nguyen, Khang
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (09) : 738 - 747
  • [43] Adaptive dynamic networks for object detection in aerial images
    Wu, Zhenyu
    Yan, Haibin
    PATTERN RECOGNITION LETTERS, 2023, 166 : 8 - 15
  • [44] A Hybrid Moving Object Detection Method for Aerial Images
    Huang, Chung-Hsien
    Wu, Yi-Ta
    Kao, Jau-Hong
    Shih, Ming-Yu
    Chou, Cheng-Chuan
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING-PCM 2010, PT I, 2010, 6297 : 357 - 368
  • [45] Enhancing Object Detection Algorithms by Synthetic Aerial Images
    Yilmaz, Can
    Maras, Bahri
    Yilmaz, Gorkem
    Ceylan, Goksu
    Hamamcioglu, Onder
    Arica, Nafiz
    Ertuzun, Aysin Baytan
    2023 31ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2023,
  • [46] Scale Decoupled Pyramid for Object Detection in Aerial Images
    Ma, You
    Chai, Lin
    Jin, Lizuo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [47] Object Detection in Aerial Images Based on Cascaded CNN
    Zhang, Wei
    Li, Jiaojie
    Qi, Shengxiang
    2018 INTERNATIONAL CONFERENCE ON SENSOR NETWORKS AND SIGNAL PROCESSING (SNSP 2018), 2018, : 434 - 439
  • [48] Dot Distance for Tiny Object Detection in Aerial Images
    Xu, Chang
    Wang, Jinwang
    Yang, Wen
    Yu, Lei
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 1192 - 1201
  • [49] ReDet: A Rotation-equivariant Detector for Aerial Object Detection
    Han, Jiaming
    Ding, Jian
    Xue, Nan
    Xia, Gui-Song
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2785 - 2794
  • [50] WSODet: A Weakly Supervised Oriented Detector for Aerial Object Detection
    Tan, Zhiwen
    Jiang, Zhiguo
    Guo, Chen
    Zhang, Haopeng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61