OrtDet: An Orientation Robust Detector via Transformer for Object Detection in Aerial Images

Cited by: 3
Authors
Zhao, Ling [1]
Liu, Tianhua [1]
Xie, Shuchun [2]
Huang, Haoze [1]
Qi, Ji [1]
Affiliations
[1] Cent South Univ, Sch Geosci & Info Phys, South Lushan Rd, Changsha 410083, Peoples R China
[2] Changsha Univ Sci & Technol, Sch Traff & Transportat Engn, Changsha 410114, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
object detection; rotation-equivariant; self-attention
DOI
10.3390/rs14246329
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Detecting arbitrarily rotated objects in aerial images is challenging because of highly complex backgrounds and the wide range of object orientations. Existing detectors are not robust to varying object angles because CNNs do not explicitly model orientation variation. In this paper, we propose an Orientation Robust Detector (OrtDet) to address this problem; it aims to learn features that change in step with the object's rotation (i.e., rotation-equivariant features). Specifically, we introduce a vision transformer as the backbone to capture long-range contextual associations via the degree of feature similarity. By capturing the features of each part of the object and their relative spatial distribution, OrtDet can learn features that respond fully to any orientation of the object. In addition, we use a tokens concatenation layer (TCL) strategy, which generates a pyramidal feature hierarchy to handle objects of vastly different scales. To avoid the ambiguity of angle regression, we represent oriented bounding boxes (OBBs) by predicting the relative gliding offsets of the vertices along each corresponding side of the horizontal bounding boxes (HBBs). To reflect the robustness of the detector intuitively, we propose a new metric, the mean rotation precision (mRP), which quantitatively measures the model's ability to learn rotation-equivariant features. Experiments on the DOTA-v1.0, DOTA-v1.5, and HRSC2016 datasets show that our method improves the mAP by 0.5, 1.1, and 2.2 and reduces mRP detection fluctuations by 0.74, 0.56, and 0.52, respectively.
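The gliding-offset representation mentioned in the abstract can be illustrated with a short sketch. The abstract does not give the exact parameterization, so the convention below is an assumption: each OBB vertex lies on one side of the enclosing HBB, and its offset is measured along that side and normalized by the HBB width or height. The function names `obb_to_gliding` and `gliding_to_obb` are hypothetical and are not taken from the paper.

```python
import numpy as np

def obb_to_gliding(vertices):
    """Encode an oriented box as (HBB, four relative offsets).

    Assumed convention (not confirmed by the abstract): offsets are
    measured clockwise along the top, right, bottom, and left sides
    of the enclosing horizontal box, in image coordinates (y down).
    """
    v = np.asarray(vertices, dtype=float)      # shape (4, 2)
    x_min, y_min = v.min(axis=0)
    x_max, y_max = v.max(axis=0)
    w, h = x_max - x_min, y_max - y_min
    top = v[np.argmin(v[:, 1])]                # vertex touching the top side
    right = v[np.argmax(v[:, 0])]              # vertex touching the right side
    bottom = v[np.argmax(v[:, 1])]             # vertex touching the bottom side
    left = v[np.argmin(v[:, 0])]               # vertex touching the left side
    offsets = np.array([
        (top[0] - x_min) / w,                  # glide along the top side
        (right[1] - y_min) / h,                # glide along the right side
        (x_max - bottom[0]) / w,               # glide along the bottom side
        (y_max - left[1]) / h,                 # glide along the left side
    ])
    hbb = (float(x_min), float(y_min), float(x_max), float(y_max))
    return hbb, offsets

def gliding_to_obb(hbb, offsets):
    """Decode (HBB, offsets) back to four OBB vertices."""
    x_min, y_min, x_max, y_max = hbb
    w, h = x_max - x_min, y_max - y_min
    a1, a2, a3, a4 = offsets
    return np.array([
        [x_min + a1 * w, y_min],
        [x_max, y_min + a2 * h],
        [x_max - a3 * w, y_max],
        [x_min, y_max - a4 * h],
    ])

# A square rotated inside a 6x6 horizontal box.
box = [[2, 0], [6, 2], [4, 6], [0, 4]]
hbb, offsets = obb_to_gliding(box)
restored = gliding_to_obb(hbb, offsets)
```

Because every regressed quantity is a bounded ratio along a box side rather than an angle, the targets do not jump at angular boundaries, which is the confusion the abstract refers to. Axis-aligned boxes produce ties in the vertex selection above and would need separate handling; this sketch does not address them.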
Pages: 19