OrtDet: An Orientation Robust Detector via Transformer for Object Detection in Aerial Images

被引:3
|
作者
Zhao, Ling [1 ]
Liu, Tianhua [1 ]
Xie, Shuchun [2 ]
Huang, Haoze [1 ]
Qi, Ji [1 ]
机构
[1] Cent South Univ, Sch Geosci & Info Phys, South Lushan Rd, Changsha 410083, Peoples R China
[2] Changsha Univ Sci & Technol, Sch Traff & Transportat Engn, Changsha 410114, Peoples R China
基金
中国国家自然科学基金;
关键词
object detection; rotation-equivariant; self-attention;
D O I
10.3390/rs14246329
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The detection of arbitrarily rotated objects in aerial images is challenging due to the highly complex backgrounds and the multiple angles of objects. Existing detectors are not robust relative to the varying angle of objects because the CNNs do not explicitly model the orientation's variation. In this paper, we propose an Orientation Robust Detector (OrtDet) to solve this problem, which aims to learn features that change accordingly with the object's rotation (i.e., rotation-equivariant features). Specifically, we introduce a vision transformer as the backbone to capture its remote contextual associations via the degree of feature similarities. By capturing the features of each part of the object and their relative spatial distribution, OrtDet can learn features that have a complete response to any direction of the object. In addition, we use the tokens concatenation layer (TCL) strategy, which generates a pyramidal feature hierarchy for addressing vastly different scales of objects. To avoid the confusion of angle regression, we predict the relative gliding offsets of the vertices in each corresponding side of the horizontal bounding boxes (HBBs) to represent the oriented bounding boxes (OBBs). To intuitively reflect the robustness of the detector, a new metric, the mean rotation precision (mRP), is proposed to quantitatively measure the model's learning ability for a rotation-equivariant feature. Experiments on the DOTA-v1.0, DOTA-v1.5, and HRSC2016 datasets show that our method improves the mAP by 0.5, 1.1, and 2.2 and reduces mRP detection fluctuations by 0.74, 0.56, and 0.52, respectively.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] Rotated Object Detection via Scale-Invariant Mahalanobis Distance in Aerial Images
    Wen, Siyang
    Guo, Wei
    Liu, Yi
    Wu, Ruijie
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [32] Orientation-selective building detection in aerial images
    Manno-Kovacs, Andrea
    Sziranyi, Tamas
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2015, 108 : 94 - 112
  • [33] YOLO-ERF: lightweight object detector for UAV aerial images
    Xin Wang
    Ning He
    Chen Hong
    Fengxi Sun
    Wenjing Han
    Qi Wang
    Multimedia Systems, 2023, 29 (6) : 3329 - 3339
  • [34] YOLO-ERF: lightweight object detector for UAV aerial images
    Wang, Xin
    He, Ning
    Hong, Chen
    Sun, Fengxi
    Han, Wenjing
    Wang, Qi
    MULTIMEDIA SYSTEMS, 2023, 29 (06) : 3329 - 3339
  • [35] Learning Semantic Keypoints for Object Detection in Aerial Images
    Kim, Minsu
    Joung, Sunghun
    Song, Taeyong
    Kim, Hanjae
    Sohn, Kwanghoon
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [36] Density Map Guided Object Detection in Aerial Images
    Li, Changlin
    Yang, Taojiannan
    Zhu, Sijie
    Chen, Chen
    Guan, Shanyue
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 737 - 746
  • [37] A Refined Hybrid Network for Object Detection in Aerial Images
    Yu, Ying
    Yang, Xi
    Li, Jie
    Gao, Xinbo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [38] Scale Enhancement Network for Object Detection in Aerial Images
    Mao, Shihan
    Wang, Zhi
    He, Qineng
    Zhu, Zhangqing
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2024, 38 (02)
  • [39] Unsupervised Cluster Guided Object Detection in Aerial Images
    Liao, Jiajia
    Piao, Yingchao
    Su, Jinhe
    Cai, Guorong
    Huang, Xingwang
    Chen, Long
    Huang, Zhaohong
    Wu, Yundong
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 11204 - 11216
  • [40] Learning Semantic Keypoints for Object Detection in Aerial Images
    Kim, Minsu
    Joung, Sunghun
    Song, Taeyong
    Kim, Hanjae
    Sohn, Kwanghoon
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20