Survey on object detection in tilting box for remote sensing images

被引:0
|
作者
Zhang L. [1 ]
Zhang Y. [1 ]
Yu Y. [1 ]
Ma Y. [1 ,2 ]
Jiang H. [1 ,3 ]
机构
[1] Information Engineering University Institute of Geospatial Information, Zhengzhou
[2] JiMei University, Xiamen
[3] Marine Map Information Center, Tianjin
基金
中国国家自然科学基金;
关键词
convolutional neural network; deep learning; object detection in tilting bounding box; remote sensing image;
D O I
10.11834/jrs.20210247
中图分类号
学科分类号
摘要
Object detection is a basic issue in remote sensing images processing, especially with the breakthrough of deep learning and the development of remote sensing images acquisition, object detection in aerial imagery based on deep learning has attracted extensive attention. Though considerable progress has been made, many obstacles still exist because of the large-scale and highly complex backgrounds of optical remote sensing images. Besides, when detecting densely arranged and arbitrary oriented objects in aerial imagery, approaches based on horizontal proposals for common object detection often suffer from the issue of mismatch. Many domestic and foreign scholars have proposed a series of tilting box object detection algorithms based on deep learning, which promoted the improvement of object detection effect of remote sensing image.In order to enable researchers in related fields to have a comprehensive understanding of the theory, process and existing problems of remote sensing image tilting box object detection based on deep learning, this paper will systematically organize and summarize them. In this paper, we first analyze the limitations of the horizontal bounding box(HBB) object detection algorithms SENSING applied to remote sensing images, i.e. the introduction of background noise, inappropriate post-processing operation NMS and the inability to determine the orientation of objects accurately, it can be better solved by using the tilting bounding box object detection method. Next, we list the classical HBB object detection algorithms based on deep learning, and briefly introduce the principle of some of them. Then, emphatically introduces the development of tilting bounding box object detection algorithm, and the improvement process of two-stage tilting bounding box object detection algorithm is introduced from three aspects of feature extraction network, anchor boxes and proposal region design and loss function, about the one-stage detection algorithm, there are few researches at the present stage, hence two algorithms are simply introduced. At the fourth section, the detection performance of the existing tilt box object detection algorithms on two public and challenging aerial datasets (i.e. DOTA and HRSC2016) is demonstrated. From the comparison results of the three tables, it can be seen that: it is necessary to design a certain object feature enhancement module for the particularity of the objects in the remote sensing images, the problem of RSE in tilting bounding box detection algorithm still needs to be solved further. Although the one-stage detection algorithm is slightly worse than the two-stage algorithm in accuracy, it has obvious advantages in efficiency, so it still has certain research value. Finally, we summarize the existing problems of tilt box target detection algorithm from six aspects, and look forward to its future development trend. NATIONAL REMOTE SENSINGBULLETIN www.jors.cn. © 2022 National Remote Sensing Bulletin. All rights reserved.
引用
下载
收藏
页码:1723 / 1743
页数:20
相关论文
共 61 条
  • [1] Azimi S M, Vig E, Bahmanyar R, Korner M, Reinartz P., Towards multi-class object detection in unconstrained remote sensing imagery, Asian Conference on Computer Vision, pp. 150-165, (2018)
  • [2] Dai J F, Li Y, He K M, Sun J., R-FCN: object detection via region-based fully convolutional networks, Proceedings of the 30th International Conference on Neural Information Processing Systems, (2016)
  • [3] Dalal N, Triggs B., Histograms of oriented gradients for human detection, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp. 886-893, (2005)
  • [4] Ding J, Xue N, Long Y, Xia G S, Lu Q K., Learning roi transformer for oriented object detection in aerial images, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2849-2858, (2019)
  • [5] Duan K W, Bai S, Xie L X, Qi H G, Huang Q M, Tian Q., Centernet: Keypoint triplets for object detection, Proceedings of the IEEE International Conference on Computer Vision, pp. 6569-6578, (2019)
  • [6] Felzenszwalb P F, Girshick R B, McAllester D., Cascade object detection with deformable part models, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 2241-2248, (2010)
  • [7] Fu K, Chang Z H, Zhang Y, Xu G L, Zhang K S, Sun X., Rotation-aware and multi-scale convolutional neural network for object detection in remote sensing images, ISPRS Journal of Photogrammetry and Remote Sensing, 161, pp. 294-308, (2020)
  • [8] Fu K, Chen Z, Zhang Y, Sun X., Enhanced feature representation in detection for optical remote sensing images, Remote Sensing, 11, 18, (2019)
  • [9] Girshick R, Donahue J, Darrell T, Malik J., Rich feature hierarchies for accurate object detection and semantic segmentation, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580-587, (2014)
  • [10] Girshick R., Fast r-cnn, Proceedings of the IEEE International Conference on Computer Vision, pp. 1440-1448, (2015)