MsRi-CCF: Multi-Scale and Rotation-Insensitive Convolutional Channel Features for Geospatial Object Detection

Cited by: 30
Authors
Wu, Xin [1 ,2 ]
Hong, Danfeng [3 ,4 ]
Ghamisi, Pedram [5 ]
Li, Wei [1 ,2 ]
Tao, Ran [1 ,2 ]
Affiliations
[1] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Beijing Key Lab Fract Signals & Syst, Sch Informat & Elect, Beijing 100081, Peoples R China
[3] German Aerosp Ctr DLR, Remote Sensing Technol Inst IMF, D-82234 Wessling, Germany
[4] Tech Univ Munich, Signal Proc Earth Observat SiPEO, D-80333 Munich, Germany
[5] Helmholtz Zentrum Dresden Rossendorf, Helmholtz Inst Freiberg Resource Technol, Explorat Div, Machine Learning Grp, D-09599 Freiberg, Germany
Funding
National Natural Science Foundation of China;
Keywords
AdaBoost; deep learning; object detection; optical remote sensing imagery; outlier removal; multi-scale aggregation; rotation-insensitive; CLASSIFICATION; IMAGES; REPRESENTATIONS;
DOI
10.3390/rs10121990
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science];
Discipline classification code
08 ; 0830 ;
Abstract
Geospatial object detection is a fundamental but challenging problem in the remote sensing community. Although deep learning has proven powerful at extracting discriminative features, its detection performance still leaves room for improvement, particularly for objects that vary widely in scale and orientation. To this end, a novel approach, multi-scale and rotation-insensitive convolutional channel features (MsRi-CCF), is proposed for geospatial object detection. It integrates robust low-level feature generation, classifier generation with outlier removal, and detection with a power law. The low-level feature generation step produces rotation-insensitive and multi-scale convolutional channel features, which are obtained by learning a regularized convolutional neural network (CNN) and by integrating multi-scale convolutional feature maps followed by fine-tuning of the high-level connections in the CNN, respectively. These features are then fed into AdaBoost (chosen for its lower computation and storage costs) with outlier removal to construct an object detection framework that facilitates robust classifier training. In the test phase, a log-space sampling approach is adopted instead of fine-scale sampling, using the fast feature pyramid strategy based on a computable power law. Extensive experimental results demonstrate that, compared with several state-of-the-art baselines, the proposed MsRi-CCF approach yields better detection results, with 90.19% precision on the satellite dataset and 81.44% average precision on the NWPU VHR-10 dataset. Importantly, MsRi-CCF incurs no additional computational cost, requiring only 0.92 s and 0.7 s per test image on the two datasets, respectively. Furthermore, we found that most previous methods fail to reach acceptable detection performance when they face obstacles such as object deformations (e.g., rotation, illumination, and scaling). These factors are effectively addressed by MsRi-CCF, yielding a robust geospatial object detection method.
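The log-space sampling step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation; it only shows the general fast-feature-pyramid idea, in which a feature statistic is computed exactly at a few octave-spaced scales and extrapolated to intermediate scales with a power law of the form f(s) ≈ f(s_ref) · (s / s_ref)^(−λ). The function names, the exponent value `lam`, and the `exact_feature` callable are all assumptions made for illustration; in practice the exponent is channel-specific and estimated empirically.

```python
import math

def log_space_scales(n_octaves, per_octave):
    """Scales 2^(-k/per_octave) for k = 0 .. n_octaves * per_octave."""
    return [2.0 ** (-k / per_octave) for k in range(n_octaves * per_octave + 1)]

def nearest_octave_scale(s):
    """Closest power-of-two scale (in log space); features are computed exactly there."""
    return 2.0 ** round(math.log2(s))

def extrapolate_feature(f_ref, s_ref, s, lam):
    """Power-law estimate: f(s) ~= f(s_ref) * (s / s_ref) ** (-lam)."""
    return f_ref * (s / s_ref) ** (-lam)

def build_pyramid(exact_feature, n_octaves=2, per_octave=4, lam=0.1):
    """Approximate a feature statistic at every log-spaced scale.

    exact_feature: hypothetical callable mapping a scale to a scalar feature
    statistic (e.g. a channel sum); it is only called at octave scales.
    lam: assumed power-law exponent for this channel.
    """
    cache = {}      # exact values, one entry per octave scale
    pyramid = {}    # scale -> approximated value
    for s in log_space_scales(n_octaves, per_octave):
        s_ref = nearest_octave_scale(s)
        if s_ref not in cache:
            cache[s_ref] = exact_feature(s_ref)
        pyramid[s] = extrapolate_feature(cache[s_ref], s_ref, s, lam)
    return pyramid

if __name__ == "__main__":
    # Toy statistic that follows a power law exactly, so the
    # extrapolated values coincide with the exact ones.
    toy = lambda s: 5.0 * s ** (-0.1)
    for scale, value in build_pyramid(toy).items():
        print(f"scale={scale:.4f}  approx={value:.4f}  exact={toy(scale):.4f}")
```

With two octaves and four scales per octave, the expensive computation runs at only three scales, while the remaining six are filled in by extrapolation. This is the source of the savings that log-space sampling offers over fine-scale sampling.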
Pages: 20
Related papers
50 in total
  • [21] Oil spill detection: SAR multi-scale segmentation & object features evaluation
    Topouzelis, K
    Karathanassi, V
    Pavlakis, P
    Rokos, D
    [J]. REMOTE SENSING OF THE OCEAN AND SEA ICE 2002, 2002, 4880 : 77 - 87
  • [22] Transferring scale-independent features to support multi-scale object recognition with deep convolutional neural network
    Zhou, Xiran
    [J]. 26TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2018), 2018, : 614 - 615
  • [23] LOROD: Fully Convolutional Network for Real-time Multi-scale Object Detection Algorithm
    Hou, Shaoqi
    Li, Chao
    Liu, Xueting
    Zeng, Yuhao
    Du, Wenyi
    Yin, Guangqiang
    [J]. 2021 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, INTERNET OF PEOPLE, AND SMART CITY INNOVATIONS (SMARTWORLD/SCALCOM/UIC/ATC/IOP/SCI 2021), 2021, : 579 - 584
  • [24] FAS-Net: Construct Effective Features Adaptively for Multi-Scale Object Detection
    Yan, Jiangqiao
    Zhang, Yue
    Chang, Zhonghan
    Zhang, Tengfei
    Yan, Menglong
    Diao, Wenhui
    Wang, Hongqi
    Sun, Xian
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12573 - 12580
  • [25] ScarfNet: Multi-scale Features with Deeply Fused and Redistributed Semantics for Enhanced Object Detection
    Hyeok, Yoo Jin
    Dongsuk, Kum
    Won, Choi Jun
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 4505 - 4512
  • [26] Matching Multi-Scale Features and Prediction Tasks for Real-Time Object Detection
    Du, Hongjie
    Sun, Hanqing
    Cao, Jiale
    Pang, Yanwei
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (12)
  • [27] Learning Deep Structured Multi-scale Features for Crisp and Object Occlusion Edge Detection
    Dong, Zihao
    Zhang, Ruixun
    Shao, Xiuli
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: IMAGE PROCESSING, PT III, 2019, 11729 : 253 - 266
  • [28] Bmsmlet: boosting multi-scale information on multi-level aggregated features for salient object detection
    Wu, Ziwei
    Jia, Tong
    Wu, Yunhe
    Zeng, Zhikang
    Liang, Feng
    [J]. The Visual Computer, 2024, 40 (2) : 1131 - 1144
  • [29] MSRMNet: Multi-scale skip residual and multi-mixed features network for salient object detection
    Liu, Xinlong
    Wang, Luping
    [J]. NEURAL NETWORKS, 2024, 173