MsRi-CCF: Multi-Scale and Rotation-Insensitive Convolutional Channel Features for Geospatial Object Detection

被引:30
|
作者
Wu, Xin [1 ,2 ]
Hong, Danfeng [3 ,4 ]
Ghamisi, Pedram [5 ]
Li, Wei [1 ,2 ]
Tao, Ran [1 ,2 ]
机构
[1] Beijing Inst Technol, Sch Informat & Elect, Beijing 100081, Peoples R China
[2] Beijing Inst Technol, Beijing Key Lab Fract Signals & Syst, Sch Informat & Elect, Beijing 100081, Peoples R China
[3] German Aerosp Ctr DLR, Remote Sensing Technol Inst IMF, D-82234 Wessling, Germany
[4] Tech Univ Munich, Signal Proc Earth Observat SiPEO, D-80333 Munich, Germany
[5] Helmholtz Zentrum Dresden Rossendorf, Helmholtz Inst Freiberg Resource Technol, Explorat Div, Machine Learning Grp, D-09599 Freiberg, Germany
基金
中国国家自然科学基金;
关键词
AdaBoost; deep learning; object detection; optical remote sensing imagery; outlier removal; multi-scale aggregation; rotation-insensitive; CLASSIFICATION; IMAGES; REPRESENTATIONS;
D O I
10.3390/rs10121990
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Geospatial object detection is a fundamental but challenging problem in the remote sensing community. Although deep learning has shown its power in extracting discriminative features, there is still room for improvement in its detection performance, particularly for objects with large ranges of variations in scale and direction. To this end, a novel approach, entitled multi-scale and rotation-insensitive convolutional channel features (MsRi-CCF), is proposed for geospatial object detection by integrating robust low-level feature generation, classifier generation with outlier removal, and detection with a power law. The low-level feature generation step consists of rotation-insensitive and multi-scale convolutional channel features, which were obtained by learning a regularized convolutional neural network (CNN) and integrating multi-scaled convolutional feature maps, followed by the fine-tuning of high-level connections in the CNN, respectively. Then, these generated features were fed into AdaBoost (chosen due to its lower computation and storage costs) with outlier removal to construct an object detection framework that facilitates robust classifier training. In the test phase, we adopted a log-space sampling approach instead of fine-scale sampling by using the fast feature pyramid strategy based on a computable power law. Extensive experimental results demonstrate that compared with several state-of-the-art baselines, the proposed MsRi-CCF approach yields better detection results, with 90.19% precision with the satellite dataset and 81.44% average precision with the NWPU VHR-10 datasets. Importantly, MsRi-CCF incurs no additional computational cost, which is only 0.92 s and 0.7 s per test image on the two datasets. Furthermore, we determined that most previous methods fail to gain an acceptable detection performance, particularly when they face several obstacles, such as deformations in objects (e.g., rotation, illumination, and scaling). Yet, these factors are effectively addressed by MsRi-CCF, yielding a robust geospatial object detection method.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Multi-Scale Spatial and Channel-wise Attention for Improving Object Detection in Remote Sensing Imagery
    Chen, Jie
    Wan, Li
    Zhu, Jingru
    Xu, Gang
    Deng, Min
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (04) : 681 - 685
  • [42] MSF-YOLO: A multi-scale features fusion-based method for small object detection
    Yang, Fengyu
    Zhou, Jiaqi
    Chen, Yuan
    Liao, Jie
    Yang, Mingxiang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (22) : 61239 - 61260
  • [43] Multi-Scale Feature Integrated Attention-Based Rotation Network for Object Detection in VHR Aerial Images
    Yang, Feng
    Li, Wentong
    Hu, Haiwei
    Li, Wanyi
    Wang, Peng
    [J]. SENSORS, 2020, 20 (06)
  • [44] Deep multi-scale dual-channel convolutional neural network for Internet of Things apple disease detection
    Zhang, Wenzhuo
    Zhou, Guoxiong
    Chen, Aibin
    Hu, Yahui
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 194
  • [45] EFMF-pillars: 3D object detection based on enhanced features and multi-scale fusion
    Zhang, Wenbiao
    Chen, Gang
    Wang, Hongyan
    Yang, Lina
    Sun, Tao
    [J]. Eurasip Journal on Advances in Signal Processing, 2024, 2024 (01)
  • [46] 3D-MSFC: A 3D multi-scale features compression method for object detection
    Li, Zhengxin
    Tian, Chongzhen
    Yuan, Hui
    Lu, Xin
    Malekmohamadi, Hossein
    [J]. Displays, 2024, 85
  • [47] DECA: a novel multi-scale efficient channel attention module for object detection in real-life fire images
    Wang, Junjie
    Yu, Jiong
    He, Zhu
    [J]. APPLIED INTELLIGENCE, 2022, 52 (02) : 1362 - 1375
  • [48] DECA: a novel multi-scale efficient channel attention module for object detection in real-life fire images
    Junjie Wang
    Jiong Yu
    Zhu He
    [J]. Applied Intelligence, 2022, 52 : 1362 - 1375
  • [49] A multi-scale parallel convolutional neural network for automatic sleep apnea detection using single-channel EEG signals
    Jiang, Dihong
    Ma, Yu
    Wang, Yuanyuan
    [J]. 2018 11TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2018), 2018,
  • [50] Hardhat-Wearing Detection Based on a Lightweight Convolutional Neural Network with Multi-Scale Features and a Top-Down Module
    Wang, Lu
    Xie, Liangbin
    Yang, Peiyu
    Deng, Qingxu
    Du, Shuo
    Xu, Lisheng
    [J]. SENSORS, 2020, 20 (07)