Complete and Invariant Instance Classifier Refinement for Weakly Supervised Object Detection in Remote Sensing Images

被引:1
|
作者
Qian, Xiaoliang [1 ]
Wang, Chenhao [1 ]
Wang, Wei [1 ]
Yao, Xiwen [2 ]
Cheng, Gong [2 ]
机构
[1] Zhengzhou Univ Light Ind, Coll Elect & Informat Engn, Zhengzhou 450002, Peoples R China
[2] Northwestern Polytech Univ, Sch Automat, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
Complete definition of pseudosoft label (CPSL); invariant multiple instance learning (IMIL); remote sensing image (RSI); weakly supervised object detection (WSOD);
D O I
10.1109/TGRS.2024.3408895
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Weakly supervised object detection (WSOD) in remote sensing images (RSIs) is used to detect high-value objects by utilizing image-level labels. However, the current models still have two problems. First, the misclassification of neighboring instances has easily occurred because the one-hot label is assigned to all of the seed instances and their neighboring instances. Second, the supervisory information of each instance classifier refinement (ICR) branch is generated from the predicted class score of the upper ICR branch rather than the real label; thus, the prediction mistake of each ICR branch will be accumulated with the propagation of supervisory information. To address the first problem, a complete definition of the pseudosoft label (CPSL) of instances is proposed to directly train each ICR branch, where the CPSL of seed instances is defined according to the predicted class scores of upper ICR branch, and the CPSL of other instances is determined by the spatial distance weighted feature similarity (FS) between them and seed instances. To handle the second problem, an invariant multiple instance learning (IMIL) scheme is proposed to indirectly train each ICR branch by using real image-level labels. Furthermore, the affine transformations of the original image are incorporated into the baseline model to enhance the invariance of our model. The ablation studies verify the effectiveness of CPSL, IMIL, and their combination. The quantitative comparisons with popular methods show that 73.63% (31.08%) mean average precision (mAP) and 79.88% (57.52%) correct localization (CorLoc) of our method are the best on the NWPU VHR-10.v2 (DIOR) dataset, and the qualitative comparisons intuitively demonstrate it again.
引用
下载
收藏
页数:13
相关论文
共 50 条
  • [21] SAM-Induced Pseudo Fully Supervised Learning for Weakly Supervised Object Detection in Remote Sensing Images
    Qian, Xiaoliang
    Lin, Chenyang
    Chen, Zhiwu
    Wang, Wei
    REMOTE SENSING, 2024, 16 (09)
  • [22] SAENet: Self-Supervised Adversarial and Equivariant Network for Weakly Supervised Object Detection in Remote Sensing Images
    Feng, Xiaoxu
    Yao, Xiwen
    Cheng, Gong
    Han, Jungong
    Han, Junwei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [23] Weakly Supervised Learning for Target Detection in Remote Sensing Images
    Zhang, Dingwen
    Han, Junwei
    Cheng, Gong
    Liu, Zhenbao
    Bu, Shuhui
    Guo, Lei
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2015, 12 (04) : 701 - 705
  • [24] CLASSIFIER REFINEMENT FOR WEAKLY SUPERVISED OBJECT DETECTION WITH CLASS-SPECIFIC ACTIVATION MAP
    Du, Peilun
    Zhang, Haitao
    Ma, Huadong
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3367 - 3371
  • [25] Weakly supervised object extraction with iterative contour prior for remote sensing images
    Chu He
    Yu Zhang
    Bo Shi
    Xin Su
    Xin Xu
    Mingsheng Liao
    EURASIP Journal on Advances in Signal Processing, 2013
  • [26] Weakly supervised object extraction with iterative contour prior for remote sensing images
    He, Chu
    Zhang, Yu
    Shi, Bo
    Su, Xin
    Xu, Xin
    Liao, Mingsheng
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2013,
  • [27] TCANet: Triple Context-Aware Network for Weakly Supervised Object Detection in Remote Sensing Images
    Feng, Xiaoxu
    Han, Junwei
    Yao, Xiwen
    Cheng, Gong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (08): : 6946 - 6955
  • [28] Multi-view contextual adaptation network for weakly supervised object detection in remote sensing images
    Ye, Binfeng
    Zhang, Junjie
    Rao, Yutao
    Gao, Rui
    Zeng, Dan
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2024, 45 (13) : 4344 - 4366
  • [29] Negative Bootstrapping for Weakly Supervised Target Detection in Remote Sensing Images
    Zhou, Peicheng
    Zhang, Dingwen
    Cheng, Gong
    Han, Junwei
    2015 1ST IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2015, : 318 - 323
  • [30] Adaptive Feature Refinement for Oriented Object Detection in Remote Sensing Images
    Liu, Enhai
    Xu, Jiayin
    Li, Yan
    Fan, Shiyan
    Computer Engineering and Applications, 2023, 59 (24) : 155 - 164