Few-Shot Rotation-Invariant Aerial Image Semantic Segmentation

Cited by: 4
Authors
Cao, Qinglong [1 ,2 ]
Chen, Yuntian [2 ,3 ]
Ma, Chao [1 ]
Yang, Xiaokang [1 ]
Affiliations
[1] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China
[2] Eastern Inst Technol, Ningbo Inst Digital Twin, Ningbo 315200, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
Source
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024 / Vol. 62
Funding
U.S. National Science Foundation;
Keywords
Consistent prediction; few-shot aerial semantic segmentation; rotation invariance; rotation-adaptive matching; NETWORK;
DOI
10.1109/TGRS.2023.3338699
Chinese Library Classification number
P3 [Geophysics]; P59 [Geochemistry];
Discipline classification code
0708 ; 070902 ;
Abstract
Few-shot aerial image semantic segmentation is a challenging task that requires precisely parsing unseen-category objects in query aerial images with limited annotated support aerial images. Conventionally, category prototypes are extracted from support samples to segment query images in a pixel-to-pixel matching manner. However, objects in aerial images are often distributed with arbitrary orientations, and varying orientations can cause dramatic feature changes. Because of this unique property of aerial images, conventional matching that does not consider orientation fails to activate same-category objects with different orientations. Furthermore, the oscillation of confidence scores in existing rotation-insensitive algorithms, caused by striking changes in object orientation, often leads to false recognition of lower-scored rotated semantic objects. To tackle these challenges, inspired by the intrinsic rotation invariance of aerial images, we propose a novel few-shot rotation-invariant aerial semantic segmentation network (FRINet) to efficiently segment aerial semantic objects with diverse orientations. Specifically, by extracting orientation-varying yet category-consistent support information, FRINet provides rotation-adaptive matching for each query feature in a feature-aggregation manner. Meanwhile, to encourage consistent predictions for aerial objects with arbitrary orientations, segmentation predictions from different orientations are supervised by the same label and further fused in a complementary manner to obtain the final rotation-invariant prediction. Moreover, to provide a better solution search space, the backbones are newly pretrained on the base categories to further boost segmentation performance. Extensive experiments on the few-shot aerial image semantic segmentation benchmark demonstrate that the proposed FRINet achieves new state-of-the-art performance.
The code is available at https://github.com/caoql98/FRINet.
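
The two ideas the abstract describes, prototype-based pixel matching and fusing predictions made at several orientations, can be illustrated with a minimal pure-Python sketch. This is not the FRINet implementation (see the repository above for that). The function names, the restriction to rotations by multiples of 90 degrees, the plain averaging used for fusion, and the toy feature values are all assumptions made for illustration.

```python
import math

def masked_average_prototype(features, mask):
    """Average the feature vectors of the pixels where mask == 1.

    features: H x W grid of C-dimensional lists; mask: H x W grid of 0/1.
    """
    channels = len(features[0][0])
    total = [0.0] * channels
    count = 0
    for feature_row, mask_row in zip(features, mask):
        for vector, is_foreground in zip(feature_row, mask_row):
            if is_foreground:
                count += 1
                for c in range(channels):
                    total[c] += vector[c]
    if count == 0:
        raise ValueError("support mask has no foreground pixels")
    return [value / count for value in total]

def cosine(a, b):
    """Cosine similarity of two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a > 0 and norm_b > 0 else 0.0

def cosine_match(features, prototype):
    """Pixel-to-prototype matching: one similarity score per query pixel."""
    return [[cosine(vector, prototype) for vector in row] for row in features]

def rot90(grid, k=1):
    """Rotate a 2D grid counter-clockwise by k * 90 degrees."""
    for _ in range(k % 4):
        grid = [list(row) for row in zip(*grid)][::-1]
    return grid

def fuse_rotated_predictions(query, predict):
    """Predict on four rotated copies, rotate each result back, and average.

    Assumption: 90-degree steps and mean fusion; the abstract does not
    specify the angles or the fusion rule.
    """
    rotated_back = [rot90(predict(rot90(query, k)), -k) for k in range(4)]
    height, width = len(query), len(query[0])
    return [[sum(p[i][j] for p in rotated_back) / 4.0 for j in range(width)]
            for i in range(height)]

# Toy 2 x 2 feature maps with 2 channels per pixel (made-up values).
support = [[[1.0, 0.0], [1.0, 0.0]],
           [[0.0, 1.0], [0.0, 1.0]]]
support_mask = [[1, 1],
                [0, 0]]
query = [[[0.0, 2.0], [3.0, 0.0]],
         [[0.0, 1.0], [2.0, 0.0]]]

prototype = masked_average_prototype(support, support_mask)
scores = fuse_rotated_predictions(query, lambda q: cosine_match(q, prototype))
```

In this toy setup the matcher is purely per-pixel, so the four rotated predictions coincide and the fusion changes nothing. With an orientation-sensitive predictor, such as a convolutional network applied to each rotated image, the four predictions would differ, and averaging them makes the fused output consistent under 90-degree rotations of the input.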
Pages: 1 / 13
Page count: 13
Related papers
50 items in total
  • [31] Few-shot semantic segmentation for industrial defect recognition
    Shi, Xiangwen
    Zhang, Shaobing
    Cheng, Miao
    He, Lian
    Tang, Xianghong
    Cui, Zhe
    COMPUTERS IN INDUSTRY, 2023, 148
  • [32] A lightweight siamese transformer for few-shot semantic segmentation
    Hegui Zhu
    Yange Zhou
    Cong Jiang
    Lianping Yang
    Wuming Jiang
    Zhimu Wang
    Neural Computing and Applications, 2024, 36 : 7455 - 7469
  • [33] Variational Prototype Inference for Few-Shot Semantic Segmentation
    Wang, Haochen
    Yang, Yandan
    Cao, Xianbin
    Zhen, Xiantong
    Snoek, Cees
    Shao, Ling
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 525 - 534
  • [34] Harmonic Feature Activation for Few-Shot Semantic Segmentation
    Liu, Binghao
    Jiao, Jianbin
    Ye, Qixiang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 3142 - 3153
  • [35] A Strong Baseline for Generalized Few-Shot Semantic Segmentation
    Hajimiri, Sina
    Boudiaf, Malik
    Ben Ayed, Ismail
    Dolz, Jose
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11269 - 11278
  • [36] Few-Shot Semantic Segmentation via Mask Aggregation
    Wei Ao
    Shunyi Zheng
    Yan Meng
    Yang Yang
    Neural Processing Letters, 56
  • [37] Few-Shot and Zero-Shot Semantic Segmentation for Food Images
    Honbu, Yuma
    Yanai, Keiji
    PROCEEDINGS OF THE 13TH INTERNATIONAL WORKSHOP ON MULTIMEDIA FOR COOKING AND EATING ACTIVITIES (CEA '21), 2021, : 25 - 28
  • [38] SANet: similarity aggregation and semantic fusion for few-shot semantic segmentation
    Ye, Minrui
    Zhang, Tao
    APPLIED INTELLIGENCE, 2025, 55 (02)
  • [39] Part-Based Semantic Transform for Few-Shot Semantic Segmentation
    Yang, Boyu
    Wan, Fang
    Liu, Chang
    Li, Bohao
    Ji, Xiangyang
    Ye, Qixiang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (12) : 7141 - 7152
  • [40] SML: Semantic meta-learning for few-shot semantic segmentation
    Pambala, Ayyappa Kumar
    Dutta, Titir
    Biswas, Soma
    PATTERN RECOGNITION LETTERS, 2021, 147 : 93 - 99