Few-Shot Rotation-Invariant Aerial Image Semantic Segmentation

Cited by: 4
Authors
Cao, Qinglong [1, 2]
Chen, Yuntian [2, 3]
Ma, Chao [1 ]
Yang, Xiaokang [1 ]
Affiliations
[1] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China
[2] Eastern Inst Technol, Ningbo Inst Digital Twin, Ningbo 315200, Peoples R China
[3] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
Source
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024 / Vol. 62
Funding
U.S. National Science Foundation;
Keywords
Consistent prediction; few-shot aerial semantic segmentation; rotation invariance; rotation-adaptive matching; NETWORK;
DOI
10.1109/TGRS.2023.3338699
Chinese Library Classification
P3 [Geophysics]; P59 [Geochemistry];
Discipline classification code
0708 ; 070902 ;
Abstract
Few-shot aerial image semantic segmentation is a challenging task that requires precisely parsing objects of unseen categories in query aerial images, given only a few annotated support aerial images. Typically, category prototypes are extracted from the support samples and used to segment query images by pixel-to-pixel matching. However, objects in aerial images often appear at arbitrary orientations, and a change in orientation can cause a dramatic change in features. Because of this property, conventional matching, which does not account for orientation, fails to activate same-category objects with different orientations. Furthermore, in existing rotation-insensitive algorithms the confidence scores oscillate as object orientation changes sharply, which often leads to false recognition of rotated semantic objects that receive lower scores. To tackle these challenges, and inspired by the intrinsic rotation invariance of aerial images, we propose a novel few-shot rotation-invariant aerial semantic segmentation network (FRINet) that efficiently segments aerial semantic objects with diverse orientations. Specifically, by extracting orientation-varying yet category-consistent support information, FRINet provides rotation-adaptive matching for each query feature through feature aggregation. Meanwhile, to encourage consistent predictions for aerial objects with arbitrary orientations, the segmentation predictions from different orientations are supervised by the same label and then fused in a complementary manner to obtain the final rotation-invariant prediction. Moreover, to provide a better solution search space, the backbones are newly pretrained on the base categories to boost segmentation performance. Extensive experiments on the few-shot aerial image semantic segmentation benchmark demonstrate that the proposed FRINet achieves new state-of-the-art performance.
The code is available at https://github.com/caoql98/FRINet.
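The two ideas the abstract describes, prototype-based pixel matching and fusing predictions from several orientations, can be illustrated with a minimal NumPy sketch. This is not the FRINet implementation (see the repository above for that). It assumes masked average pooling for the prototype, cosine similarity for matching, and plain averaging over four 90-degree rotations for fusion. The function names and the `toy_features` stand-in for a backbone are hypothetical.

```python
import numpy as np


def masked_average_prototype(support_feat, support_mask):
    """Pool support features (C, H, W) inside a binary mask (H, W) into one (C,) prototype."""
    weights = support_mask.astype(support_feat.dtype)
    total = weights.sum()
    if total == 0:
        raise ValueError("support mask contains no foreground pixels")
    return (support_feat * weights[None]).sum(axis=(1, 2)) / total


def cosine_match(query_feat, prototype, eps=1e-8):
    """Pixel-to-prototype cosine similarity: (C, H, W) and (C,) -> (H, W)."""
    dot = np.einsum("chw,c->hw", query_feat, prototype)
    norms = np.linalg.norm(query_feat, axis=0) * np.linalg.norm(prototype)
    return dot / (norms + eps)


def toy_features(image):
    """Hypothetical stand-in for a backbone: (H, W) image -> (3, H, W) features.

    The horizontal-difference channel makes it orientation-sensitive on purpose.
    """
    horizontal_diff = np.roll(image, 1, axis=1) - image
    return np.stack([image, horizontal_diff, np.ones_like(image)])


def rotation_fused_score(query_image, prototype, extract_features):
    """Average the matching score over the four 90-degree rotations of the query."""
    fused = np.zeros(query_image.shape, dtype=float)
    for k in range(4):
        rotated = np.rot90(query_image, k)
        score = cosine_match(extract_features(rotated), prototype)
        fused += np.rot90(score, -k)  # undo the rotation before fusing
    return fused / 4.0


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    support_image = rng.random((8, 8))
    support_mask = support_image > 0.5  # assumed toy annotation
    query_image = rng.random((8, 8))

    prototype = masked_average_prototype(toy_features(support_image), support_mask)
    fused = rotation_fused_score(query_image, prototype, toy_features)
    print(fused.shape)
```

Averaging over the four rotations makes the fused score map rotate together with the query for 90-degree turns, even though the toy extractor itself is orientation-sensitive. Arbitrary angles and learned fusion, as in the paper, are beyond this sketch.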
Pages: 1 - 13
Number of pages: 13