Few-Shot Rotation-Invariant Aerial Image Semantic Segmentation

被引：4

作者：

Cao, Qinglong ^{[1
,2
]}

Chen, Yuntian ^{[2
,3
]}

Ma, Chao ^{[1
]}

Yang, Xiaokang ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China

[2] Eastern Inst Technol, Ningbo Inst Digital Twin, Ningbo 315200, Peoples R China

[3] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷

基金：

美国国家科学基金会;

关键词：

Consistent prediction; few-shot aerial semantic segmentation; rotation invariance; rotation-adaptive matching; NETWORK;

D O I：

10.1109/TGRS.2023.3338699

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Few-shot aerial image semantic segmentation is a challenging task that requires precisely parsing unseen-category objects in query aerial images with limited annotated support aerial images. Formally, category prototypes would be extracted from support samples to segment query images in a pixel-to-pixel matching manner. However, aerial objects in aerial images are often distributed with arbitrary orientations, and varying orientations could cause a dramatic feature change. This unique property of aerial images renders conventional matching manner without consideration of orientations fails to activate same-category objects with different orientations. Furthermore, the oscillation of the confidence scores in existing rotation-insensitive algorithms, engendered by the striking changes of object orientations, often leads to false recognition of lower scored rotated semantic objects. To tackle these challenges, inspired by the intrinsic rotation invariance in aerial images, we propose a novel few-shot rotation-invariant aerial semantic segmentation network (FRINet) to efficiently segment aerial semantic objects with diverse orientations. Specifically, through extracting orientation-varying yet category-consistent support information, FRINet provides rotation-adaptive matching for each query feature in a feature-aggregation manner. Meanwhile, to encourage consistent predictions for aerial objects with arbitrary orientations, segmentation predictions from different orientations are supervised by the same label and further fused to obtain the final rotation-invariant prediction in a complementary manner. Moreover, aiming at providing a better solution searching space, the backbones are newly pretrained in the base category to basically boost the segmentation performance. Extensive experiments on the few-shot aerial image semantic segmentation benchmark demonstrate that the proposed FRINet achieves a new state-of-the-art performance. The code is available at https://github.com/caoql98/FRINet.

引用

页码：1 / 13

页数：13

共 50 条

[31] Few-shot semantic segmentation for industrial defect recognition
Shi, Xiangwen
Zhang, Shaobing
Cheng, Miao
He, Lian
Tang, Xianghong
Cui, Zhe
COMPUTERS IN INDUSTRY, 2023, 148
[32] A lightweight siamese transformer for few-shot semantic segmentation
Hegui Zhu
Yange Zhou
Cong Jiang
Lianping Yang
Wuming Jiang
Zhimu Wang
Neural Computing and Applications, 2024, 36 : 7455 - 7469
[33] Variational Prototype Inference for Few-Shot Semantic Segmentation
Wang, Haochen
Yang, Yandan
Cao, Xianbin
Zhen, Xiantong
Snoek, Cees
Shao, Ling
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 525 - 534
[34] Harmonic Feature Activation for Few-Shot Semantic Segmentation
Liu, Binghao
Jiao, Jianbin
Ye, Qixiang
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 3142 - 3153
[35] A Strong Baseline for Generalized Few-Shot Semantic Segmentation
Hajimiri, Sina
Boudiaf, Malik
Ben Ayed, Ismail
Dolz, Jose
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11269 - 11278
[36] Few-Shot Semantic Segmentation via Mask Aggregation
Wei Ao
Shunyi Zheng
Yan Meng
Yang Yang
Neural Processing Letters, 56
[37] Few-Shot and Zero-Shot Semantic Segmentation for Food Images
Honbu, Yuma
Yanai, Keiji
PROCEEDINGS OF THE 13TH INTERNATIONAL WORKSHOP ON MULTIMEDIA FOR COOKING AND EATING ACTIVITIES (CEA '21), 2021, : 25 - 28
[38] SANet: similarity aggregation and semantic fusion for few-shot semantic segmentation
Ye, Minrui
Zhang, Tao
APPLIED INTELLIGENCE, 2025, 55 (02)
[39] Part-Based Semantic Transform for Few-Shot Semantic Segmentation
Yang, Boyu
Wan, Fang
Liu, Chang
Li, Bohao
Ji, Xiangyang
Ye, Qixiang
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (12) : 7141 - 7152
[40] SML: Semantic meta-learning for few-shot semantic segmentation * *
Pambala, Ayyappa Kumar
Dutta, Titir
Biswas, Soma
PATTERN RECOGNITION LETTERS, 2021, 147 : 93 - 99

← 1 2 3 4 5 →