Coarse-to-fine semantic segmentation of satellite images

Cited by: 2

Authors:

Chen, Hao [1]

Yang, Wen [2]

Liu, Li [3]

Xia, Gui-Song [1,4,5]

Affiliations:

[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430079, Peoples R China

[2] Wuhan Univ, Sch Elect Informat, Wuhan 430079, Peoples R China

[3] Natl Univ Def Technol, Coll Elect Sci & Technol, Changsha 410073, Peoples R China

[4] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Sch Comp Sci, Wuhan 430072, Peoples R China

[5] Wuhan Univ, Inst Artificial Intelligence, Wuhan 430072, Peoples R China

Source:

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING | 2024 / Volume 217

Funding:

National Natural Science Foundation of China

Keywords:

Local-softmax; Multi-prototype learning; Semantic segmentation; COVER; AREA;

DOI:

10.1016/j.isprsjprs.2024.07.028

Chinese Library Classification (CLC):

P9 [Physical Geography]

Discipline classification codes:

0705 ; 070501 ;

Abstract:

Training deep neural networks for semantic segmentation of aerial images relies heavily on obtaining a large number of precise pixel-level annotations, which can cause significant annotation expenses. Given the fact that acquiring fine-class annotations is considerably more challenging than obtaining coarse-class annotations, we present a novel semi-supervised learning framework, which utilizes high spatial resolution images annotated with coarse-class labels alongside a very small set of fine-grained annotated images as the training set, thereby achieving classification results that are refined in both spatial resolution and categorical granularity. Specifically, this framework adopts Mix Transformer (MiT) as the backbone architecture to accommodate both local feature extraction and long-range dependency modeling capabilities and utilizes multi-prototype learning to model each class as multiple sub-prototypes, preserving the intrinsic variance characteristics within classes. We propose a dedicated co-training approach tailored for extracting fine-grained pseudo-labels from coarse-grained samples. In this approach, a local-softmax pseudo-labeling strategy is developed to ensure a harmonious balance between the efficiency and accuracy of the pseudo-labeling, and four losses are formulated for both single-level class and cross-category granularity supervised learning. We evaluate the proposed framework on the Gaofen Image Dataset (GID) and Five-Billion-Pixels (FBP) dataset, confirming its feasibility and superior results. In particular, based on coarse-class annotations, the performance achieved using only 5% of fine-class labels, in terms of the four metrics, namely mIoU, mean UA, mean F1-score, and OA, reached 91%, 96%, 89%, and 93% of the fully-supervised baseline performance respectively. The code is available at https://github.com/chenhaocs/C2F.
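Illustrative note: the abstract names a local-softmax pseudo-labeling strategy but does not define it. One plausible reading, offered here only as an assumption and not as the authors' method, is that the softmax for a coarsely labeled pixel is computed over just the fine classes that belong to its coarse class. The hierarchy, the confidence threshold, and the ignore index in this minimal sketch are all hypothetical.

```python
import math

# Hypothetical coarse-to-fine hierarchy: coarse class id -> fine class ids.
# The real GID/FBP class mapping is not given in the abstract.
COARSE_TO_FINE = {
    0: [0, 1, 2],  # illustrative coarse class with three fine classes
    1: [3, 4],     # illustrative coarse class with two fine classes
}


def local_softmax(logits, coarse_label, hierarchy=COARSE_TO_FINE):
    """Softmax restricted to the fine classes under `coarse_label`.

    Returns a list as long as `logits`; fine classes outside the coarse
    label's children receive probability 0.
    """
    children = hierarchy[coarse_label]
    peak = max(logits[c] for c in children)  # subtract max for stability
    exps = {c: math.exp(logits[c] - peak) for c in children}
    total = sum(exps.values())
    return [exps[c] / total if c in exps else 0.0 for c in range(len(logits))]


def pseudo_label(logits, coarse_label, threshold=0.6, ignore_index=255):
    """Pick the most probable child class, or `ignore_index` if unsure.

    `threshold` and `ignore_index` are assumed values for illustration.
    """
    probs = local_softmax(logits, coarse_label)
    best = max(range(len(probs)), key=probs.__getitem__)
    return best if probs[best] >= threshold else ignore_index


# Fine class 3 has the largest raw logit, but the pixel's coarse label is 0,
# so only fine classes 0, 1 and 2 compete.
pixel_logits = [0.1, 2.0, 0.3, 5.0, 0.2]
label = pseudo_label(pixel_logits, coarse_label=0)
```

Under this reading, the coarse annotation acts as a hard constraint: a pixel can only receive a fine pseudo-label that is consistent with its coarse label, and low-confidence pixels are left unlabeled.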

Pages: 1-17

Page count: 17
