Improving Semantic Segmentation via Efficient Self-Training

Cited by: 38

Authors:

Zhu, Yi [1]

Zhang, Zhongyue [2]

Wu, Chongruo [3]

Zhang, Zhi [1]

He, Tong [1]

Zhang, Hang [4]

Manmatha, R. [1]

Li, Mu [1]

Smola, Alexander [1]

Affiliations:

[1] Amazon Web Services, Santa Clara, CA 95054, USA

[2] Snapchat, Sunnyvale, CA 94085, USA

[3] University of California, Davis, Davis, CA 95616, USA

[4] Facebook, Menlo Park, CA 94025, USA

Source:

IEEE Transactions on Pattern Analysis and Machine Intelligence | 2024, Vol. 46, No. 3

Funding:

Australian Research Council

Keywords:

Training; Semantics; Computational modeling; Image segmentation; Data models; Schedules; Predictive models; Semantic segmentation; semi-supervised learning; self-training; fast training schedule; cross-domain generalization;

DOI:

10.1109/TPAMI.2021.3138337

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Starting from the seminal work on Fully Convolutional Networks (FCN), there has been significant progress in semantic segmentation. However, deep learning models often require large amounts of pixelwise annotations to be accurate and robust. Given the prohibitively expensive annotation cost of segmentation masks, we introduce a self-training framework in this paper to leverage pseudo labels generated from unlabeled data. To handle the data imbalance problem of semantic segmentation, we propose a centroid sampling strategy that uniformly selects training samples from every class within each epoch. We also introduce a fast training schedule to alleviate the computational burden, which enables us to explore the use of large amounts of pseudo labels. Our Centroid Sampling based Self-Training framework (CSST) achieves state-of-the-art results on the Cityscapes and CamVid datasets. On the PASCAL VOC 2012 test set, our models trained with the original train set even outperform the same models trained on the much bigger augmented train set. This indicates the effectiveness of CSST when there are fewer annotations. We also demonstrate promising few-shot generalization capability from Cityscapes to BDD100K and from Cityscapes to Mapillary datasets.
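
The abstract only says that training samples are drawn uniformly from every class within each epoch; it does not give the details of centroid sampling. As a rough illustration of that one idea, the following is a minimal sketch of class-uniform sampling, not the authors' implementation. The function name `class_uniform_epoch`, the input format (image id mapped to the set of class ids it contains), and the `samples_per_class` quota are all assumptions made for this example.

```python
import random
from collections import defaultdict


def class_uniform_epoch(image_classes, samples_per_class, seed=0):
    """Build one epoch with the same number of samples for every class.

    image_classes: dict mapping image id -> set of class ids present in
    that image (hypothetical input format, not taken from the paper).
    Returns a shuffled list of (image_id, class_id) pairs.
    """
    rng = random.Random(seed)

    # Invert the mapping: class id -> list of images that contain it.
    by_class = defaultdict(list)
    for image_id, classes in image_classes.items():
        for class_id in classes:
            by_class[class_id].append(image_id)

    epoch = []
    for class_id in sorted(by_class):
        pool = sorted(by_class[class_id])
        # Sample with replacement so rare classes still fill their quota.
        for _ in range(samples_per_class):
            epoch.append((rng.choice(pool), class_id))

    rng.shuffle(epoch)
    return epoch


# Toy example: class 0 is common, classes 1 and 2 are rare.
data = {"a": {0, 1}, "b": {0}, "c": {0}, "d": {2}}
epoch = class_uniform_epoch(data, samples_per_class=4)
```

In this sketch each returned pair names an image and the class it was chosen for, so a data loader could, for example, crop around a region of that class. How the crop location is chosen is exactly the part the abstract leaves unspecified.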


Pages: 1589-1602

Page count: 14

