Improving Semantic Segmentation via Efficient Self-Training

被引:38
|
作者
Zhu, Yi [1 ]
Zhang, Zhongyue [2 ]
Wu, Chongruo [3 ]
Zhang, Zhi [1 ]
He, Tong [1 ]
Zhang, Hang [4 ]
Manmatha, R. [1 ]
Li, Mu [1 ]
Smola, Alexander [1 ]
机构
[1] Amazon Web Serv, Santa Clara, CA 95054 USA
[2] Snapchat, Sunnyvale, CA 94085 USA
[3] Univ Calif Davis, Davis, CA 95616 USA
[4] Facebook, Menlo Pk, CA 94025 USA
基金
澳大利亚研究理事会;
关键词
Training; Semantics; Computational modeling; Image segmentation; Data models; Schedules; Predictive models; Semantic segmentation; semi-supervised learning; self-training; fast training schedule; cross-domain generalization;
D O I
10.1109/TPAMI.2021.3138337
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Starting from the seminal work of Fully Convolutional Networks (FCN), there has been significant progress on semantic segmentation. However, deep learning models often require large amounts of pixelwise annotations to train accurate and robust models. Given the prohibitively expensive annotation cost of segmentation masks, we introduce a self-training framework in this paper to leverage pseudo labels generated from unlabeled data. In order to handle the data imbalance problem of semantic segmentation, we propose a centroid sampling strategy to uniformly select training samples from every class within each epoch. We also introduce a fast training schedule to alleviate the computational burden. This enables us to explore the usage of large amounts of pseudo labels. Our Centroid Sampling based Self-Training framework (CSST) achieves state-of-the-art results on Cityscapes and CamVid datasets. On PASCAL VOC 2012 test set, our models trained with the original train set even outperform the same models trained on the much bigger augmented train set. This indicates the effectiveness of CSST when there are fewer annotations. We also demonstrate promising few-shot generalization capability from Cityscapes to BDD100K and from Cityscapes to Mapillary datasets.
引用
收藏
页码:1589 / 1602
页数:14
相关论文
共 50 条
  • [31] KEST: Kernel Distance Based Efficient Self-Training for Improving Controllable Text Generation
    Feng, Yuxi
    Yi, Xiaoyuan
    Lakshmanan, Laks V. S.
    Xie, Xing
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 5049 - 5057
  • [32] A Self-Training Framework Based on Multi-Scale Attention Fusion for Weakly Supervised Semantic Segmentation
    Yang, Guoqing
    Zhu, Chuang
    Zhang, Yu
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 876 - 881
  • [33] Efficient Semantic Segmentation via Self-Attention and Self-Distillation
    An, Shumin
    Liao, Qingmin
    Lu, Zongqing
    Xue, Jing-Hao
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (09) : 15256 - 15266
  • [34] Self-training guided disentangled adaptation for cross-domain remote sensing image semantic segmentation
    Zhao, Qi
    Lyu, Shuchang
    Zhao, Hongbo
    Liu, Binghao
    Chen, Lijiang
    Cheng, Guangliang
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 127
  • [35] Unsupervised global-local domain adaptation with self-training for remote sensing image semantic segmentation
    Zhang, Junbo
    Li, Zhiyong
    Wang, Mantao
    Li, Kunhong
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2025, 46 (05) : 2254 - 2284
  • [36] Domain Adaptive LiDAR Point Cloud Segmentation via Density-Aware Self-Training
    Xiao, Aoran
    Huang, Jiaxing
    Liu, Kangcheng
    Guan, Dayan
    Zhang, Xiaoqin
    Lu, Shijian
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (10) : 13627 - 13639
  • [37] Semi self-training beard/moustache detection and segmentation simultaneously
    Le, T. Hoang Ngan
    Luu, Khoa
    Zhu, Chenchen
    Savvides, Marios
    IMAGE AND VISION COMPUTING, 2017, 58 : 214 - 223
  • [38] Transductive Image Segmentation: Self-training and Effect of Uncertainty Estimation
    Kamnitsas, Konstantinos
    Winzeck, Stefan
    Kornaropoulos, Evgenios N.
    Whitehouse, Daniel
    Englman, Cameron
    Phyu, Poe
    Pao, Norman
    Menon, David K.
    Rueckert, Daniel
    Das, Tilak
    Newcombe, Virginia F. J.
    Glocker, Ben
    DOMAIN ADAPTATION AND REPRESENTATION TRANSFER, AND AFFORDABLE HEALTHCARE AND AI FOR RESOURCE DIVERSE GLOBAL HEALTH (DART 2021), 2021, 12968 : 79 - 89
  • [39] A two-stage domain adaptive remote sensing image semantic segmentation network combined with self-training
    Luo, Zhenglian
    He, Lingmin
    2024 5TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATION, ICCEA 2024, 2024, : 847 - 852
  • [40] One Thing One Click: A Self-Training Approach for Weakly Supervised 3D Semantic Segmentation
    Liu, Zhengzhe
    Qi, Xiaojuan
    Fu, Chi-Wing
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1726 - 1736