Multi-view knowledge distillation for efficient semantic segmentation

被引:4
|
作者
Wang, Chen [1 ]
Zhong, Jiang [1 ]
Dai, Qizhu [1 ]
Qi, Yafei [2 ]
Shi, Fengyuan [3 ]
Fang, Bin [1 ]
Li, Xue [4 ]
机构
[1] Chongqing Univ, Sch Comp Sci, Chongqing 400044, Peoples R China
[2] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Peoples R China
[3] Northeastern Univ, Sch Informat Sci & Engn, Shenyang 110819, Peoples R China
[4] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld 4072, Australia
基金
中国国家自然科学基金;
关键词
Multi-view learning; Knowledge distillation; Knowledge aggregation; Semantic segmentation; ENSEMBLE;
D O I
10.1007/s11554-023-01296-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current state-of-the-art semantic segmentation models achieve remarkable success in segmentation accuracy. However, the huge model size and computing cost restrict their applications on low-latency online systems or devices. Knowledge distillation has been one popular solution for compressing large-scale segmentation models, which train a small segmentation model from a large teacher model. However, one teacher model's knowledge may be insufficiently diverse to train an accurate student model. Meanwhile, the student model may inherit bias from the teacher model. This paper proposes a multi-view knowledge distillation framework called MVKD for efficient semantic segmentation. MVKD could aggregate the multi-view knowledge from multiple teacher models and transfer the multi-view knowledge to the student model. In MVKD, we introduce one multi-view co-tuning strategy to acquire uniformity among the multi-view knowledge in features from different teachers. In addition, we propose a multi-view feature distillation loss and a multi-view output distillation loss to transfer the multi-view knowledge in the features and outputs from multiple teachers to the student. We evaluate the proposed MVKD on three benchmark datasets, Cityscapes, CamVid, and Pascal VOC 2012. Experimental results demonstrate the effectiveness of the proposed MVKD in compressing semantic segmentation models.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Multi-view 3D Entangled Forest For Semantic Segmentation and Mapping
    Antonello, Morris
    Wolf, Daniel
    Prankl, Johann
    Ghidoni, Stefano
    Menegatti, Emanuele
    Vincze, Markus
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 1855 - 1862
  • [22] Channel Affinity Knowledge Distillation for Semantic Segmentation
    Li, Huakun
    Zhang, Yuhang
    Tian, Shishun
    Cheng, Pengfei
    You, Rong
    Zou, Wenbin
    2023 IEEE 25TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, MMSP, 2023,
  • [23] Knowledge distillation for incremental learning in semantic segmentation
    Michieli, Umberto
    Zanuttigh, Pietro
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 205
  • [24] Semantic Communications for Multi-View Generation
    Wei, Hao
    Ni, Wanli
    Xu, Wenjun
    Jiang, Wenchao
    Niyato, Dusit
    Zhang, Ping
    IEEE COMMUNICATIONS LETTERS, 2024, 28 (06) : 1308 - 1312
  • [25] MULTI-VIEW STEREO WITH SEMANTIC PRIORS
    Stathopoulou, E. -K.
    Remondino, F.
    27TH CIPA INTERNATIONAL SYMPOSIUM: DOCUMENTING THE PAST FOR A BETTER FUTURE, 2019, 42-2 (W15): : 1135 - 1140
  • [26] Semantic-SRF: Sparse Multi-view Indoor Semantic Segmentation with Stereo Neural Radiance Fields
    Eteke, Cem
    Zhang, Jinpeng
    Steinbach, Eckehard
    2022 10TH EUROPEAN WORKSHOP ON VISUAL INFORMATION PROCESSING (EUVIP), 2022,
  • [27] Cotemporal Multi-View Video Segmentation
    Djelouah, Abdelaziz
    Franco, Jean-Sebastien
    Boyer, Edmond
    Perez, Patrick
    Drettakis, George
    PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, : 360 - 369
  • [28] Adaptive segmentation for multi-view stereo
    Khuboni, Ray
    Naidoo, Bashan
    IET COMPUTER VISION, 2017, 11 (01) : 10 - 21
  • [29] A Multi-View Integrated Ensemble for the Background Discrimination of Semi-Supervised Semantic Segmentation
    Gwak, Hyunmin
    Jeong, Yongho
    Kim, Chanyeong
    Lee, Yonghak
    Yang, Seongmin
    Kim, Sunghwan
    APPLIED SCIENCES-BASEL, 2023, 13 (24):
  • [30] Multi-view based neural network for semantic segmentation on 3D scenes
    Yonghua LU
    Mingmin ZHEN
    Tian FANG
    ScienceChina(InformationSciences), 2019, 62 (12) : 248 - 250