Multi-view knowledge distillation for efficient semantic segmentation

Cited by: 4
Authors
Wang, Chen [1]
Zhong, Jiang [1]
Dai, Qizhu [1]
Qi, Yafei [2]
Shi, Fengyuan [3]
Fang, Bin [1]
Li, Xue [4]
Affiliations
[1] Chongqing Univ, Sch Comp Sci, Chongqing 400044, Peoples R China
[2] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Peoples R China
[3] Northeastern Univ, Sch Informat Sci & Engn, Shenyang 110819, Peoples R China
[4] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld 4072, Australia
Funding
National Natural Science Foundation of China
Keywords
Multi-view learning; Knowledge distillation; Knowledge aggregation; Semantic segmentation; ENSEMBLE;
DOI
10.1007/s11554-023-01296-6
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Current state-of-the-art semantic segmentation models achieve remarkable segmentation accuracy. However, their large model size and computational cost restrict their use on low-latency online systems and devices. Knowledge distillation is a popular solution for compressing large-scale segmentation models: it trains a small student segmentation model under the guidance of a large teacher model. However, a single teacher model's knowledge may be insufficiently diverse to train an accurate student model, and the student model may inherit bias from the teacher model. This paper proposes a multi-view knowledge distillation framework, MVKD, for efficient semantic segmentation. MVKD aggregates the multi-view knowledge from multiple teacher models and transfers it to the student model. In MVKD, we introduce a multi-view co-tuning strategy to acquire uniformity among the multi-view knowledge in features from different teachers. In addition, we propose a multi-view feature distillation loss and a multi-view output distillation loss to transfer the multi-view knowledge in the features and outputs of multiple teachers to the student. We evaluate MVKD on three benchmark datasets: Cityscapes, CamVid, and Pascal VOC 2012. Experimental results demonstrate the effectiveness of MVKD in compressing semantic segmentation models.
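The abstract does not give the form of the multi-view output distillation loss. As a rough illustration of the general idea of distilling from several teachers, the sketch below averages the per-pixel class probabilities of multiple teachers and measures the KL divergence from that average to the student's prediction. This is a generic multi-teacher formulation assumed for illustration, not MVKD's actual loss; the function names, the uniform teacher weights, and the temperature parameter are all hypothetical.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over one pixel's class logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def aggregate_teachers(teacher_logits, temperature=1.0, weights=None):
    """Weighted average of the teachers' class distributions.

    Assumption: uniform weights unless given; the paper may aggregate differently.
    """
    n = len(teacher_logits)
    if weights is None:
        weights = [1.0 / n] * n
    probs = [softmax(logits, temperature) for logits in teacher_logits]
    num_classes = len(probs[0])
    return [
        sum(w * p[c] for w, p in zip(weights, probs))
        for c in range(num_classes)
    ]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def multi_teacher_output_loss(student_logits, teacher_logits, temperature=1.0):
    """Hypothetical per-pixel output distillation loss against averaged teachers."""
    target = aggregate_teachers(teacher_logits, temperature)
    student = softmax(student_logits, temperature)
    return kl_divergence(target, student)


# Two teachers and one student, each scoring a single pixel over three classes.
teachers = [[2.0, 0.5, -1.0], [1.5, 1.0, -0.5]]
student = [0.0, 0.0, 0.0]
loss = multi_teacher_output_loss(student, teachers)
```

In a real segmentation setting this per-pixel term would be averaged over all pixels and combined with the ordinary supervised loss; a feature-level term would be handled separately.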
Pages: 11