SKILL: SIMILARITY-AWARE KNOWLEDGE DISTILLATION FOR SPEECH SELF-SUPERVISED LEARNING

被引:0
|
作者
Zampierin, Luca [1 ,2 ]
Hacene, Ghouthi Boukli [1 ,5 ]
Nguyen, Bac [1 ]
Ravanelli, Mirco [3 ,4 ,5 ]
机构
[1] Sony Europe BV, Stuttgart Lab 1, Stuttgart, Germany
[2] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
[3] Concordia Univ, Montreal, PQ, Canada
[4] Univ Montreal, Montreal, PQ, Canada
[5] Mila Quebec AI Inst, Montreal, PQ, Canada
关键词
Model compression; self-supervised learning; knowledge distillation;
D O I
10.1109/ICASSPW62465.2024.10626978
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Self-supervised learning (SSL) has achieved remarkable success across various speech-processing tasks. To enhance its efficiency, previous works often leverage the use of compression techniques. A notable recent attempt is DPHuBERT, which applies joint knowledge distillation (KD) and structured pruning to learn a significantly smaller SSL model. In this paper, we contribute to this research domain by introducing SKILL, a novel method that conducts distillation across groups of layers instead of distilling individual arbitrarily selected layers within the teacher network. The identification of the layers to distill is achieved through a hierarchical clustering procedure applied to layer similarity measures. Extensive experiments demonstrate that our distilled version ofWavLM Base+ not only outperforms DPHuBERT but also achieves state-of-the-art results in the 30M parameters model class across several SUPERB tasks.
引用
收藏
页码:675 / 679
页数:5
相关论文
共 50 条
  • [21] Self-Supervised Hypergraph Learning for Knowledge-Aware Social Recommendation
    Li, Munan
    Li, Jialong
    Yang, Liping
    Ding, Qi
    ELECTRONICS, 2024, 13 (07)
  • [22] Knowledge-Aware Self-supervised Graph Representation Learning for Recommendation
    Sun, Yeheng
    Zhu, Jinghua
    Xi, Heran
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 420 - 432
  • [23] COMEDIAN: Self-Supervised Learning and Knowledge Distillation for Action Spotting using Transformers
    Denize, Julien
    Liashuha, Mykola
    Rabarisoa, Jaonary
    Orcesi, Astrid
    Herault, Romain
    2024 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS, WACVW 2024, 2024, : 518 - 528
  • [24] Similarity-Aware Network Embedding with Self-Paced Learning
    Huang, Chao
    Shi, Baoxu
    Zhang, Xuchao
    Wu, Xian
    Chawla, Nitesh V.
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2113 - 2116
  • [25] A perceptual similarity space for speech based on self-supervised speech representations
    Chernyak, Bronya R.
    Bradlow, Ann R.
    Keshet, Joseph
    Goldrick, Matthew
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2024, 155 (06): : 3915 - 3929
  • [26] Geography-Aware Self-Supervised Learning
    Ayush, Kumar
    Uzkent, Burak
    Meng, Chenlin
    Tanmay, Kumar
    Burke, Marshall
    Lobell, David
    Ermon, Stefano
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 10161 - 10170
  • [27] Initiative-Aware Self-Supervised Learning for Knowledge-Grounded Conversations
    Meng, Chuan
    Ren, Pengjie
    Chen, Zhumin
    Ren, Zhaochun
    Xi, Tengxiao
    de Rijke, Maarten
    SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 522 - 532
  • [28] DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models
    Peng, Yifan
    Sudo, Yui
    Muhammad, Shakeel
    Watanabe, Shinji
    INTERSPEECH 2023, 2023, : 62 - 66
  • [29] Self-Supervised Speech Representation Learning: A Review
    Mohamed, Abdelrahman
    Lee, Hung-yi
    Borgholt, Lasse
    Havtorn, Jakob D.
    Edin, Joakim
    Igel, Christian
    Kirchhoff, Katrin
    Li, Shang-Wen
    Livescu, Karen
    Maaloe, Lars
    Sainath, Tara N.
    Watanabe, Shinji
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (06) : 1179 - 1210
  • [30] Similarity-Aware Skill Reproduction based on Multi-Representational Learning from Demonstration
    Hertel, Brendan
    Ahmadzadeh, S. Reza
    2021 20TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), 2021, : 652 - 657