SKILL: SIMILARITY-AWARE KNOWLEDGE DISTILLATION FOR SPEECH SELF-SUPERVISED LEARNING

Cited by: 0

Authors:

Zampierin, Luca [1,2]

Hacene, Ghouthi Boukli [1,5]

Nguyen, Bac [1]

Ravanelli, Mirco [3,4,5]

Affiliations:

[1] Sony Europe BV, Stuttgart Lab 1, Stuttgart, Germany

[2] Ecole Polytech Fed Lausanne, Lausanne, Switzerland

[3] Concordia Univ, Montreal, PQ, Canada

[4] Univ Montreal, Montreal, PQ, Canada

[5] Mila Quebec AI Inst, Montreal, PQ, Canada

Source:

2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024 | 2024

Keywords:

Model compression; self-supervised learning; knowledge distillation

DOI:

10.1109/ICASSPW62465.2024.10626978

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

Self-supervised learning (SSL) has achieved remarkable success across various speech-processing tasks. To make SSL models more efficient, previous works often rely on compression techniques. A notable recent attempt is DPHuBERT, which applies joint knowledge distillation (KD) and structured pruning to learn a significantly smaller SSL model. In this paper, we contribute to this research domain by introducing SKILL, a novel method that conducts distillation across groups of layers instead of distilling individual, arbitrarily selected layers within the teacher network. The layers to distill are identified through a hierarchical clustering procedure applied to layer similarity measures. Extensive experiments demonstrate that our distilled version of WavLM Base+ not only outperforms DPHuBERT but also achieves state-of-the-art results in the 30M-parameter model class across several SUPERB tasks.
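The layer-grouping idea in the abstract can be illustrated with a minimal sketch. This record does not state which similarity measure, linkage rule, or target construction the authors use, so everything below is an assumption made for illustration: linear CKA as the layer similarity, average-linkage agglomerative clustering restricted to adjacent layers, and the mean of each group's activations as the distillation target. The function names (linear_cka, similarity_matrix, cluster_layers, group_targets) and the toy data are hypothetical.

```python
import numpy as np


def linear_cka(x: np.ndarray, y: np.ndarray) -> float:
    """Linear CKA between two (frames, features) activation matrices.

    Assumption: the record does not name the similarity measure.
    """
    x = x - x.mean(axis=0, keepdims=True)
    y = y - y.mean(axis=0, keepdims=True)
    cross = np.linalg.norm(y.T @ x, ord="fro") ** 2
    norm_x = np.linalg.norm(x.T @ x, ord="fro")
    norm_y = np.linalg.norm(y.T @ y, ord="fro")
    return float(cross / (norm_x * norm_y))


def similarity_matrix(layers):
    """Pairwise similarity between all teacher layers."""
    n = len(layers)
    sim = np.eye(n)
    for i in range(n):
        for j in range(i + 1, n):
            sim[i, j] = sim[j, i] = linear_cka(layers[i], layers[j])
    return sim


def cluster_layers(sim, num_groups):
    """Bottom-up clustering that only merges neighbouring groups.

    Assumption: average linkage and contiguous groups are illustrative
    choices, not taken from the paper.
    """
    groups = [[i] for i in range(sim.shape[0])]
    while len(groups) > num_groups:
        # Average similarity between each pair of adjacent groups.
        scores = [
            sim[np.ix_(groups[k], groups[k + 1])].mean()
            for k in range(len(groups) - 1)
        ]
        k = int(np.argmax(scores))
        groups[k:k + 2] = [groups[k] + groups[k + 1]]  # merge the closest pair
    return groups


def group_targets(layers, groups):
    """One distillation target per group: the mean activation of its layers."""
    return [np.mean([layers[i] for i in g], axis=0) for g in groups]


# Toy "teacher": six layers built from two underlying signals plus noise.
rng = np.random.default_rng(0)
base_a = rng.standard_normal((200, 16))
base_b = rng.standard_normal((200, 16))
layers = [
    base + 0.05 * rng.standard_normal(base.shape)
    for base in (base_a, base_a, base_a, base_b, base_b, base_b)
]

sim = similarity_matrix(layers)
groups = cluster_layers(sim, num_groups=2)
targets = group_targets(layers, groups)
print(groups)
```

The toy data is constructed so that layers 0 to 2 and layers 3 to 5 fall into separate groups. In a real setting the activations would come from the teacher network, and the student would be trained to match one target per group rather than a hand-picked subset of layers.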

Pages: 675-679

Page count: 5

