Residual Adapters for Parameter-Efficient ASR Adaptation to Atypical and Accented Speech

被引：0

作者：

Tomanek, Katrin ^{[1
]}

Zayats, Vicky ^{[1
]}

Padfield, Dirk ^{[1
]}

Vaillancourt, Kara ^{[1
]}

Biadsy, Fadi ^{[1
]}

机构：

[1] Google, Mountain View, CA 94043 USA

来源：

2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021) | 2021年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Automatic Speech Recognition (ASR) systems are often optimized to work best for speakers with canonical speech patterns. Unfortunately, these systems perform poorly when tested on atypical speech and heavily accented speech. It has previously been shown that personalization through model fine-tuning substantially improves performance. However, maintaining such large models per speaker is costly and difficult to scale. We show that by adding a relatively small number of extra parameters to the encoder layers via socalled residual adapter, we can achieve similar adaptation gains compared to model finetuning, while only updating a tiny fraction (less than 0.5%) of the model parameters. We demonstrate this on two speech adaptation tasks (atypical and accented speech) and for two state-of-the-art ASR architectures.

引用

页码：6751 / 6760

页数：10

共 42 条

[21] PEMMA: Parameter-Efficient Multi-Modal Adaptation for Medical Image Segmentation
Saadi, Nada
Saeed, Numan
Yaqub, Mohammad
Nandakumar, Karthik
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT XII, 2024, 15012 : 262 - 271
[22] ELP-Adapters: Parameter Efficient Adapter Tuning for Various Speech Processing Tasks
Inoue, Nakamasa
Otake, Shinta
Hirose, Takumi
Ohi, Masanari
Kawakami, Rei
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 3867 - 3880
[23] HiFi: High-Information Attention Heads Hold for Parameter-Efficient Model Adaptation
Gui, Anchun
Xiao, Han
PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 8521 - 8537
[24] Hyper-parameter Adaptation of Conformer ASR Systems for Elderly and Dysarthric Speech Recognition
Wang, Tianzi
Hu, Shoukang
Deng, Jiajun
Jin, Zengrui
Geng, Mengzhe
Wang, Yi
Meng, Helen
Liu, Xunying
INTERSPEECH 2023, 2023, : 1733 - 1737
[25] Parameter-Efficient Adaptation of Large Vision-Language Models for Video Memorability Prediction
Martin-Fernandez, Ivan
Esteban-Romero, Sergio
Fernandez-Martinez, Fernando
Gil-Martin, Manuel
SENSORS, 2025, 25 (06)
[26] An efficient robust ASR system based on the combination of speech enhancement and HMM adaptation
Ding, Pei
Cao, Zhigang
2002, Chinese Institute of Electronics (11):
[27] An efficient robust ASR system based on the combination of speech enhancement and HMM adaptation
Ding, P
Cao, ZG
CHINESE JOURNAL OF ELECTRONICS, 2002, 11 (03): : 422 - 425
[28] Don't Stop Self-Supervision: Accent Adaptation of Speech Representations via Residual Adapters
Bhatia, Anshu
Sinha, Sanchit
Dingliwal, Saket
Gopalakrishnan, Karthik
Bodapati, Sravan
Kirchhoff, Katrin
INTERSPEECH 2023, 2023, : 3362 - 3366
[29] Leveraging Low-Rank Adaptation for Parameter-Efficient Fine-Tuning in Multi-Speaker Adaptive Text-to-Speech Synthesis
Hong, Changi
Lee, Jung Hyuk
Kim, Hong Kook
IEEE ACCESS, 2024, 12 : 190711 - 190727
[30] Structure-Aware Low-Rank Adaptation for Parameter-Efficient Fine-Tuning
Hu, Yahao
Xie, Yifei
Wang, Tianfeng
Chen, Man
Pan, Zhisong
MATHEMATICS, 2023, 11 (20)

← 1 2 3 4 5 →