Residual Adapters for Parameter-Efficient ASR Adaptation to Atypical and Accented Speech

Cited by: 0
Authors
Tomanek, Katrin [1]
Zayats, Vicky [1]
Padfield, Dirk [1]
Vaillancourt, Kara [1]
Biadsy, Fadi [1]
Affiliations
[1] Google, Mountain View, CA 94043 USA
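Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract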
Automatic Speech Recognition (ASR) systems are often optimized to work best for speakers with canonical speech patterns. Unfortunately, these systems perform poorly when tested on atypical speech and heavily accented speech. It has previously been shown that personalization through model fine-tuning substantially improves performance. However, maintaining such large models per speaker is costly and difficult to scale. We show that by adding a relatively small number of extra parameters to the encoder layers via so-called residual adapters, we can achieve adaptation gains similar to those of model fine-tuning, while only updating a tiny fraction (less than 0.5%) of the model parameters. We demonstrate this on two speech adaptation tasks (atypical and accented speech) and for two state-of-the-art ASR architectures.
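The abstract does not spell out the adapter architecture, so the following is only a minimal sketch of one common bottleneck design (layer norm, down-projection, ReLU, up-projection, residual add), not the authors' implementation. The class name `ResidualAdapter`, the sizes `D_MODEL`, `D_BOTTLENECK`, `NUM_LAYERS`, and the base model size `BASE_PARAMS` are all hypothetical values chosen for illustration. Biases and learned layer-norm scales are omitted for brevity.

```python
import math
import random


def layer_norm(x, eps=1e-6):
    """Normalize a feature vector to zero mean and unit variance."""
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    return [(v - mean) / math.sqrt(var + eps) for v in x]


def matvec(w, x):
    """Multiply a matrix (stored as a list of rows) by a vector."""
    return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]


class ResidualAdapter:
    """Bottleneck adapter: x + W_up @ relu(W_down @ layer_norm(x)).

    Assumed design for illustration; only these weights would be trained
    per speaker while the base ASR model stays frozen.
    """

    def __init__(self, d_model, d_bottleneck, seed=0):
        rng = random.Random(seed)
        # Down-projection: small random weights, shape (d_bottleneck, d_model).
        self.w_down = [[rng.gauss(0.0, 0.02) for _ in range(d_model)]
                       for _ in range(d_bottleneck)]
        # Up-projection starts at zero, so the untrained adapter is an
        # identity map and leaves the base model's behavior unchanged.
        self.w_up = [[0.0] * d_bottleneck for _ in range(d_model)]

    def num_params(self):
        """Count the weights in both projections."""
        down = len(self.w_down) * len(self.w_down[0])
        up = len(self.w_up) * len(self.w_up[0])
        return down + up

    def __call__(self, x):
        h = matvec(self.w_down, layer_norm(x))
        h = [max(0.0, v) for v in h]           # ReLU
        delta = matvec(self.w_up, h)
        return [xi + di for xi, di in zip(x, delta)]  # residual connection


# Hypothetical sizes, not taken from the paper.
D_MODEL, D_BOTTLENECK, NUM_LAYERS = 512, 16, 12
BASE_PARAMS = 100_000_000  # assumed size of the frozen base model

# One adapter per encoder layer.
adapters = [ResidualAdapter(D_MODEL, D_BOTTLENECK, seed=i)
            for i in range(NUM_LAYERS)]
adapter_params = sum(a.num_params() for a in adapters)
fraction = adapter_params / BASE_PARAMS
print(f"adapter parameters: {adapter_params} ({fraction:.3%} of base model)")
```

Under these assumed sizes the adapters add 196,608 weights, roughly 0.2% of the hypothetical base model, which is in line with the "less than 0.5%" figure the abstract reports. Storing only these weights per speaker, rather than a full fine-tuned copy, is what makes the approach scale.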
Pages: 6751-6760
Number of pages: 10