ADAPTABLE MULTI-DOMAIN LANGUAGE MODEL FOR TRANSFORMER ASR

Cited by: 4
Authors
Lee, Taewoo [1]
Lee, Min-Joong [2]
Kang, Tae Gyoon [2]
Jung, Seokyeoung [1]
Kwon, Minseok [1]
Hong, Yeona [1]
Lee, Jungin [1]
Woo, Kyoung-Gu [1]
Kim, Ho-Gyeong [2]
Jeong, Jiseung [2]
Lee, Jihyun [2]
Lee, Hosik [2]
Choi, Young Sang [2]
Affiliations
[1] Samsung Elect, AI R&D Grp, Suwon Shi, South Korea
[2] Samsung Elect, Samsung Adv Inst Technol, Suwon Shi, South Korea
Keywords
End-to-end (E2E) automatic speech recognition (ASR); language model (LM); multi-domain adaptation
DOI
10.1109/ICASSP39728.2021.9413475
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
We propose an adapter-based multi-domain Transformer language model (LM) for Transformer ASR. The model consists of a large common LM and small adapters, and it performs multi-domain adaptation using only the small adapters and their related layers. The proposed model can reuse a fully fine-tuned LM, that is, one fine-tuned using all layers of the original model. The proposed LM can be extended to new domains by adding about 2% of the parameters for the first domain and 13% of the parameters from the second domain onward. The proposed model also reduces model maintenance cost, because the costly and time-consuming pre-training of the common LM can be omitted. Using the proposed adapter-based approach, we observed that a general LM with an adapter can outperform a dedicated music-domain LM in terms of word error rate (WER).
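The abstract does not specify the adapter architecture, so the following is only a minimal sketch of a generic residual bottleneck adapter, which is one common way to add a small trainable module to a large frozen model. The class `BottleneckAdapter`, the helper `adapter_overhead`, and all sizes (`d_model`, `d_ff`, `n_layers`, `d_bottleneck`) are hypothetical and are not taken from the paper; the overhead it computes is illustrative and is not meant to reproduce the 2% or 13% figures.

```python
import math
import random


class BottleneckAdapter:
    """Generic residual bottleneck adapter: x + W_up @ relu(W_down @ x).

    Assumption: this is a common adapter design, not necessarily the paper's.
    """

    def __init__(self, d_model, d_bottleneck, seed=0):
        rng = random.Random(seed)
        scale = 1.0 / math.sqrt(d_model)
        # Down-projection matrix with shape (d_bottleneck, d_model).
        self.w_down = [[rng.gauss(0.0, scale) for _ in range(d_model)]
                       for _ in range(d_bottleneck)]
        # Up-projection starts at zero, so the adapter is initially an identity map.
        self.w_up = [[0.0] * d_bottleneck for _ in range(d_model)]

    def num_params(self):
        # Weights only; biases are omitted in this sketch.
        return (len(self.w_down) * len(self.w_down[0])
                + len(self.w_up) * len(self.w_up[0]))

    def forward(self, x):
        # Project down, apply ReLU, project up, then add the residual input.
        hidden = [max(0.0, sum(w * xi for w, xi in zip(row, x)))
                  for row in self.w_down]
        update = [sum(w * h for w, h in zip(row, hidden)) for row in self.w_up]
        return [xi + ui for xi, ui in zip(x, update)]


def adapter_overhead(d_model, d_ff, n_layers, d_bottleneck):
    """Extra-parameter fraction with one adapter per layer.

    Biases, layer norms, and embeddings are ignored for simplicity.
    """
    # Per Transformer layer: 4 attention projections + 2 feed-forward matrices.
    base = n_layers * (4 * d_model * d_model + 2 * d_model * d_ff)
    extra = n_layers * 2 * d_model * d_bottleneck
    return extra / base


# Hypothetical sizes, chosen only to keep the example small.
adapter = BottleneckAdapter(d_model=8, d_bottleneck=2)
x = [0.5] * 8
y = adapter.forward(x)
overhead = adapter_overhead(d_model=512, d_ff=2048, n_layers=6, d_bottleneck=32)
print(adapter.num_params(), f"{overhead:.2%}")
```

In a setup like this, the common LM weights would stay fixed and only the adapter weights would be trained per domain, which is why adding a domain costs a small fraction of the full model size.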
Pages: 7358-7362
Page count: 5