ADAPTABLE MULTI-DOMAIN LANGUAGE MODEL FOR TRANSFORMER ASR

被引:4
|
作者
Lee, Taewoo [1 ]
Lee, Min-Joong [2 ]
Kang, Tae Gyoon [2 ]
Jung, Seokyeoung [1 ]
Kwon, Minseok [1 ]
Hong, Yeona [1 ]
Lee, Jungin [1 ]
Woo, Kyoung-Gu [1 ]
Kim, Ho-Gyeong [2 ]
Jeong, Jiseung [2 ]
Lee, Jihyun [2 ]
Lee, Hosik [2 ]
Choi, Young Sang [2 ]
机构
[1] Samsung Elect, AI R&D Grp, Suwon Shi, South Korea
[2] Samsung Elect, Samsung Adv Inst Technol, Suwon Shi, South Korea
关键词
End-to-end (E2E) automatic speech recognition (ASR); language model (LM); multi-domain adaptation;
D O I
10.1109/ICASSP39728.2021.9413475
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We propose an adapter based multi-domain Transformer based language model (LM) for Transformer ASR. The model consists of a big size common LM and small size adapters. The model can perform multi-domain adaptation with only the small size adapters and its related layers. The proposed model can reuse the full fine-tuned LM which is fine-tuned using all layers of an original model. The proposed LM can be expanded to new domains by adding about 2% of parameters for a first domain and 13% parameters for after second domain. The proposed model is also effective in reducing the model maintenance cost because it is possible to omit the costly and time-consuming common LM pre-training process. Using proposed adapter based approach, we observed that a general LM with adapter can outperform a dedicated music domain LM in terms of word error rate (WER).
引用
收藏
页码:7358 / 7362
页数:5
相关论文
共 50 条
  • [21] GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio
    Chen, Guoguo
    Chai, Shuzhou
    Wang, Guanbo
    Du, Jiayu
    Zhang, Wei-Qiang
    Weng, Chao
    Su, Dan
    Povey, Daniel
    Trmal, Jan
    Zhang, Junbo
    Jin, Mingjie
    Khudanpur, Sanjeev
    Watanabe, Shinji
    Zhae, Shuaijiang
    Zou, Wei
    Li, Xiangang
    Yao, Xuchen
    Wang, Yongqing
    You, Zhao
    Yan, Zhiyong
    INTERSPEECH 2021, 2021, : 3670 - 3674
  • [22] Hypotheses Ranking and State Tracking for a Multi-Domain Dialog System using Multiple ASR Alternates
    Khan, Omar Zia
    Robichaud, Jean-Philippe
    Crook, Paul
    Sarikaya, Ruhi
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2022 - 2026
  • [23] Deepfake detection based on cross-domain local characteristic analysis with multi-domain transformer
    Amin, Muhammad Ahmad
    Hu, Yongjian
    Li, Chang-Tsun
    Liu, Beibei
    ALEXANDRIA ENGINEERING JOURNAL, 2024, 91 : 592 - 609
  • [24] MDViT: Multi-domain Vision Transformer for Small Medical Image Segmentation Datasets
    Du, Siyi
    Bayasi, Nourhan
    Hamarneh, Ghassan
    Garbi, Rafeef
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 : 448 - 458
  • [25] A Transformer-Based Multi-Domain Recommender System for E-commerce
    Morales-Murillo, Victor Giovanni
    Pinto, David
    Perez-Tellez, Fernando
    Rojas-Lopez, Franco
    INTERNATIONAL JOURNAL OF COMBINATORIAL OPTIMIZATION PROBLEMS AND INFORMATICS, 2024, 15 (02):
  • [26] MDL-NAS: A Joint Multi-domain Learning Framework for Vision Transformer
    Wang, Shiguang
    Xie, Tao
    Cheng, Jian
    Zhang, Xingcheng
    Liu, Haijun
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 20094 - 20104
  • [27] Multi-Domain Delegation and Revocation Model for Grid Systems
    Geethakumari, G.
    Jampala, Srikanth
    Venkatesan, T. L. Prasanna
    Negi, Atul
    Sastry, V. N.
    2008 IEEE REGION 10 CONFERENCE: TENCON 2008, VOLS 1-4, 2008, : 2477 - +
  • [28] Practical Aspects of Implementation of a Multi-domain LED Model
    Poppe, Andras
    Szalai, Albin
    2014 30TH ANNUAL SEMICONDUCTOR THERMAL MEASUREMENT AND MANAGEMENT SYMPOSIUM (SEMI-THERM), 2014, : 153 - 158
  • [29] The RBAC model and implementation architecture in multi-domain environment
    Yang, Zan
    Wang, Jian-xin
    Yang, Lin
    Yang, Rui-guang
    Kou, Bao-sheng
    Chen, Jie-kun
    Yang, Shu-mei
    ELECTRONIC COMMERCE RESEARCH, 2013, 13 (03) : 273 - 289
  • [30] A Holistic Multi-Domain Association Model for Industrial Data
    AlGeddawy, Tarek
    ElMaraghy, Hoda
    30TH INTERNATIONAL CONFERENCE ON FLEXIBLE AUTOMATION AND INTELLIGENT MANUFACTURING (FAIM2021), 2020, 51 : 920 - 925