Transfer Learning for Automatic Speech Recognition Systems

Authors
Asefisaray, Behnam [1]
Haznedaroglu, Ali [1]
Erden, Mustafa [1]
Arslan, Levent M. [1, 2]
Affiliations
[1] Sestek, Istanbul, Turkey
[2] Bogazici Univ, Elekt Elekt Muhendisligi Bolumu, Istanbul, Turkey
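Keywords
transfer learning; automatic speech recognition; deep neural networks
DOI
Not available
Chinese Library Classification (CLC)
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline classification codes
0808; 0809
Abstract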
This paper presents the effects of transfer learning on deep neural network based speech recognition systems. The source acoustic model is trained on a large corpus of call center telephony recordings, and acoustically mismatched out-of-domain data consisting of meeting recordings of the Grand National Assembly of Turkey is selected as the target. Our goal is to adapt the source model to the target data using transfer learning, and we investigate the effects of different target training data sizes, transferred layer counts, and feature extractors on transfer learning. Our experiments show that, for all target training data sizes, the transferred models outperform the models trained only on the target data, and the model transferred using 20 hours of target data achieves 7.8% higher recognition accuracy than the source model.
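The idea of a transferred layer count can be illustrated with a minimal sketch. This is not the authors' implementation: the layer sizes, the plain-list weight representation, and the helper names init_layer and transfer_layers are hypothetical, and re-initialising the non-transferred layers is an assumption about how such a setup might work. The first n_transfer layers are copied from the source model, and the remaining layers receive fresh random weights that would then be learned from the target data.

```python
import copy
import random


def init_layer(n_in, n_out, rng):
    """Return a randomly initialised weight matrix with n_out rows and n_in columns."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]


def transfer_layers(source_layers, n_transfer, rng):
    """Build a target network that reuses the first n_transfer source layers.

    Assumption for illustration: layers beyond n_transfer keep the source
    shapes but are re-initialised, so they must be trained on target data.
    """
    if not 0 <= n_transfer <= len(source_layers):
        raise ValueError("n_transfer must be between 0 and the number of layers")
    target = []
    for i, layer in enumerate(source_layers):
        if i < n_transfer:
            # Transferred layer: independent copy of the source weights.
            target.append(copy.deepcopy(layer))
        else:
            # Non-transferred layer: same shape, fresh random weights.
            n_out, n_in = len(layer), len(layer[0])
            target.append(init_layer(n_in, n_out, rng))
    return target


rng = random.Random(0)
# Hypothetical sizes: 40-dimensional input features, three hidden layers, 100 outputs.
layer_sizes = [40, 64, 64, 64, 100]
source = [init_layer(a, b, rng) for a, b in zip(layer_sizes, layer_sizes[1:])]
target = transfer_layers(source, n_transfer=3, rng=rng)
```

Varying n_transfer from 0 (no transfer) up to len(source) (all layers copied) corresponds to the kind of comparison over transferred layer counts that the abstract describes.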
Pages: 4