INTEGRATED PRONUNCIATION LEARNING FOR AUTOMATIC SPEECH RECOGNITION USING PROBABILISTIC LEXICAL MODELING

被引:0
|
作者
Rasipuram, Ramya [1 ]
Razavi, Marzieh [1 ,2 ]
Magimai-Doss, Mathew [1 ]
机构
[1] Idiap Res Inst, CH-1920 Martigny, Switzerland
[2] Ecole Polytech Fed Lausanne, CH-1015 Lausanne, Switzerland
关键词
Probabilistic lexical modeling; pronunciation lexicon; grapheme subwords; phoneme subwords; grapheme-to-phoneme conversion;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Standard automatic speech recognition (ASR) systems use phoneme-based pronunciation lexicon prepared by linguistic experts. When the hand crafted pronunciations fail to cover the vocabulary of a new domain, a grapheme-to-phoneme (G2P) converter is used to extract pronunciations for new words and then a phoneme-based ASR system is trained. G2P converters are typically trained only on the existing lexicons. In this paper, we propose a grapheme-based ASR approach in the framework of probabilistic lexical modeling that integrates pronunciation learning as a stage in ASR system training, and exploits both acoustic and lexical resources (not necessarily from the domain or language of interest). The proposed approach is evaluated on four lexical resource constrained ASR tasks and compared with the conventional two stage approach where G2P training is followed by ASR system development.
引用
收藏
页码:5176 / 5180
页数:5
相关论文
共 50 条
  • [1] Articulatory feature based continuous speech recognition using probabilistic lexical modeling
    Rasipuram, Ramya
    Magimai-Doss, Mathew
    [J]. COMPUTER SPEECH AND LANGUAGE, 2016, 36 : 233 - 259
  • [2] Special issue on modeling pronunciation variation for automatic speech recognition
    Strik, H
    [J]. SPEECH COMMUNICATION, 1999, 29 (2-4) : 81 - 82
  • [3] Lexical and Phonetic Modeling for Arabic Automatic Speech Recognition
    Nguyen, Long
    Ng, Tim
    Nguyen, Kham
    Zbib, Rabih
    Makhoul, John
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 708 - +
  • [4] Lexical modeling of non-native speech for automatic speech recognition
    Livescu, K
    Glass, J
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1683 - 1686
  • [5] AUTOMATIC PRONUNCIATION VERIFICATION FOR SPEECH RECOGNITION
    Rao, Kanishka
    Peng, Fuchun
    Beaufays, Francoise
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5162 - 5166
  • [6] Automatic Speech Recognition and Pronunciation Training
    Xiao, Wenqi
    [J]. PROCEEDINGS OF THE 2018 2ND INTERNATIONAL CONFERENCE ON EDUCATION, ECONOMICS AND MANAGEMENT RESEARCH (ICEEMR 2018), 2018, 182 : 466 - 468
  • [7] Lexical modeling for the development of Amharic automatic speech recognition systems
    Tachbelie, Martha Yifiru
    Abate, Solomon Teferra
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2023, 57 (03) : 963 - 984
  • [8] Lexical modeling for the development of Amharic automatic speech recognition systems
    Martha Yifiru Tachbelie
    Solomon Teferra Abate
    [J]. Language Resources and Evaluation, 2023, 57 : 963 - 984
  • [9] A NEW APPROACH TO SPEAKER ADAPTATION BY MODELING PRONUNCIATION IN AUTOMATIC SPEECH RECOGNITION
    SCHIEL, F
    [J]. SPEECH COMMUNICATION, 1993, 13 (3-4) : 281 - 286
  • [10] Automatic evaluation of Dutch pronunciation by using speech recognition technology
    Cucchiarini, C
    Strik, H
    Boves, L
    [J]. 1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 622 - 629