ACOUSTIC DATA-DRIVEN PRONUNCIATION LEXICON GENERATION FOR LOGOGRAPHIC LANGUAGES

被引:0
|
作者
Chen, Guoguo [1 ]
Povey, Daniel [1 ,2 ]
Khudanpur, Sanjeev [1 ,2 ]
机构
[1] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA
[2] Johns Hopkins Univ, Human Language Technol Ctr Excellence, Baltimore, MD 21218 USA
基金
美国国家科学基金会;
关键词
Pronunciation lexicon; logographic language; speech recognition; keyword search; SPEECH; RECOGNITION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Handcrafted pronunciation lexicons are widely used in modern speech recognition systems. Designing a pronunciation lexicon, however, requires tremendous amount of expert knowledge and effort, which is not practical when applying speech recognition techniques to low resource languages. In this paper, we are interested in developing speech recognition systems for logographic languages with only a small expert pronunciation lexicon. An iterative framework is proposed to generate and refine the phonetic transcripts of the training data, which will then be aligned to their word-level transcripts for grapheme-to-phoneme (G2P) model training. The G2P model trained this way covers graphemes that appear in the training transcripts (most of which are usually unseen in a small expert lexicon for logographic languages), therefore is able to generate pronunciations for all the words in the transcripts. The proposed lexicon generation procedure is evaluated on Cantonese speech recognition and keyword search tasks. Experiments show that starting from an expert lexicon of only 1K words, we are able to generate a lexicon that works reasonably well when compared with an expert-crafted lexicon of 5K words.
引用
收藏
页码:5350 / 5354
页数:5
相关论文
共 50 条
  • [1] ACOUSTIC DATA-DRIVEN PRONUNCIATION LEXICON FOR LARGE VOCABULARY SPEECH RECOGNITION
    Lu, Liang
    Ghoshal, Arnab
    Renals, Steve
    [J]. 2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2013, : 374 - 379
  • [2] Acoustic data-driven lexicon learning based on a greedy pronunciation selection framework
    Zhang, Xiaohui
    Manohar, Vimal
    Povey, Daniel
    Khudanpur, Sanjeev
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2541 - 2545
  • [3] Multimodal neural pronunciation modeling for spoken languages with logographic origin
    Minh Nguyen
    Ngo, Gia H.
    Chen, Nancy F.
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 2916 - 2922
  • [4] A data-driven method for modeling pronunciation variation
    Kessens, JM
    Cucchiarini, C
    Strik, H
    [J]. SPEECH COMMUNICATION, 2003, 40 (04) : 517 - 534
  • [5] Data-driven generation of pronunciation dictionaries in the German Verbmobil project - Discussion of experimental results
    Eichner, M
    Wolff, M
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1687 - 1690
  • [6] ACOUSTIC UNIT DISCOVERY AND PRONUNCIATION GENERATION FROM A GRAPHEME-BASED LEXICON
    Hartmann, William
    Roy, Anindya
    Lamel, Lori
    Gauvain, Jean-Luc
    [J]. 2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2013, : 380 - 385
  • [7] Prediction of pronunciation variations for speech synthesis: A data-driven approach
    Bennett, CL
    Black, AW
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 297 - 300
  • [8] Systematizing the lexicon of platforms in information systems: a data-driven study
    Bartelheimer, Christian
    zur Heiden, Philipp
    Luettenberg, Hedda
    Beverungen, Daniel
    [J]. ELECTRONIC MARKETS, 2022, 32 (01) : 375 - 396
  • [9] Systematizing the lexicon of platforms in information systems: a data-driven study
    Christian Bartelheimer
    Philipp zur Heiden
    Hedda Lüttenberg
    Daniel Beverungen
    [J]. Electronic Markets, 2022, 32 : 375 - 396
  • [10] On the expressiveness of event notification in data-driven coordination languages
    Busi, N
    Zavattaro, G
    [J]. PROGRAMMING LANGUAGES AND SYSTEMS, PROCEEDINGS, 2000, 1782 : 41 - 55