ACOUSTIC DATA-DRIVEN PRONUNCIATION LEXICON GENERATION FOR LOGOGRAPHIC LANGUAGES

Authors
Chen, Guoguo [1]
Povey, Daniel [1, 2]
Khudanpur, Sanjeev [1, 2]
Affiliations
[1] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA
[2] Johns Hopkins Univ, Human Language Technol Ctr Excellence, Baltimore, MD 21218 USA
Funding
National Science Foundation (USA)
Keywords
Pronunciation lexicon; logographic language; speech recognition; keyword search; SPEECH; RECOGNITION;
DOI
Not available
Chinese Library Classification
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
Handcrafted pronunciation lexicons are widely used in modern speech recognition systems. Designing a pronunciation lexicon, however, requires a tremendous amount of expert knowledge and effort, which is not practical when applying speech recognition techniques to low-resource languages. In this paper, we are interested in developing speech recognition systems for logographic languages with only a small expert pronunciation lexicon. An iterative framework is proposed to generate and refine the phonetic transcripts of the training data, which are then aligned to their word-level transcripts for grapheme-to-phoneme (G2P) model training. The G2P model trained this way covers the graphemes that appear in the training transcripts (most of which are usually unseen in a small expert lexicon for logographic languages), and is therefore able to generate pronunciations for all the words in the transcripts. The proposed lexicon generation procedure is evaluated on Cantonese speech recognition and keyword search tasks. Experiments show that, starting from an expert lexicon of only 1K words, we are able to generate a lexicon that works reasonably well when compared with an expert-crafted lexicon of 5K words.
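
The last stage described in the abstract (aligning phonetic transcripts to word-level transcripts, then training a G2P model that covers every grapheme in the transcripts) can be illustrated with a toy sketch. This is not the authors' implementation: it assumes the phonetic transcripts are already available and syllable-segmented, that each grapheme corresponds to exactly one syllable so alignment is positional, and that the G2P model is a simple most-frequent-pronunciation table. The function names `train_char_g2p` and `pronounce`, the graphemes `A` to `D`, and the syllable labels `s1` to `s4` are all hypothetical placeholders.

```python
from collections import Counter, defaultdict


def train_char_g2p(utterances):
    """Count grapheme/syllable co-occurrences from position-aligned pairs.

    utterances: list of (graphemes, syllables) pairs. Assumption: one
    syllable per grapheme, so alignment is simply by position.
    """
    counts = defaultdict(Counter)
    for graphemes, syllables in utterances:
        if len(graphemes) != len(syllables):
            continue  # skip utterances that violate the one-to-one assumption
        for g, s in zip(graphemes, syllables):
            counts[g][s] += 1
    return counts


def pronounce(word, counts):
    """Return the most frequent syllable for each grapheme in `word`,
    or None if any grapheme never appeared in the training transcripts."""
    pron = []
    for g in word:
        if g not in counts:
            return None
        pron.append(counts[g].most_common(1)[0][0])
    return pron


# Hypothetical toy data: placeholder graphemes and syllable labels.
utterances = [
    (["A", "B"], ["s1", "s2"]),
    (["B", "C"], ["s2", "s3"]),
    (["A", "C"], ["s1", "s4"]),  # a noisy pronunciation for "C"
    (["C"], ["s3"]),
]
model = train_char_g2p(utterances)
lexicon = {w: pronounce(w, model) for w in ["AB", "CA", "D"]}
```

In this sketch the noisy label `s4` for `C` is outvoted by the two occurrences of `s3`, and the unseen grapheme `D` gets no pronunciation, which mirrors why coverage of the training transcripts matters for a logographic language.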
Pages: 5350-5354
Page count: 5