Improving Grapheme-based ASR by Probabilistic Lexical Modeling Approach

被引:0
|
作者
Rasipuram, Ramya [1 ,2 ]
Magimai-Doss, Mathew [1 ]
机构
[1] Idiap Res Inst, CH-1920 Martigny, Switzerland
[2] Ecole Polytech Fed Lausanne, CH-1015 Lausanne, Switzerland
关键词
Automatic speech recognition; hidden Markov model; Lexical modeling; Graphemes; Phonemes; Posterior features; Kullback-Leibler divergence based HMM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There is growing interest in using graphemes as subword units, especially in the context of the rapid development of hidden Markov model (HMM) based automatic speech recognition (ASR) system, as it eliminates the need to build a phoneme pronunciation lexicon. However, directly modeling the relationship between acoustic feature observations and grapheme states may not be always trivial. It usually depends upon the grapheme-to-phoneme relationship within the language. This paper builds upon our recent interpretation of Kullback-Leibler divergence based HMM (KL-HMM) as a probabilistic lexical modeling approach to propose a novel grapheme-based ASR approach where, first a set of acoustic units are derived by modeling context-dependent graphemes in the framework of conventional HMM/Gaussian mixture model (HMM/GMM) system, and then the probabilistic relationship between the derived acoustic units and the lexical units representing graphemes is modeled in the framework of KL-HMM. Through experimental studies on English, where the grapheme-to-phoneme relationship is irregular, we show that the proposed grapheme-based ASR approach (without using any phoneme information) can achieve performance comparable to standard phoneme-based ASR approach.
引用
下载
收藏
页码:505 / 509
页数:5
相关论文
共 50 条
  • [11] Acoustic data-driven grapheme-to-phoneme conversion in the probabilistic lexical modeling framework
    Razavi, Marzieh
    Rasipuram, Ramya
    Magimai-Doss, Mathew
    SPEECH COMMUNICATION, 2016, 80 : 1 - 21
  • [12] Grapheme-Based Automatic Speech Recognition Using KL-HMM
    Magimai-Doss, Mathew
    Rasipuram, Ramya
    Aradilla, Guillermo
    Bourlard, Herve
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 452 - 455
  • [13] ACOUSTIC UNIT DISCOVERY AND PRONUNCIATION GENERATION FROM A GRAPHEME-BASED LEXICON
    Hartmann, William
    Roy, Anindya
    Lamel, Lori
    Gauvain, Jean-Luc
    2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2013, : 380 - 385
  • [14] SILANIZED GRAPHEME-BASED NANOCOMPOSITE COATINGS ON FIBER REINFORCED COMPOSITES AGAINST THE ENVIRONMENTAL DEGRADATIONS
    Diouf, D.
    Asmatulu, R.
    PROCEEDINGS OF THE ASME INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, 2014, VOL 1, 2015,
  • [15] Lexical Modeling of ASR Errors for Robust Speech Translation
    Martucci, Giuseppe
    Cettolo, Mauro
    Negri, Matteo
    Turchi, Marco
    INTERSPEECH 2021, 2021, : 2282 - 2286
  • [16] A study of phoneme and grapheme based context-dependent ASR systems
    Dines, John
    Doss, Mathew Magimai
    MACHINE LEARNING FOR MULTIMODAL INTERACTION, 2008, 4892 : 215 - 226
  • [17] Lexical Vagueness Modeling: A SOM based Approach
    Wang, Tingting
    2012 7TH INTERNATIONAL CONFERENCE ON COMPUTING AND CONVERGENCE TECHNOLOGY (ICCCT2012), 2012, : 1115 - 1119
  • [18] A Probabilistic Lexical Approach to Textual Entailment
    Glickman, Oren
    Dagan, Ido
    Koppel, Moshe
    19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05), 2005, : 1682 - 1683
  • [19] Articulatory feature based continuous speech recognition using probabilistic lexical modeling
    Rasipuram, Ramya
    Magimai-Doss, Mathew
    COMPUTER SPEECH AND LANGUAGE, 2016, 36 : 233 - 259
  • [20] From Speech to Letters - Using a Novel Neural Network Architecture for Grapheme Based ASR
    Eyben, Florian
    Woellmer, Martin
    Schuller, Bjoern
    Graves, Alex
    2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 376 - +