Model-based Articulatory Phonetic Features for Improved Speech Recognition

被引:0
|
作者
Huang, Guangpu [1 ]
Er, Meng Joo [1 ]
机构
[1] Nanyang Technol Univ, Comp Vis Lab, Singapore 639798, Singapore
来源
2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2012年
关键词
NEURAL-NETWORKS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe a neural based articulatory phonetic inversion model to improve the recognition of the acoustically varying vowels and the syllable initial plosives. The model uses a set of continuous valued articulatory phonetic features (APFs) to explore the interactions between the motor control of articulators and the acoustic phonetic events. We demonstrate that the neural model gives more accurate and robust recognition performance on the TIMIT sentences. The model offers two salient properties: it allows asynchronous feature changes at phoneme boundaries, and it accounts for the dual aspects of human speech production and perception through a heuristic learning algorithm during APFs mapping.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] ARTICULATORY TIMING IN SPEECH PRODUCTION . ARTICULATORY DISTINCTIVE FEATURES
    COKER, CH
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1970, 47 (1P1): : 94 - +
  • [42] Unsupervised noise model estimation for model-based robust speech recognition
    Graciarena, M
    Franco, H
    ASRU'03: 2003 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING ASRU '03, 2003, : 351 - 356
  • [43] Within and cross-corpus speech emotion recognition using latent topic model-based features
    Mohit Shah
    Chaitali Chakrabarti
    Andreas Spanias
    EURASIP Journal on Audio, Speech, and Music Processing, 2015
  • [44] Within and cross-corpus speech emotion recognition using latent topic model-based features
    Shah, Mohit
    Chakrabarti, Chaitali
    Spanias, Andreas
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2015,
  • [45] Improved Fault Recognition for Model-Based Diagnostic Systems
    Koetter, Matthias
    Pungs, Andreas
    Wolkenar, Bernd
    INTERNATIONALER MOTORENKONGRESS 2015: MIT NUTZFAHRZEUGMOTOREN - SPEZIAL, 2015, : 499 - 514
  • [46] SUBMODULAR DATA SELECTION WITH ACOUSTIC AND PHONETIC FEATURES FOR AUTOMATIC SPEECH RECOGNITION
    Ni, Chongjia
    Wang, Lei
    Liu, Haibo
    Leung, Cheung-Chi
    Lu, Li
    Ma, Bin
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4629 - 4633
  • [47] Multi-resolution phonetic/segmental features and models for HMM-based speech recognition
    Vaseghi, S
    Harte, N
    Milner, B
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1263 - 1266
  • [48] Predictive model-based compensation schemes for robust speech recognition
    Gales, MJF
    SPEECH COMMUNICATION, 1998, 25 (1-3) : 49 - 74
  • [49] Model-Based Wiener filter for noise robust speech recognition
    Arakawa, Takayuki
    Tsujikawa, Masanori
    Isotani, Ryosuke
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 537 - 540
  • [50] Speech synthesis based on a physiological articulatory model
    Fang, Qiang
    Dang, Jianwu
    Chinese Spoken Language Processing, Proceedings, 2006, 4274 : 211 - 222