Smoothed unit HMM in Mandarin speech recognition

Authors
He, Q [1]
Mao, SY [1]
Zhang, YW [1]
Affiliations
[1] Beijing Univ Aeronaut & Astronaut, Dept EE, Beijing 100083, Peoples R China
Keywords
speech recognition; HMM; demi-syllable; SUHMM;
DOI
Not available
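Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification codes
0808; 0809;
Abstract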
The base unit in Mandarin speech recognition can be the phoneme, the demi-syllable, or the syllable. A demi-syllable system has fewer HMMs and needs less computation, so it is suitable for real-time systems. However, because it describes the acoustic properties of the speech signal poorly, it generally performs worse than a syllable system. A system based on syllables or phonemes (tri-phone or di-phone), by contrast, has many more HMMs and needs massive computation in training and recognition. This paper proposes a compromise scheme. The new system is based on demi-syllables, but the two demi-syllable HMMs are connected into a full-syllable HMM in the training phase, so that data from the whole length of the syllable are used and smoothing between the two demi-syllables is introduced. This can increase system performance without increasing the number of HMMs, and it suits real-time systems with a DSP kernel.
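The abstract does not give the details of the SUHMM training or smoothing procedure, so the following is only a minimal sketch of the one step it does describe: connecting two demi-syllable HMMs into a single full-syllable HMM. It assumes left-to-right models stored as a transition matrix plus a per-state exit probability, and it assumes the second model is always entered at its first state. The function name `concat_left_to_right` and all probability values are hypothetical.

```python
def concat_left_to_right(trans_a, exit_a, trans_b, exit_b):
    """Join two left-to-right HMMs into one composite model.

    trans_x[i][j] is the probability of moving from state i to state j
    inside model x; exit_x[i] is the probability of leaving model x
    from state i. Each row of trans_x plus exit_x[i] sums to 1.
    """
    n_a, n_b = len(trans_a), len(trans_b)
    n = n_a + n_b
    trans = [[0.0] * n for _ in range(n)]
    for i in range(n_a):
        for j in range(n_a):
            trans[i][j] = trans_a[i][j]
        # Assumption: model B is always entered at its first state,
        # so the exit mass of model A is routed to that state.
        trans[i][n_a] = exit_a[i]
    for i in range(n_b):
        for j in range(n_b):
            trans[n_a + i][n_a + j] = trans_b[i][j]
    # Only states of model B can leave the composite model.
    exit_probs = [0.0] * n_a + list(exit_b)
    return trans, exit_probs


# Hypothetical 2-state models for the two demi-syllables of one syllable.
initial_trans = [[0.6, 0.4], [0.0, 0.7]]
initial_exit = [0.0, 0.3]
final_trans = [[0.5, 0.5], [0.0, 0.8]]
final_exit = [0.0, 0.2]

syllable_trans, syllable_exit = concat_left_to_right(
    initial_trans, initial_exit, final_trans, final_exit
)
```

Because the composite model keeps the two parameter blocks separate, training on whole-syllable data in this layout would still update only the two demi-syllable models, which is consistent with the abstract's claim that the number of HMMs does not grow.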
Pages: 792-795
Page count: 4