Context Dependent Syllable Acoustic Model for Continuous Chinese Speech Recognition

被引:0
|
作者
Wu, Hao [1 ]
Wu, Xihong [1 ]
机构
[1] Peking Univ, State Key Lab Machine Percept, Speech & Hearing Res Ctr, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The choice of basic modeling unit in building acoustic model for a continuous Mandarin speech recognition task is a very important issue [1]. Unlike traditional phoneme or Initial/Finals (IFs) units based acoustic modeling methods, which usually suffer from the limitations of less accuracy in modeling intrasyllable variations and long scale temporal dependencies, in this paper, a practicable syllable based approach is presented. In contrast with IFs, syllable can implicitly model the intrasyllable variations in good accuracy. Also, by carefully choosing context modeling schemes and parameter tying methods, syllable based acoustic model can capture longer temporal variations while keeping the complexity of model well controlled. Meanwhile, considering the data unbalanced problem, multiple sized unit model based approaches are also implemented in this research. The experiment result shows the acoustic model based on the presented syllable based approach is effective in improving the performance of the Chinese continuous speech recognition.
引用
收藏
页码:1961 / 1964
页数:4
相关论文
共 50 条
  • [1] Improved Syllable Based Acoustic Modeling by Inter-syllable Transition Model for Continuous Chinese Speech Recognition
    Chao, Hao
    Liu, Wenju
    [J]. PROCEEDINGS OF THE 2009 CHINESE CONFERENCE ON PATTERN RECOGNITION AND THE FIRST CJK JOINT WORKSHOP ON PATTERN RECOGNITION, VOLS 1 AND 2, 2009, : 654 - 657
  • [2] Context dependent initial/final acoustic modeling for continuous Chinese speech recognition
    Li, Jing
    Zheng, Fang
    Zhang, Jiyong
    Wu, Wenhu
    [J]. Qinghua Daxue Xuebao/Journal of Tsinghua University, 2004, 44 (01): : 61 - 64
  • [3] Context-dependent acoustic models for Chinese speech recognition
    Ma, B
    Huang, TY
    Xu, B
    Zhang, XJ
    Qu, F
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 455 - 458
  • [4] Trajectory clustering of syllable-length acoustic models for continuous speech recognition
    Han, Yan
    Hamalainen, Annika
    Boves, Lou
    [J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 1169 - 1172
  • [5] Deep Neural Networks for Syllable based Acoustic Modeling in Chinese Speech Recognition
    Li, Xiangang
    Hong, Caifu
    Yang, Yuning
    Wu, Xihong
    [J]. 2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
  • [6] SYLLABLE RECOGNITION FOR CONTINUOUS JAPANESE SPEECH RECOGNITION.
    Watanabe, Takao
    [J]. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1986, : 2295 - 2298
  • [7] PARALLEL ALGORITHMS FOR SYLLABLE RECOGNITION IN CONTINUOUS SPEECH
    DEMORI, R
    LAFACE, P
    MONG, Y
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1985, 7 (01) : 56 - 69
  • [8] A STUDY OF THE SYLLABLE ORIENTED RECOGNITION OF CONTINUOUS SPEECH
    TANAKA, A
    TOGAWA, F
    UEDA, T
    HAKARIDANI, M
    IWAHASHI, H
    NISHIOKA, Y
    KOBAYASHI, T
    KINPARA, S
    YAMASHITA, K
    [J]. SPEECH COMMUNICATION, 1983, 2 (2-3) : 207 - 210
  • [9] On the syllable structures of Chinese relating to speech recognition
    Zhang, JL
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2450 - 2453
  • [10] Continuous Density Hidden Markov Model for Context Dependent Hindi speech Recognition
    Sinha, Shweta
    Agrawal, S. S.
    Jain, Aruna
    [J]. 2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 1953 - 1958