Smoothed unit HMM in mandarin speech recognition

Cited by: 0

Authors:

He, Q [1]

Mao, SY [1]

Zhang, YW [1]

Affiliations:

[1] Beijing Univ Aeronaut & Astronaut, Dept EE, Beijing 100083, Peoples R China

Source:

2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III | 2000

Keywords:

speech recognition; HMM; demi-syllable; SUHMM;

DOI:

Not available

Chinese Library Classification:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

The base unit in Mandarin speech recognition can be the phoneme, the demi-syllable, or the syllable. A demi-syllable system has fewer HMMs and needs less computation, so it is suitable for real-time systems. However, because it describes the acoustic properties of the speech signal poorly, it generally performs worse than a syllable system. A system based on syllables or phonemes (tri-phone or di-phone), in contrast, has many more HMMs and needs massive computation in training and recognition. This paper proposes a compromise scheme. The new system is based on demi-syllables, but the two demi-syllable HMMs are connected into a full-syllable HMM in the training phase, so the data over the whole length of the syllable are used and smoothing between the two demi-syllables is introduced. This can increase system performance without increasing the number of HMMs, and it suits real-time systems with a DSP kernel.
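The idea of connecting two demi-syllable HMMs into one syllable HMM can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the model topology, so the left-to-right structure, the state counts, the `stay` probability, and the function names `left_to_right` and `concatenate` are all assumptions made for illustration. The smoothing step mentioned in the abstract is not specified there and is not modeled here.

```python
import numpy as np


def left_to_right(n_states, stay=0.6):
    """Hypothetical left-to-right HMM transition matrix (an assumption).

    Each state loops with probability `stay` and otherwise advances.
    The last row sums to `stay`; the missing mass (1 - stay) is the
    probability of leaving the model.
    """
    A = np.zeros((n_states, n_states))
    for i in range(n_states):
        A[i, i] = stay
        if i + 1 < n_states:
            A[i, i + 1] = 1.0 - stay
    return A


def concatenate(A_initial, A_final):
    """Join two demi-syllable models into one syllable-length model.

    The exit probability of the first model's last state becomes the
    transition into the first state of the second model.
    """
    n1, n2 = A_initial.shape[0], A_final.shape[0]
    A = np.zeros((n1 + n2, n1 + n2))
    A[:n1, :n1] = A_initial          # states of the first demi-syllable
    A[n1:, n1:] = A_final            # states of the second demi-syllable
    exit_prob = 1.0 - A_initial[-1].sum()
    A[n1 - 1, n1] = exit_prob        # bridge between the two halves
    return A


# Illustrative sizes only: a 2-state and a 3-state demi-syllable model.
syllable_A = concatenate(left_to_right(2), left_to_right(3))
print(syllable_A.shape)
```

Training on the joined matrix lets a single pass over the whole syllable update the parameters of both halves, while the number of stored models stays at the demi-syllable count.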


Pages: 792-795

Page count: 4
