PHMM BASED ASYNCHRONOUS ACOUSTIC MODEL FOR CHINESE LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION

被引：0

作者：

Wu, Hao ^{[1
]}

Wu, Xihong ^{[1
]}

Chi, Huisheng ^{[1
]}

机构：

[1] Peking Univ, Minist Educ, Key Lab Machine Percept, Hearing Res Ctr, Beijing 100871, Peoples R China

来源：

2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS | 2009年

关键词：

tonal language; multiple stream; PHMM;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we presented an asynchronous multiple stream based Chinese tonal acoustic modeling framework. In this framework, toneless phonetic units and tones are modeled separately with different acoustic features. During the training, and decoding process, a set of models are coupled together with a product hidden Markov models (PHMM) to represent whole tonal phonetic units. Through this, a compound context dependent tonal model can be generated from a few simple models. Experiments show that such model scheme generates more compact and accurate model presentation and brings improvement on the performance for large vocabulary speech recognition tasks.

引用

页码：4477 / 4480

页数：4

共 50 条

[1] Continuous Mandarin speech recognition for Chinese language with large vocabulary based on segmental probability model
Shen, JL
IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 1998, 145 (05): : 309 - 315
[2] Large Vocabulary Continuous Speech Recognition With Reservoir-Based Acoustic Models
Triefenbach, Fabian
Demuynck, Kris
Martens, Jean-Pierre
IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (03) : 311 - 315
[3] Acoustic models of the elderly for large-vocabulary continuous speech recognition
Baba, A
Yoshizawa, S
Yamada, M
Lee, A
Shikano, K
ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2004, 87 (07): : 49 - 57
[4] Unsupervised training of acoustic models for large vocabulary continuous speech recognition
Wessel, F
Ney, H
ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 307 - 310
[5] Syllable based language model for large vocabulary continuous speech recognition of Uyghur
Silamu, W. (wushour@xju.edu.cn), 1600, Tsinghua University (53):
[6] Syllable Based Language Model for Large Vocabulary Continuous Speech Recognition of Polish
Majewski, Piotr
TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2008, 5246 : 397 - 401
[7] Acoustic Nudging-Based Model for Vocabulary Reformulation in Continuous Yoruba Speech Recognition
Ajayi, Lydia Kehinde
Azeta, Ambrose
Odun-Ayo, Isaac
Aniemeka, Enem Theophilus
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2022, PT I, 2022, 13375 : 494 - 508
[8] Probabilistic Speaker-Class based Acoustic Modeling for Large Vocabulary Continuous Speech Recognition
Li, Xiangang
Su, Dan
Pang, Zaihu
Wu, Xihong
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1218 - 1221
[9] Acoustic-Phonetic Approaches for Improving Segment-Based Speech Recognition for Large Vocabulary Continuous Speech
Likitsupin, Krerksak
Punyabukkana, Proadpran
Wutiwiwatchai, Chai
Suchato, Atiwong
ENGINEERING JOURNAL-THAILAND, 2016, 20 (02): : 179 - 197
[10] DISCRIMINATIVE TRAINING OF HIERARCHICAL ACOUSTIC MODELS FOR LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION
Chang, Hung-An
Glass, James R.
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4481 - 4484

← 1 2 3 4 5 →