Features extraction, modeling and training strategies in continuous speech recognition for Romanian language

被引：0

作者：

Dumitru, CO ^{[1
]}

Gavat, I ^{[1
]}

机构：

[1] Univ Bucharest, Fac Elect Telecommun & Informat, Bucharest, Romania

来源：

Eurocon 2005: The International Conference on Computer as a Tool, Vol 1 and 2 , Proceedings | 2005年

关键词：

HMM; MFCC; PLP; LPC; context dependent modeling; continuous speech;

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper describes continuous speech recognition experiments for Romanian language, by using HMM modeling. The following questions are to be discussed: the realization of a new front-end reconsidering linear prediction, the enhancement of recognition rates by context dependent modeling, the evaluation of training strategies ensuring speaker independence of the recognition process without speaker adaptation procedures, by speaker selection for training. The experiments lead to a development of the initial system with a promising front-end based on PLP coefficients, second ranked for the recognition performance obtained, near the first ranked front-end based on mel-frequency cepstral coefficients (MFCC), but far better as the last ranked, based on simple linear prediction. Concerning the implemented algorithm for context dependent modeling, it permits in all situations enhanced recognition rates. The experiments made with gender speaker selection enhanced under certain conditions the recognition rate, proving good generalization properties especially by training with the male speakers database.

引用

页码：1425 / 1428

页数：4

共 50 条

[41] Audio-Visual Speech Modeling for Continuous Speech Recognition
Dupont, Stephane
Luettin, Juergen
IEEE TRANSACTIONS ON MULTIMEDIA, 2000, 2 (03) : 141 - 151
[42] Language Modeling for Speech Recognition of Spoken Cantonese
Yeung, Yu Ting
Cao, Houwei
Zheng, N. H.
Lee, Tan
Ching, P. C.
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1570 - 1573
[43] Statistical Phonetic Analysis of the Romanian Language for Speech Recognition and Synthesis Tasks
Stanescu, Miruna
Buzo, Andi
Cucu, Horia
Burileanu, Corneliu
PROCEEDINGS ELMAR-2012, 2012, : 219 - 222
[44] POSITION INFORMATION FOR LANGUAGE MODELING IN SPEECH RECOGNITION
Chiu, Hsuan-Sheng
Chen, Guan-Yu
Lee, Chun-Jen
Chen, Berlin
2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 101 - 104
[45] Latent semantic language modeling for speech recognition
Bellegarda, JR
MATHEMATICAL FOUNDATIONS OF SPEECH AND LANGUAGE PROCESSING, 2004, 138 : 73 - 103
[46] Efficient Structured Language Modeling for Speech Recognition
Rastrow, Ariya
Dredze, Mark
Khudanpur, Sanjeev
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1658 - 1661
[47] Topic extraction based on continuous speech recognition in broadcast news speech
Ohtsuki, K
Matsuoka, T
Matsunaga, S
Furui, S
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2002, E85D (07) : 1138 - 1144
[48] Study on the integration of speech and language processing in recognition of Chinese continuous speech
Zhao, L.
Zhou, C.R.
Wu, Z.Y.
Shengxue Xuebao/Acta Acustica, 2001, 26 (01): : 73 - 78
[49] Feature sets in continuous speech recognition for the Portuguese language
dos Santos, SCB
Alcaim, A
ITS '98 PROCEEDINGS - SBT/IEEE INTERNATIONAL TELECOMMUNICATIONS SYMPOSIUM, VOLS 1 AND 2, 1998, : 126 - 129
[50] SySRA: A System of a Continuous Speech Recognition in Arab Language
Abdelhamid, Samir
Bouguechal, Noureddine
PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 11, 2006, 11 : 207 - +

← 1 2 3 4 5 →