Features extraction, modeling and training strategies in continuous speech recognition for Romanian language

Cited by: 0

Authors:

Dumitru, CO [1]

Gavat, I [1]

Affiliations:

[1] Univ Bucharest, Fac Elect Telecommun & Informat, Bucharest, Romania

Source:

Eurocon 2005: The International Conference on Computer as a Tool, Vol 1 and 2, Proceedings | 2005

Keywords:

HMM; MFCC; PLP; LPC; context dependent modeling; continuous speech;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

This paper describes continuous speech recognition experiments for the Romanian language using HMM modeling. Three questions are addressed: the realization of a new front-end that reconsiders linear prediction; the enhancement of recognition rates through context-dependent modeling; and the evaluation of training strategies that ensure speaker independence of the recognition process through speaker selection for training, without speaker adaptation procedures. The experiments extend the initial system with a promising front-end based on PLP coefficients, which ranks second in recognition performance, close to the first-ranked front-end based on mel-frequency cepstral coefficients (MFCC) and far better than the last-ranked front-end based on simple linear prediction. The implemented algorithm for context-dependent modeling yields enhanced recognition rates in all situations. The experiments with gender-based speaker selection enhanced the recognition rate under certain conditions, showing good generalization properties, especially when training with the male speakers database.

Pages: 1425-1428

Page count: 4
