Features extraction, modeling and training strategies in continuous speech recognition for Romanian language

被引:0
|
作者
Dumitru, CO [1 ]
Gavat, I [1 ]
机构
[1] Univ Bucharest, Fac Elect Telecommun & Informat, Bucharest, Romania
关键词
HMM; MFCC; PLP; LPC; context dependent modeling; continuous speech;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes continuous speech recognition experiments for Romanian language, by using HMM modeling. The following questions are to be discussed: the realization of a new front-end reconsidering linear prediction, the enhancement of recognition rates by context dependent modeling, the evaluation of training strategies ensuring speaker independence of the recognition process without speaker adaptation procedures, by speaker selection for training. The experiments lead to a development of the initial system with a promising front-end based on PLP coefficients, second ranked for the recognition performance obtained, near the first ranked front-end based on mel-frequency cepstral coefficients (MFCC), but far better as the last ranked, based on simple linear prediction. Concerning the implemented algorithm for context dependent modeling, it permits in all situations enhanced recognition rates. The experiments made with gender speaker selection enhanced under certain conditions the recognition rate, proving good generalization properties especially by training with the male speakers database.
引用
收藏
页码:1425 / 1428
页数:4
相关论文
共 50 条
  • [41] Audio-Visual Speech Modeling for Continuous Speech Recognition
    Dupont, Stephane
    Luettin, Juergen
    IEEE TRANSACTIONS ON MULTIMEDIA, 2000, 2 (03) : 141 - 151
  • [42] Language Modeling for Speech Recognition of Spoken Cantonese
    Yeung, Yu Ting
    Cao, Houwei
    Zheng, N. H.
    Lee, Tan
    Ching, P. C.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1570 - 1573
  • [43] Statistical Phonetic Analysis of the Romanian Language for Speech Recognition and Synthesis Tasks
    Stanescu, Miruna
    Buzo, Andi
    Cucu, Horia
    Burileanu, Corneliu
    PROCEEDINGS ELMAR-2012, 2012, : 219 - 222
  • [44] POSITION INFORMATION FOR LANGUAGE MODELING IN SPEECH RECOGNITION
    Chiu, Hsuan-Sheng
    Chen, Guan-Yu
    Lee, Chun-Jen
    Chen, Berlin
    2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 101 - 104
  • [45] Latent semantic language modeling for speech recognition
    Bellegarda, JR
    MATHEMATICAL FOUNDATIONS OF SPEECH AND LANGUAGE PROCESSING, 2004, 138 : 73 - 103
  • [46] Efficient Structured Language Modeling for Speech Recognition
    Rastrow, Ariya
    Dredze, Mark
    Khudanpur, Sanjeev
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1658 - 1661
  • [47] Topic extraction based on continuous speech recognition in broadcast news speech
    Ohtsuki, K
    Matsuoka, T
    Matsunaga, S
    Furui, S
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2002, E85D (07) : 1138 - 1144
  • [48] Study on the integration of speech and language processing in recognition of Chinese continuous speech
    Zhao, L.
    Zhou, C.R.
    Wu, Z.Y.
    Shengxue Xuebao/Acta Acustica, 2001, 26 (01): : 73 - 78
  • [49] Feature sets in continuous speech recognition for the Portuguese language
    dos Santos, SCB
    Alcaim, A
    ITS '98 PROCEEDINGS - SBT/IEEE INTERNATIONAL TELECOMMUNICATIONS SYMPOSIUM, VOLS 1 AND 2, 1998, : 126 - 129
  • [50] SySRA: A System of a Continuous Speech Recognition in Arab Language
    Abdelhamid, Samir
    Bouguechal, Noureddine
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 11, 2006, 11 : 207 - +