Features extraction, modeling and training strategies in continuous speech recognition for Romanian language

被引:0
|
作者
Dumitru, CO [1 ]
Gavat, I [1 ]
机构
[1] Univ Bucharest, Fac Elect Telecommun & Informat, Bucharest, Romania
关键词
HMM; MFCC; PLP; LPC; context dependent modeling; continuous speech;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes continuous speech recognition experiments for Romanian language, by using HMM modeling. The following questions are to be discussed: the realization of a new front-end reconsidering linear prediction, the enhancement of recognition rates by context dependent modeling, the evaluation of training strategies ensuring speaker independence of the recognition process without speaker adaptation procedures, by speaker selection for training. The experiments lead to a development of the initial system with a promising front-end based on PLP coefficients, second ranked for the recognition performance obtained, near the first ranked front-end based on mel-frequency cepstral coefficients (MFCC), but far better as the last ranked, based on simple linear prediction. Concerning the implemented algorithm for context dependent modeling, it permits in all situations enhanced recognition rates. The experiments made with gender speaker selection enhanced under certain conditions the recognition rate, proving good generalization properties especially by training with the male speakers database.
引用
收藏
页码:1425 / 1428
页数:4
相关论文
共 50 条
  • [1] Features extraction and training strategies in continuous speech recognition for Romanian language
    Dumitru, Corneliu Octavian
    Gavat, Inge
    [J]. ICINCO 2006: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS: SIGNAL PROCESSING, SYSTEMS MODELING AND CONTROL, 2006, : 114 - 121
  • [2] Features extraction methods applied for continuous speech recognition in Romanian
    Dumitru, C. O.
    Gavat, I.
    [J]. PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON OPTIMIZATION OF ELECTRICAL AND ELECTRONIC EQUIPMENT, VOL IV, 2006, : 115 - 120
  • [3] Influence of features extraction methods in performance of continuous speech recognition for Romanian
    Dumitru, C. O.
    Gavat, Inge
    [J]. 2007 14TH INTERNATIONAL WORKSHOP ON SYSTEMS, SIGNALS, & IMAGE PROCESSING & EURASIP CONFERENCE FOCUSED ON SPEECH & IMAGE PROCESSING, MULTIMEDIA COMMUNICATIONS & SERVICES, 2007, : 40 - 43
  • [4] A comparative study of feature extraction methods applied to continuous speech recognition in Romanian language
    Dumitru, Corneliu Octavian
    Gavat, Inge
    [J]. PROCEEDINGS ELMAR-2006, 2006, : 115 - +
  • [5] NN and hybrid strategies for speech recognition in romanian language
    Dumitru, Corneliu-Octavian
    Gavat, Inge
    [J]. ANNIP 2008: PROCEEDINGS OF THE ARTIFICIAL NEURAL NETWORKS AND INTELLIGENT INFORMATION PROCESSING, 2008, : 51 - 60
  • [6] CONTINUOUS TOPIC LANGUAGE MODELING FOR SPEECH RECOGNITION
    Chueh, Chuang-Hua
    Chien, Jen-Tzung
    [J]. 2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 193 - 196
  • [7] Progresses in continuous speech recognition based on statistical modelling for Romanian language
    Dumitru, Corneliu Octavian
    Gavat, Inge
    Militaru, Diana
    [J]. ICINCO 2007: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS, VOL SPSMC: SIGNAL PROCESSING, SYSTEMS MODELING AND CONTROL, 2007, : 262 - 267
  • [8] Hybrid speech recognition system with discriminative training applied for Romanian language
    Gavat, I
    Zirra, M
    Cula, O
    [J]. MELECON '98 - 9TH MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, VOLS 1 AND 2, 1998, : 11 - 15
  • [9] Modeling and training strategies for language recognition systems
    Duroselle, Raphael
    Sahidullah, Md
    Jouvet, Denis
    Illina, Irina
    [J]. INTERSPEECH 2021, 2021, : 1494 - 1498
  • [10] Syllable modeling in continuous speech recognition for Tamil language
    Thangarajan, R.
    Natarajan, A.
    Selvam, M.
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2009, 12 (01) : 47 - 57