Features extraction, modeling and training strategies in continuous speech recognition for Romanian language

被引:0
|
作者
Dumitru, CO [1 ]
Gavat, I [1 ]
机构
[1] Univ Bucharest, Fac Elect Telecommun & Informat, Bucharest, Romania
关键词
HMM; MFCC; PLP; LPC; context dependent modeling; continuous speech;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes continuous speech recognition experiments for Romanian language, by using HMM modeling. The following questions are to be discussed: the realization of a new front-end reconsidering linear prediction, the enhancement of recognition rates by context dependent modeling, the evaluation of training strategies ensuring speaker independence of the recognition process without speaker adaptation procedures, by speaker selection for training. The experiments lead to a development of the initial system with a promising front-end based on PLP coefficients, second ranked for the recognition performance obtained, near the first ranked front-end based on mel-frequency cepstral coefficients (MFCC), but far better as the last ranked, based on simple linear prediction. Concerning the implemented algorithm for context dependent modeling, it permits in all situations enhanced recognition rates. The experiments made with gender speaker selection enhanced under certain conditions the recognition rate, proving good generalization properties especially by training with the male speakers database.
引用
收藏
页码:1425 / 1428
页数:4
相关论文
共 50 条
  • [31] Integration of speech and language processing in Chinese continuous speech recognition
    ZHAO Li ZOU Cairong WU Zhenyang(Department of Radio Engineering
    Chinese Journal of Acoustics, 2002, (04) : 343 - 351
  • [32] SPEECH ENHANCEMENT AND FEATURES COMPENSATION ALGORITHMS FOR CONTINUOUS SPEECH RECOGNITION
    Arcos, Christian
    Grivet, Marco
    Alcaim, Abraham
    2014 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (CHINASIP), 2014, : 27 - 31
  • [33] Discriminative training of language models for speech recognition
    Kuo, KHJ
    Fosler-Lussier, E
    Jiang, H
    Lee, CH
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 325 - 328
  • [34] Subspace Gaussian mixture based language modeling for large vocabulary continuous speech recognition
    Sun, Ri Hyon
    Chol, Ri Jong
    SPEECH COMMUNICATION, 2020, 117 : 21 - 27
  • [35] A new combined modeling of continuous speech recognition
    Han, ZB
    Jia, L
    Zhang, S
    Xu, B
    2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 597 - 602
  • [36] Improved lexicon modeling for continuous speech recognition
    Yun, SJ
    Oh, YH
    Shin, GC
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1827 - 1830
  • [37] Context modeling and clustering in continuous speech recognition
    Junqua, JC
    Vassallo, L
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2262 - 2265
  • [38] PROTOLOGOS, SYSTEM FOR ROMANIAN LANGUAGE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU)
    Militaru, Diana
    Gavat, Inge
    Dumitru, Octavian
    Zaharia, Tiberiu
    Segarceanu, Svetlana
    FROM SPEECH PROCESSING TO SPOKEN LANGUAGE TECHNOLOGY, 2009, : 21 - 32
  • [39] Tone Modeling for Continuous Mandarin Speech Recognition
    Cao, Yang
    Zhang, Shuwu
    Huang, Taiyi
    Xu, Bo
    International Journal of Speech Technology, 2004, 7 (2-3) : 115 - 128
  • [40] Joint acoustic and language modeling for speech recognition
    Chien, Jen-Tzung
    Chueh, Chuang-Hua
    SPEECH COMMUNICATION, 2010, 52 (03) : 223 - 235