Algorithms and Methods for the Automatic Speech Recognition in Spanish Language using Syllables

被引:0
|
作者
Oropeza Rodriguez, Jose Luis [1 ]
Suarez Guerra, Sergio [1 ]
机构
[1] IPN, Ctr Invest Comp, Av Juan de Dios Batiz S-N Esq, Mexico City 07738, DF, Mexico
来源
COMPUTACION Y SISTEMAS | 2006年 / 9卷 / 03期
关键词
Speech recognition; Syllables recognition; Expert System; Speech processing;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work examines the results of incorporating into Automatic Speech Recognition the syllable units for the Spanish language. Because of the boundaries between phonemes-like units its often difficult to elicit them; the use of these has not reached a good performance in Automatic Speech Recognition. In the course of the developing the experiments three approaches for the segmentation task were examined: a) the using of the Short Term Total Energy Function, b) the Energy Function of the Cepstral High Frequency (named ERO parameter), and c) a Knowledge Based System. They represent the most important contributions of this work; they showed good results for the Continuous and Discontinuous speech corpus developed in laboratory. The Knowledge Based System and Short Term Total Energy Function were used in a digit corpus where the results achieved using Short Term Total Energy Function alone reached 90.58% recognition rate. When Short Term Total Energy Function and RO parameters were used a 94.70% recognition rate was achieved. Otherwise, in the continuous speech corpus created in the laboratory the results achieved a 78.5% recognition rate using Short Term Total Energy Function and Knowledge Based System, and 80.5% recognition rate using the three approaches mentioned above. The bigram model language and Continuous Density Hidden Markov Models with three and five states incorporating three Gaussian Mixtures for state were implemented. By further including a major number of digital filters and Artificial Intelligent techniques in the training and recognition stages respectively the results can be improved even more. This research showed the potential of the syllabic unit paradigm for the Automatic Speech Recognition for the Spanish language. Finally, the inference rules in the Knowledge Based System associated with rules for splitting words in syllables in the cited language were created.
引用
收藏
页码:270 / 286
页数:17
相关论文
共 50 条
  • [1] Speech recognition using energy parameters to classify syllables in the Spanish language
    Guerra, SS
    Rodríguez, JLO
    Riveron, EMF
    Nazuno, JF
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2005, 3773 : 161 - 170
  • [2] Speech recognition using energy, MFCCs and Rho parameters to classify syllables in the Spanish language
    Suarez Guerra, Sergio
    Oropeza Rodriguez, Jose Luis
    Felipe Riveron, Edgardo Manuel
    Figueroa Nazuno, Jesus
    [J]. MICAI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4293 : 1057 - +
  • [3] Automatic Disordered Syllables Repetition Recognition in Continuous Speech Using CWT and Correlation
    Codello, Ireneusz
    Kuniszyk-Jozkowiak, Wieslawa
    Smolka, Elzbieta
    Kobus, Adam
    [J]. PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS CORES 2013, 2013, 226 : 867 - 876
  • [4] Comparative Experiments of Different Aspects of Syllables for Robust Automatic Speech Recognition
    Azmi, Mohamed Mostafa
    Tolba, Hesham
    [J]. ICCES: 2008 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS, 2007, : 88 - 91
  • [5] Using morphemes in language modeling and automatic speech recognition of amharic
    [J]. Tachbelie, Martha Yifiru, 1600, Cambridge University Press (20):
  • [6] Using morphemes in language modeling and automatic speech recognition of Amharic
    Tachbelie, Martha Yifiru
    Abate, Solomon Teferra
    Menzel, Wolfgang
    [J]. NATURAL LANGUAGE ENGINEERING, 2014, 20 (02) : 235 - 259
  • [7] Agglutinative Language Speech Recognition Using Automatic Allophone Deriving
    Xu Ji
    Pan Jielin
    Yan Yonghong
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2016, 25 (02) : 328 - 333
  • [8] Agglutinative Language Speech Recognition Using Automatic Allophone Deriving
    XU Ji
    PAN Jielin
    YAN Yonghong
    [J]. Chinese Journal of Electronics, 2016, 25 (02) : 328 - 333
  • [9] Using Syllables as Acoustic Units for Spontaneous Speech Recognition
    Hejtmanek, Jan
    [J]. TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 299 - 305
  • [10] Speech Recognition with Syllables and Concepts
    De Palma, Paul
    Wooters, Charles
    [J]. 6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS, AND THE 13TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS, 2012, : 5 - 10