Algorithms and Methods for the Automatic Speech Recognition in Spanish Language using Syllables

被引：0

作者：

Oropeza Rodriguez, Jose Luis ^{[1
]}

Suarez Guerra, Sergio ^{[1
]}

机构：

[1] IPN, Ctr Invest Comp, Av Juan de Dios Batiz S-N Esq, Mexico City 07738, DF, Mexico

来源：

COMPUTACION Y SISTEMAS | 2006年 / 9卷 / 03期

关键词：

Speech recognition; Syllables recognition; Expert System; Speech processing;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This work examines the results of incorporating into Automatic Speech Recognition the syllable units for the Spanish language. Because of the boundaries between phonemes-like units its often difficult to elicit them; the use of these has not reached a good performance in Automatic Speech Recognition. In the course of the developing the experiments three approaches for the segmentation task were examined: a) the using of the Short Term Total Energy Function, b) the Energy Function of the Cepstral High Frequency (named ERO parameter), and c) a Knowledge Based System. They represent the most important contributions of this work; they showed good results for the Continuous and Discontinuous speech corpus developed in laboratory. The Knowledge Based System and Short Term Total Energy Function were used in a digit corpus where the results achieved using Short Term Total Energy Function alone reached 90.58% recognition rate. When Short Term Total Energy Function and RO parameters were used a 94.70% recognition rate was achieved. Otherwise, in the continuous speech corpus created in the laboratory the results achieved a 78.5% recognition rate using Short Term Total Energy Function and Knowledge Based System, and 80.5% recognition rate using the three approaches mentioned above. The bigram model language and Continuous Density Hidden Markov Models with three and five states incorporating three Gaussian Mixtures for state were implemented. By further including a major number of digital filters and Artificial Intelligent techniques in the training and recognition stages respectively the results can be improved even more. This research showed the potential of the syllabic unit paradigm for the Automatic Speech Recognition for the Spanish language. Finally, the inference rules in the Knowledge Based System associated with rules for splitting words in syllables in the cited language were created.

引用

页码：270 / 286

页数：17

共 50 条

[1] Speech recognition using energy parameters to classify syllables in the Spanish language
Guerra, SS
Rodríguez, JLO
Riveron, EMF
Nazuno, JF
[J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2005, 3773 : 161 - 170
[2] Speech recognition using energy, MFCCs and Rho parameters to classify syllables in the Spanish language
Suarez Guerra, Sergio
Oropeza Rodriguez, Jose Luis
Felipe Riveron, Edgardo Manuel
Figueroa Nazuno, Jesus
[J]. MICAI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4293 : 1057 - +
[3] Automatic Disordered Syllables Repetition Recognition in Continuous Speech Using CWT and Correlation
Codello, Ireneusz
Kuniszyk-Jozkowiak, Wieslawa
Smolka, Elzbieta
Kobus, Adam
[J]. PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS CORES 2013, 2013, 226 : 867 - 876
[4] Comparative Experiments of Different Aspects of Syllables for Robust Automatic Speech Recognition
Azmi, Mohamed Mostafa
Tolba, Hesham
[J]. ICCES: 2008 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS, 2007, : 88 - 91
[5] Using morphemes in language modeling and automatic speech recognition of amharic
[J]. Tachbelie, Martha Yifiru, 1600, Cambridge University Press (20):
[6] Using morphemes in language modeling and automatic speech recognition of Amharic
Tachbelie, Martha Yifiru
Abate, Solomon Teferra
Menzel, Wolfgang
[J]. NATURAL LANGUAGE ENGINEERING, 2014, 20 (02) : 235 - 259
[7] Agglutinative Language Speech Recognition Using Automatic Allophone Deriving
Xu Ji
Pan Jielin
Yan Yonghong
[J]. CHINESE JOURNAL OF ELECTRONICS, 2016, 25 (02) : 328 - 333
[8] Agglutinative Language Speech Recognition Using Automatic Allophone Deriving
XU Ji
PAN Jielin
YAN Yonghong
[J]. Chinese Journal of Electronics, 2016, 25 (02) : 328 - 333
[9] Using Syllables as Acoustic Units for Spontaneous Speech Recognition
Hejtmanek, Jan
[J]. TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 299 - 305
[10] Speech Recognition with Syllables and Concepts
De Palma, Paul
Wooters, Charles
[J]. 6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS, AND THE 13TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS, 2012, : 5 - 10

← 1 2 3 4 5 →