Algorithms and Methods for the Automatic Speech Recognition in Spanish Language using Syllables

被引：0

作者：

Oropeza Rodriguez, Jose Luis ^{[1
]}

Suarez Guerra, Sergio ^{[1
]}

机构：

[1] IPN, Ctr Invest Comp, Av Juan de Dios Batiz S-N Esq, Mexico City 07738, DF, Mexico

来源：

COMPUTACION Y SISTEMAS | 2006年 / 9卷 / 03期

关键词：

Speech recognition; Syllables recognition; Expert System; Speech processing;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This work examines the results of incorporating into Automatic Speech Recognition the syllable units for the Spanish language. Because of the boundaries between phonemes-like units its often difficult to elicit them; the use of these has not reached a good performance in Automatic Speech Recognition. In the course of the developing the experiments three approaches for the segmentation task were examined: a) the using of the Short Term Total Energy Function, b) the Energy Function of the Cepstral High Frequency (named ERO parameter), and c) a Knowledge Based System. They represent the most important contributions of this work; they showed good results for the Continuous and Discontinuous speech corpus developed in laboratory. The Knowledge Based System and Short Term Total Energy Function were used in a digit corpus where the results achieved using Short Term Total Energy Function alone reached 90.58% recognition rate. When Short Term Total Energy Function and RO parameters were used a 94.70% recognition rate was achieved. Otherwise, in the continuous speech corpus created in the laboratory the results achieved a 78.5% recognition rate using Short Term Total Energy Function and Knowledge Based System, and 80.5% recognition rate using the three approaches mentioned above. The bigram model language and Continuous Density Hidden Markov Models with three and five states incorporating three Gaussian Mixtures for state were implemented. By further including a major number of digital filters and Artificial Intelligent techniques in the training and recognition stages respectively the results can be improved even more. This research showed the potential of the syllabic unit paradigm for the Automatic Speech Recognition for the Spanish language. Finally, the inference rules in the Knowledge Based System associated with rules for splitting words in syllables in the cited language were created.

引用

页码：270 / 286

页数：17

共 50 条

[41] MACHINE RECOGNITION OF HUMAN LANGUAGE .I. AUTOMATIC SPEECH RECOGNITION
LINDGREN, N
[J]. IEEE SPECTRUM, 1965, 2 (03) : 114 - +
[42] Using Automatic Speech Recognition to Assess Thai Speech Language Fluency in the Montreal Cognitive Assessment (MoCA)
Kantithammakorn, Pimarn
Punyabukkana, Proadpran
Pratanwanich, Ploy N.
Hemrungrojn, Solaphat
Chunharas, Chaipat
Wanvarie, Dittaya
[J]. SENSORS, 2022, 22 (04)
[43] Investigation of Using Different Chinese Word Segmentation Standards and Algorithms for Automatic Speech Recognition
Ni, Chongjia
Leung, Cheung-Chi
[J]. 2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 44 - 48
[44] SEGMENTATION OF SPEECH AND RECOGNITION OF SYLLABLES WITHIN WORDS
MERCIER, G
[J]. AUTOMATISME, 1972, 17 (03): : 69 - &
[45] Automatic speech recognition of Gujarati digits using wavelet coefficients in machine learning algorithms
Pandit P.
Bhatt S.
[J]. International Journal of Innovative Computing and Applications, 2023, 14 (04) : 191 - 200
[46] Evolution of the performance of automatic speech recognition algorithms in transcribing conversational telephone speech
Padmanabhan, M
Saon, G
Zweig, G
Huang, J
Kingsbury, B
Mangu, L
[J]. IMTC/2001: PROCEEDINGS OF THE 18TH IEEE INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE, VOLS 1-3: REDISCOVERING MEASUREMENT IN THE AGE OF INFORMATICS, 2001, : 1926 - 1931
[47] Creating Language and Acoustic Models using Kaldi to Build An Automatic Speech Recognition System for Kannada Language
Yadava, Thimmaraja G.
Jayanna, H. S.
[J]. 2017 2ND IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2017, : 161 - 165
[48] Spanish Syllables Recognition by Wavelet and Cross Covariance
San Juan, Enrique
Firoozabadi, Ali Dehghan
Adasme, Pablo
Soto, Ismael
Canete, Lucio
[J]. 2019 IEEE CHILEAN CONFERENCE ON ELECTRICAL, ELECTRONICS ENGINEERING, INFORMATION AND COMMUNICATION TECHNOLOGIES (CHILECON), 2019,
[49] Comparative Evaluation of Speech Enhancement Methods for Robust Automatic Speech Recognition
Paliwal, Kuldip K.
Lyons, James G.
So, Stephen
Stark, Anthony P.
Wojcicki, Kamil K.
[J]. 2010 4TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2010,
[50] SPEECH SEGMENTATION IN CATALAN AND SPANISH - THE ROLE OF SYLLABLES AND STRESS
SEBASTIAN, N
[J]. INTERNATIONAL JOURNAL OF PSYCHOLOGY, 1992, 27 (3-4) : 57 - 57

← 1 2 3 4 5 →