A SOM based 2500 - Isolated - Farsi - Word speech recognizer

被引：0

作者：

Shirazi, J ^{[1
]}

Menhaj, MB

机构：

[1] Gonabad Azad Univ, Dept Elect Engn, Gonabad, Iran

[2] Amir Kabir Univ Technol, Dept Elect Engn, Tehran, Iran

来源：

ARTIFICIAL NEURAL NETWORKS: BIOLOGICAL INSPIRATIONS - ICANN 2005, PT 1, PROCEEDINGS | 2005年 / 3696卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A modified 2-D Kohonen Self-Organizing (MSOM) neural network is used for recognizing Farsi isolated words. The network dimension is 10*15 cells with a hexagonal topology and it is trained using 300 Farsi words. As input vectors for learning, speech spectrum and energy of signal are used. The weight vectors of the cells are then fine tuned using supervised learning vector quantization 3 (LVQ3) technique. The cells are labeled to 28 out of 29 Farsi phonemes. At the word recognition stage, the quasi phonemes are obtained. Then the phonemes are determined. Using the phonetic rules of Farsi words and the connection rules of Farsi characters, the recognized word will appear on the monitor. To remedy the errors, a 2500 word dictionary is used. The determined sequence of phonemes is given to the dictionary, and the closest word to the sequence is shown on the monitor. The proposed recognizer is able to recognize all vowels with the accuracy of 100 percent, and it also recognize correctly 55 isolated words among 100 words.

引用

页码：589 / 595

页数：7

共 50 条

[1] Appropriate Farsi speech recognizer for commanding robots (Performance evaluation of correlation-based and model-based classifiers for a Farsi isolated word recognition robotic system)
Rashedi, Ashkan
Shirvani Moghaddam, Shahriar
2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 573 - +
[2] Remote Spoken Document Retrieval using Foreground Speech Segmentation based Isolated Word Recognizer
Deepak, K. T.
Prasanna, S. R. Mahadeva
2013 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2013,
[3] A 20000-WORD SPEECH RECOGNIZER OF ITALIAN
BRANDETTI, M
FERRETTI, M
FUSI, A
MALTESE, G
SCARCI, S
VITILLARO, G
LECTURE NOTES IN COMPUTER SCIENCE, 1989, 399 : 391 - 400
[4] A 20000-WORD SPEECH RECOGNIZER OF ITALIAN
BRANDETTI, M
FERRETTI, M
FUSI, A
MALTESE, G
SCARCI, S
VITILLARO, G
RECENT ISSUES IN PATTERN ANALYSIS AND RECOGNITION, 1989, 399 : 391 - 400
[5] FPGA Implementation of Speech Recognizer for Isolated Words
Nithya, K.
Gadamsetty, Muralidhar
Kailath, Binsu J.
2019 IEEE INTERNATIONAL SYMPOSIUM ON SMART ELECTRONIC SYSTEMS (ISES 2019), 2019, : 25 - 28
[6] Evaluating unsupervised data in isolated speech recognizer
Seman, Noraini
Salleh, Siti Salwa
Hussin, Naimah Mohd
2008 INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING, VOLS 1-3, 2008, : 439 - 444
[7] ISOLATED-WORD RECOGNIZER BASED ON GRAMMAR-CONTROLLED CLASSIFICATION PROCESSES
RIVOIRA, S
TORASSO, P
PATTERN RECOGNITION, 1978, 10 (02) : 73 - 84
[8] ON THE EFFECTS OF VARYING ANALYSIS PARAMETERS ON AN LPC-BASED ISOLATED WORD RECOGNIZER
RABINER, LR
WILPON, JG
ACKENHUSEN, JG
BELL SYSTEM TECHNICAL JOURNAL, 1981, 60 (06): : 893 - 911
[9] Phrase-based translation of speech recognizer word lattices using loglinear model combination
Matusov, E
Ney, H
Schlüter, R
2005 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2005, : 110 - 115
[10] Real-time isolated word recognizer for telephone input
Yato, F.
Kuroiwa, S.
Takeda, K.
Yamamoto, S.
Owa, K.
Shozakai, M.
Journal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi), 1994, 15 (02):

← 1 2 3 4 5 →