A SOM based 2500-Isolated-Farsi-Word speech recognizer

Cited by: 0
Authors
Shirazi, J [1]
Menhaj, MB
Affiliations
[1] Gonabad Azad Univ, Dept Elect Engn, Gonabad, Iran
[2] Amir Kabir Univ Technol, Dept Elect Engn, Tehran, Iran
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A modified 2-D Kohonen self-organizing map (MSOM) neural network is used to recognize isolated Farsi words. The network is a 10 × 15 grid of cells with a hexagonal topology, trained on 300 Farsi words. The input vectors for learning are the speech spectrum and the signal energy. The weight vectors of the cells are then fine-tuned using the supervised learning vector quantization 3 (LVQ3) technique. The cells are labeled with 28 of the 29 Farsi phonemes. At the word recognition stage, quasi-phonemes are obtained first, and then the phonemes are determined. Using the phonetic rules of Farsi words and the connection rules of Farsi characters, the recognized word is displayed on the monitor. To correct errors, a 2500-word dictionary is used: the determined phoneme sequence is passed to the dictionary, and the word closest to that sequence is displayed. The proposed recognizer recognizes all vowels with 100 percent accuracy and correctly recognizes 55 of 100 isolated words.
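The dictionary correction step described in the abstract can be illustrated with a minimal sketch. The abstract does not say how "closest" is measured, so the Levenshtein (edit) distance over phoneme sequences used here is an assumption, not the authors' stated method. The names edit_distance, closest_word, and TOY_DICTIONARY are hypothetical, and the three-entry dictionary with rough Latin transliterations is illustrative only; it is not taken from the paper's 2500-word dictionary.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences (assumed metric)."""
    # prev[j] holds the distance between the first i-1 items of a
    # and the first j items of b.
    prev = list(range(len(b) + 1))
    for i, item_a in enumerate(a, start=1):
        curr = [i]
        for j, item_b in enumerate(b, start=1):
            cost = 0 if item_a == item_b else 1
            curr.append(min(
                prev[j] + 1,         # delete item_a
                curr[j - 1] + 1,     # insert item_b
                prev[j - 1] + cost,  # substitute or match
            ))
        prev = curr
    return prev[-1]


def closest_word(phonemes, dictionary):
    """Return the dictionary word whose phoneme sequence is nearest."""
    return min(dictionary,
               key=lambda word: edit_distance(phonemes, dictionary[word]))


# Hypothetical toy dictionary: word -> phoneme sequence.
TOY_DICTIONARY = {
    "salam": ["s", "a", "l", "a", "m"],
    "ketab": ["k", "e", "t", "a", "b"],
    "madar": ["m", "a", "d", "a", "r"],
}

# A recognized sequence with one wrong vowel is mapped to its nearest entry.
recognized = ["s", "a", "l", "e", "m"]
corrected = closest_word(recognized, TOY_DICTIONARY)
```

In this sketch a single misrecognized phoneme costs one edit, so the lookup recovers the intended entry as long as no other dictionary word is equally close; ties are broken by dictionary order.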
Pages: 589-595
Page count: 7