A SOM based 2500-Isolated-Farsi-Word speech recognizer

Authors
Shirazi, J [1]
Menhaj, MB
Affiliations
[1] Gonabad Azad Univ, Dept Elect Engn, Gonabad, Iran
[2] Amir Kabir Univ Technol, Dept Elect Engn, Tehran, Iran
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A modified 2-D Kohonen self-organizing map (MSOM) neural network is used to recognize isolated Farsi words. The network is a 10 × 15 grid of cells with a hexagonal topology, trained on 300 Farsi words. The input vectors for learning are the speech spectrum and the signal energy. The weight vectors of the cells are then fine-tuned with the supervised learning vector quantization 3 (LVQ3) technique. The cells are labeled with 28 of the 29 Farsi phonemes. At the word recognition stage, quasi-phonemes are obtained first, and the phonemes are then determined from them. Using the phonetic rules of Farsi words and the connection rules of Farsi characters, the recognized word is displayed on the monitor. To correct errors, a 2500-word dictionary is used: the determined phoneme sequence is passed to the dictionary, and the word closest to that sequence is displayed. The proposed recognizer recognizes all vowels with 100 percent accuracy and correctly recognizes 55 of 100 isolated words.
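The dictionary correction step described in the abstract can be illustrated with a minimal sketch. The abstract does not say how "closest" is measured, so the Levenshtein (edit) distance over phoneme sequences is assumed here; the function names, the phoneme symbols, and the three-word toy dictionary are all hypothetical and stand in for the 2500-word dictionary.

```python
from typing import Dict, List, Sequence


def edit_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """Levenshtein distance between two phoneme sequences.

    ASSUMPTION: the abstract does not name the distance measure;
    edit distance is used here only as a plausible stand-in.
    """
    # prev[j] holds the distance between the first i-1 items of a
    # and the first j items of b.
    prev = list(range(len(b) + 1))
    for i, pa in enumerate(a, start=1):
        curr = [i]
        for j, pb in enumerate(b, start=1):
            cost = 0 if pa == pb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def closest_word(phonemes: Sequence[str],
                 dictionary: Dict[str, List[str]]) -> str:
    """Return the dictionary word whose phoneme sequence is nearest."""
    return min(dictionary,
               key=lambda word: edit_distance(phonemes, dictionary[word]))


# Hypothetical toy dictionary (romanized entries, invented for illustration).
TOY_DICTIONARY = {
    "ketab": ["k", "e", "t", "a", "b"],
    "kabab": ["k", "a", "b", "a", "b"],
    "nan": ["n", "a", "n"],
}

# A recognized sequence with one wrong final phoneme.
recognized = ["k", "e", "t", "a", "p"]
best = closest_word(recognized, TOY_DICTIONARY)
```

With this toy data, the misrecognized final phoneme costs one substitution against "ketab" and more against the other entries, so the lookup returns "ketab". A real system might weight substitutions by acoustic confusability rather than counting every error equally.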
Pages: 589-595
Page count: 7