A SOM based 2500 - Isolated - Farsi - Word speech recognizer

被引:0
|
作者
Shirazi, J [1 ]
Menhaj, MB
机构
[1] Gonabad Azad Univ, Dept Elect Engn, Gonabad, Iran
[2] Amir Kabir Univ Technol, Dept Elect Engn, Tehran, Iran
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A modified 2-D Kohonen Self-Organizing (MSOM) neural network is used for recognizing Farsi isolated words. The network dimension is 10*15 cells with a hexagonal topology and it is trained using 300 Farsi words. As input vectors for learning, speech spectrum and energy of signal are used. The weight vectors of the cells are then fine tuned using supervised learning vector quantization 3 (LVQ3) technique. The cells are labeled to 28 out of 29 Farsi phonemes. At the word recognition stage, the quasi phonemes are obtained. Then the phonemes are determined. Using the phonetic rules of Farsi words and the connection rules of Farsi characters, the recognized word will appear on the monitor. To remedy the errors, a 2500 word dictionary is used. The determined sequence of phonemes is given to the dictionary, and the closest word to the sequence is shown on the monitor. The proposed recognizer is able to recognize all vowels with the accuracy of 100 percent, and it also recognize correctly 55 isolated words among 100 words.
引用
收藏
页码:589 / 595
页数:7
相关论文
共 50 条
  • [41] Applications of Virtual-Evidence based Speech Recognizer Training
    Subramanya, Amarnag
    Bilmes, Jeff
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2562 - 2565
  • [42] Development of isolated word speech recognition system
    Lipeika, A
    Lipeikiene, J
    Telksnys, L
    INFORMATICA, 2002, 13 (01) : 37 - 46
  • [43] A Speech Recognizer Based Intelligent Agent For Ambient Intelligent Environments
    El-Faham, Ayman
    Hagras, Hani
    INTELLIGENT ENVIRONMENTS 2009, 2009, 2 : 247 - 256
  • [44] ISOLATED WORD SPEECH RECOGNITION USING A NEURAL NETWORK BASED SOURCE MODEL
    LEE, GE
    TATTERSALL, GD
    SMYTH, SG
    BT TECHNOLOGY JOURNAL, 1992, 10 (03): : 38 - 47
  • [45] A nonuniform segmentation approach based on Kohonen network for isolated word speech recognition
    Figueiredo, FL
    Violaro, F
    ITS '98 PROCEEDINGS - SBT/IEEE INTERNATIONAL TELECOMMUNICATIONS SYMPOSIUM, VOLS 1 AND 2, 1998, : 130 - 134
  • [46] "A speech recognizer" a tool to recognize the high clarity speech signal based on existing speech using ISCA
    Velammal, M. Navaneetha
    Kumar, P. Nirmal
    ANALOG INTEGRATED CIRCUITS AND SIGNAL PROCESSING, 2019, 98 (01) : 41 - 58
  • [47] “A speech recognizer” a tool to recognize the high clarity speech signal based on existing speech using ISCA
    M. Navaneetha Velammal
    P. Nirmal Kumar
    Analog Integrated Circuits and Signal Processing, 2019, 98 : 41 - 58
  • [48] An Alternative sEMG based Isolated Word Subvocal Speech Recognition System based on Interpolation Functions
    Yang, Meng
    Zhang, Ming
    2020 INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2020), 2020, : 306 - 309
  • [49] Intelligibility Assessment and Speech Recognizer Word Accuracy Rate Prediction for Dysarthric Speakers in a Factor Analysis Subspace
    Martinez, David
    Lleida, Eduardo
    Green, Phil
    Christensen, Heidi
    Ortega, Alfonso
    Miguel, Antonio
    ACM TRANSACTIONS ON ACCESSIBLE COMPUTING, 2015, 6 (03) : 1 - 21
  • [50] A ROBUST SPEAKER-INDEPENDENT ISOLATED WORD HMM RECOGNIZER FOR OPERATION OVER THE TELEPHONE NETWORK
    SONG, JM
    SAMOUELIAN, A
    SPEECH COMMUNICATION, 1993, 13 (3-4) : 287 - 295