A SOM based 2500 - Isolated - Farsi - Word speech recognizer

被引:0
|
作者
Shirazi, J [1 ]
Menhaj, MB
机构
[1] Gonabad Azad Univ, Dept Elect Engn, Gonabad, Iran
[2] Amir Kabir Univ Technol, Dept Elect Engn, Tehran, Iran
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A modified 2-D Kohonen Self-Organizing (MSOM) neural network is used for recognizing Farsi isolated words. The network dimension is 10*15 cells with a hexagonal topology and it is trained using 300 Farsi words. As input vectors for learning, speech spectrum and energy of signal are used. The weight vectors of the cells are then fine tuned using supervised learning vector quantization 3 (LVQ3) technique. The cells are labeled to 28 out of 29 Farsi phonemes. At the word recognition stage, the quasi phonemes are obtained. Then the phonemes are determined. Using the phonetic rules of Farsi words and the connection rules of Farsi characters, the recognized word will appear on the monitor. To remedy the errors, a 2500 word dictionary is used. The determined sequence of phonemes is given to the dictionary, and the closest word to the sequence is shown on the monitor. The proposed recognizer is able to recognize all vowels with the accuracy of 100 percent, and it also recognize correctly 55 isolated words among 100 words.
引用
收藏
页码:589 / 595
页数:7
相关论文
共 50 条
  • [31] THE EFFECTS OF SELECTED SIGNAL-PROCESSING TECHNIQUES ON THE PERFORMANCE OF A FILTER-BANK-BASED ISOLATED WORD RECOGNIZER
    DAUTRICH, BA
    RABINER, LR
    MARTIN, TB
    BELL SYSTEM TECHNICAL JOURNAL, 1983, 62 (05): : 1311 - 1336
  • [32] Isolated Word Speech Rcogniton Based On HRSF and Improved DTW Algorithm
    Hu Xiao-hui
    Zhao Gansen
    Zhan Lv-jun
    Xue Yun
    Zhou Weixing
    2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY WORKSHOPS (WI-IAT WORKSHOPS 2012), VOL 3, 2012, : 270 - 273
  • [33] DESIGN OF A MICROPROCESSOR BASED ISOLATED WORD SPEECH RECOGNITION SYSTEM.
    Sullivan, John M.
    Lee, Arnold Y.
    Journal Water Pollution Control Federation, 1980, : 40 - 44
  • [34] Research on Isolated Word Speech Recognition Based on Biomimetic Pattern Recognition
    Lu, Bin
    Xu, Jing-jing
    2009 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, VOL II, PROCEEDINGS, 2009, : 436 - 439
  • [35] An isolated word speech recognition system based on kohonen neural network
    Figueiredo, FL
    Violaro, F
    VTH BRAZILIAN SYMPOSIUM ON NEURAL NETWORKS, PROCEEDINGS, 1998, : 151 - 156
  • [36] A model distance maximizing framework for speech recognizer-based speech enhancement
    BabaAli, Bagher
    Sameti, Hossein
    Falk, Tiago H.
    AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2011, 65 (02) : 99 - 106
  • [37] A Real-Time FPGA-Based 20 000-Word Speech Recognizer With Optimized DRAM Access
    Choi, Young-Kyu
    You, Kisun
    Choi, Jungwook
    Sung, Wonyong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2010, 57 (08) : 2119 - 2131
  • [38] A FARSI PART-OF-SPEECH TAGGER BASED on MARKOV MODEL
    Mohseni, Mahdi
    Motalebi, Hasan
    Minaei-bidgoli, Behrouz
    Shokrollahi-far, Mahmoud
    APPLIED COMPUTING 2008, VOLS 1-3, 2008, : 1588 - +
  • [39] EFFECTS OF SPEAKER ACCENT ON THE PERFORMANCE OF SPEAKER-INDEPENDENT, ISOLATED-WORD RECOGNIZER
    GUPTA, V
    MERMELSTEIN, P
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1982, 71 (06): : 1581 - 1587
  • [40] EPITOME - GENERATION OF SURGICAL PATHOLOGY REPORTS USING A 5,000-WORD SPEECH RECOGNIZER
    TISCHLER, AS
    MARTIN, MR
    AMERICAN JOURNAL OF CLINICAL PATHOLOGY, 1989, 92 (04) : S44 - S47