Persian Phoneme Recognition using Long Short-Term Memory Neural Network

被引:0
|
作者
Daneshvar, Mohammad [1 ]
Veisi, Hadi [1 ]
机构
[1] Univ Tehran, Fac New Sci & Technol, Tehran, Iran
关键词
component; Long Short-Term Memory (LSTM); Farsdat; Persian Speech Recognition; Connectionist Temporal Classification (CTC);
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently Recurrent Neural Networks (RNNs) have shown impressive performance in sequence classification tasks. In this paper we apply Long Short-Term Memory (LSTM) network on Persian phoneme recognition. For years Hidden Markov Model (HMM) was the dominant technique in speech recognition system but after introducing LSTM, RNNs outperformed HHM-based methods. We apply LSTM and deep LSTM on FARSDAT speech database and find that both LSTM and deep LSTM outperforms HMM in Persian phoneme recognition. Our evaluation show that deep LSTM achieves 17.55% error in FARSDAT phoneme recognition on test set which to our knowledge is the best recorded result.
引用
收藏
页码:111 / 115
页数:5
相关论文
共 50 条
  • [1] A Persian part of speech tagging system using the long short-term memory neural network
    Koochari, Abbas
    Gharahbagh, Abdorreza Alavi
    Hajihashemi, Vahid
    [J]. 2020 6TH IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS), 2020,
  • [2] Recognition of Spontaneous Conversational Speech using Long Short-Term Memory Phoneme Predictions
    Woellmer, Martin
    Eyben, Florian
    Schuller, Bjoern
    Rigoll, Gerhard
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 1946 - 1949
  • [3] Persian Optical Character Recognition Using Deep Bidirectional Long Short-Term Memory
    Khosrobeigi, Zohreh
    Veisi, Hadi
    Hoseinzade, Ehsan
    Shabanian, Hanieh
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (22):
  • [4] Long Short-Term Memory Recurrent Neural Network for Automatic Speech Recognition
    Oruh, Jane
    Viriri, Serestina
    Adegun, Adekanmi
    [J]. IEEE ACCESS, 2022, 10 : 30069 - 30079
  • [5] HUMAN ACTIVITY RECOGNITION USING LONG SHORT-TERM MEMORY NETWORK
    Warunsin, Kulwarun
    Promjiraprawat, Kamphol
    Chitsobhuk, Orachat
    [J]. INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2023, 19 (03): : 973 - 990
  • [6] GRAPHEME-TO-PHONEME CONVERSION USING LONG SHORT-TERM MEMORY RECURRENT NEURAL NETWORKS
    Rao, Kanishka
    Peng, Fuchun
    Sak, Hasim
    Beaufays, Francoise
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4225 - 4229
  • [7] Vowel speech recognition from rat electroencephalography using long short-term memory neural network
    Ham, Jinsil
    Yoo, Hyun-Joon
    Kim, Jongin
    Lee, Boreom
    [J]. PLOS ONE, 2022, 17 (06):
  • [8] Music Emotion Recognition Based on Long Short-Term Memory and Forward Neural Network
    Liu, Aizhen
    [J]. EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2022, 9 (04):
  • [9] Long short-term memory and convolutional neural network for abnormal driving behaviour recognition
    Jia, Shuo
    Hui, Fei
    Li, Shining
    Zhao, Xiangmo
    Khattak, Asad J.
    [J]. IET INTELLIGENT TRANSPORT SYSTEMS, 2020, 14 (05) : 306 - 312
  • [10] A Convolution Bidirectional Long Short-Term Memory Neural Network for Driver Emotion Recognition
    Du, Guanglong
    Wang, Zhiyao
    Gao, Boyu
    Mumtaz, Shahid
    Abualnaja, Khamael M.
    Du, Cuifeng
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (07) : 4570 - 4578