Long short-term memory recurrent neural network architectures for Urdu acoustic modeling

被引:91
|
作者
Zia, Tehseen [1 ]
Zahid, Usman [1 ]
机构
[1] COMSATS Univ Islamabad, Islamabad, Pakistan
关键词
Recurrent neural networks; Long short-term memory; Acoustic modeling; Speech recognition; Urdu;
D O I
10.1007/s10772-018-09573-7
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recurrent neural networks (RNNs) have achieved remarkable improvements in acoustic modeling recently. However, the potential of RNNs have not been utilized for modeling Urdu acoustics. The connectionist temporal classification and attention based RNNs are suffered due to the unavailability of lexicon and computational cost of training, respectively. Therefore, we explored contemporary long short-term memory and gated recurrent neural networks Urdu acoustic modeling. The efficacies of plain, deep, bidirectional and deep-directional network architectures are evaluated empirically. Results indicate that deep-directional has an advantage over the other architectures. A word error rate of 20% was achieved on a hundred words dataset of twenty speakers. It shows 15% improvement over the baseline single-layer LSTMs. It has been observed that two-layer architectures can improve performance over single-layer, however the performance is degraded with further layers. LSTM architectures were compared with gated recurrent unit (GRU) based architectures and it was found that LSTM has an advantage over GRU.
引用
收藏
页码:21 / 30
页数:10
相关论文
共 50 条
  • [31] Misfire Detection Using Crank Speed and Long Short-Term Memory Recurrent Neural Network
    Wang, Xinwei
    Zhang, Pan
    Gao, Wenzhi
    Li, Yong
    Wang, Yanjun
    Pang, Haoqian
    [J]. ENERGIES, 2022, 15 (01)
  • [32] Long Short-Term Memory Recurrent Neural Network (LSTM-RNN) Power Forecasting
    Alsabban, Maha S.
    Salem, Nema
    Malik, Hebatullah M.
    [J]. APPEEC 2021: 2021 13TH IEEE PES ASIA PACIFIC POWER & ENERGY ENGINEERING CONFERENCE (APPEEC), 2021,
  • [33] The chaotic nature of temper in humans: A long short-term memory recurrent neural network model
    Zifan, Ali
    Gharibzadeh, Shahriar
    [J]. MEDICAL HYPOTHESES, 2006, 67 (03) : 658 - 661
  • [34] Performance prediction of fuel cells using long short-term memory recurrent neural network
    Zheng, Lu
    Hou, Yongping
    Zhang, Tao
    Pan, Xiangmin
    [J]. INTERNATIONAL JOURNAL OF ENERGY RESEARCH, 2021, 45 (06) : 9141 - 9161
  • [35] Convolutional Grid Long Short-Term Memory Recurrent Neural Network for Automatic Speech Recognition
    Xue, Jiabin
    Zheng, Tieran
    Han, Jiqing
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2019, PT V, 2019, 1143 : 718 - 726
  • [36] Forecasting of FOREX Price Trend Using Recurrent Neural Network - Long Short-term Memory
    Dobrovolny, Michal
    Soukal, Ivan
    Lim, Kok Cheng
    Selamat, Ali
    Krejcar, Ondrej
    [J]. HRADEC ECONOMIC DAYS 2020, VOL 10, PT 1, 2020, 10 : 95 - 103
  • [37] Terahertz Spectral Recognition Based on Bidirectional Long Short-Term Memory Recurrent Neural Network
    Yu Hao-yue
    Shen Tao
    Zhu Yan
    Liu Ying-li
    Yu Zheng-tao
    [J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2019, 39 (12) : 3737 - 3742
  • [38] A novel recurrent neural network algorithm with long short-term memory model for futures trading
    Gu, Quan
    Lu, Na
    Liu, Lin
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (04) : 4477 - 4484
  • [39] APPLICATION OF RECURRENT NEURAL NETWORK LONG SHORT-TERM MEMORY MODEL ON EARLY KICK DETECTION
    Wang, Junzhe
    Ozbayoglu, Evren M.
    [J]. PROCEEDINGS OF ASME 2022 41ST INTERNATIONAL CONFERENCE ON OCEAN, OFFSHORE & ARCTIC ENGINEERING, OMAE2022, VOL 10, 2022,
  • [40] An FPGA Implementation of a Long Short-Term Memory Neural Network
    Ferreira, Joao Canas
    Fonseca, Jose
    [J]. 2016 INTERNATIONAL CONFERENCE ON RECONFIGURABLE COMPUTING AND FPGAS (RECONFIG16), 2016,