Long short-term memory recurrent neural network architectures for Urdu acoustic modeling

Cited by: 1
Authors
Tehseen Zia
Usman Zahid
Affiliations
[1] COMSATS University Islamabad,
Keywords
Recurrent neural networks; Long short-term memory; Acoustic modeling; Speech recognition; Urdu
DOI
Not available
Abstract
Recurrent neural networks (RNNs) have recently achieved remarkable improvements in acoustic modeling. However, the potential of RNNs has not yet been exploited for modeling Urdu acoustics. Connectionist temporal classification and attention-based RNNs are hampered by the unavailability of a lexicon and by the computational cost of training, respectively. Therefore, we explored contemporary long short-term memory (LSTM) and gated recurrent neural networks for Urdu acoustic modeling. The efficacy of plain, deep, bidirectional and deep bidirectional network architectures is evaluated empirically. Results indicate that the deep bidirectional architecture has an advantage over the others. A word error rate of 20% was achieved on a 100-word dataset recorded by twenty speakers, a 15% improvement over the baseline single-layer LSTM. Two-layer architectures improve performance over single-layer ones, but performance degrades with additional layers. LSTM architectures were also compared with gated recurrent unit (GRU) based architectures, and LSTM was found to have an advantage over GRU.
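The abstract reports its results as a word error rate (WER). As a rough illustration of how that metric is commonly computed, the sketch below uses the standard definition: the word-level edit distance (substitutions, deletions and insertions) divided by the number of reference words. The function name `word_error_rate` and the placeholder tokens are assumptions made for this example; the record does not describe the authors' own scoring procedure.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# Placeholder tokens, not data from the paper: one substitution in five words.
wer = word_error_rate("w1 w2 w3 w4 w5", "w1 w2 wX w4 w5")
print(f"WER = {wer:.0%}")
```

Because insertions are counted against the reference length, WER computed this way can exceed 100% when the hypothesis is much longer than the reference.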
Pages: 21 - 30
Page count: 9
Related papers
50 in total
  • [1] Long short-term memory recurrent neural network architectures for Urdu acoustic modeling
    Zia, Tehseen
    Zahid, Usman
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (01) : 21 - 30
  • [2] Long Short-Term Memory Recurrent Neural Network Architectures for Large Scale Acoustic Modeling
    Sak, Hasim
    Senior, Andrew
    Beaufays, Francoise
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014 : 338 - 342
  • [3] Long Short-Term Memory Recurrent Neural Network Architectures for Melody Generation
    Mishra, Abhinav
    Tripathi, Kshitij
    Gupta, Lakshay
    Singh, Krishna Pratap
    [J]. SOFT COMPUTING FOR PROBLEM SOLVING, 2019, 817 : 41 - 55
  • [4] Long short-term memory recurrent neural network for pharmacokinetic-pharmacodynamic modeling
    Liu, Xiangyu
    Liu, Chao
    Huang, Ruihao
    Zhu, Hao
    Liu, Qi
    Mitra, Sunanda
    Wang, Yaning
    [J]. INTERNATIONAL JOURNAL OF CLINICAL PHARMACOLOGY AND THERAPEUTICS, 2021, 59 (02) : 138 - 146
  • [5] Predicting Short-term Traffic Flow by Long Short-Term Memory Recurrent Neural Network
    Tian, Yongxue
    Pan, Li
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SMART CITY/SOCIALCOM/SUSTAINCOM (SMARTCITY), 2015 : 153 - 158
  • [6] APPLICATION OF LONG SHORT-TERM MEMORY RECURRENT NEURAL NETWORK IN POPULATION PHARMACOKINETIC MODELING.
    Davydov, S.
    Tan, W.
    [J]. CLINICAL PHARMACOLOGY & THERAPEUTICS, 2022, 111 : S18 - S18
  • [8] Fundamentals of Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) Network
    Sherstinsky, Alex
    [J]. arXiv, 2018
  • [9] On extended long short-term memory and dependent bidirectional recurrent neural network
    Su, Yuanhang
    Kuo, C-C Jay
    [J]. NEUROCOMPUTING, 2019, 356 : 151 - 161
  • [10] Applying Long Short-Term Memory Recurrent Neural Network for Intrusion Detection
    Althubiti, Sara
    Nick, William
    Mason, Janelle
    Yuan, Xiaohong
    Esterline, Albert
    [J]. IEEE SOUTHEASTCON 2018, 2018