Long short-term memory recurrent neural network architectures for Urdu acoustic modeling

被引:91
|
作者
Zia, Tehseen [1 ]
Zahid, Usman [1 ]
机构
[1] COMSATS Univ Islamabad, Islamabad, Pakistan
关键词
Recurrent neural networks; Long short-term memory; Acoustic modeling; Speech recognition; Urdu;
D O I
10.1007/s10772-018-09573-7
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recurrent neural networks (RNNs) have achieved remarkable improvements in acoustic modeling recently. However, the potential of RNNs have not been utilized for modeling Urdu acoustics. The connectionist temporal classification and attention based RNNs are suffered due to the unavailability of lexicon and computational cost of training, respectively. Therefore, we explored contemporary long short-term memory and gated recurrent neural networks Urdu acoustic modeling. The efficacies of plain, deep, bidirectional and deep-directional network architectures are evaluated empirically. Results indicate that deep-directional has an advantage over the other architectures. A word error rate of 20% was achieved on a hundred words dataset of twenty speakers. It shows 15% improvement over the baseline single-layer LSTMs. It has been observed that two-layer architectures can improve performance over single-layer, however the performance is degraded with further layers. LSTM architectures were compared with gated recurrent unit (GRU) based architectures and it was found that LSTM has an advantage over GRU.
引用
收藏
页码:21 / 30
页数:10
相关论文
共 50 条
  • [1] Long short-term memory recurrent neural network architectures for Urdu acoustic modeling
    Tehseen Zia
    Usman Zahid
    [J]. International Journal of Speech Technology, 2019, 22 : 21 - 30
  • [2] Long Short-Term Memory Recurrent Neural Network Architectures for Large Scale Acoustic Modeling
    Sak, Hasim
    Senior, Andrew
    Beaufays, Francoise
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 338 - 342
  • [3] Long Short-Term Memory Recurrent Neural Network Architectures for Melody Generation
    Mishra, Abhinav
    Tripathi, Kshitij
    Gupta, Lakshay
    Singh, Krishna Pratap
    [J]. SOFT COMPUTING FOR PROBLEM SOLVING, 2019, 817 : 41 - 55
  • [4] Long short-term memory recurrent neural network for pharmacokinetic-pharmacodynamic modeling
    Liu, Xiangyu
    Liu, Chao
    Huang, Ruihao
    Zhu, Hao
    Liu, Qi
    Mitra, Sunanda
    Wang, Yaning
    [J]. INTERNATIONAL JOURNAL OF CLINICAL PHARMACOLOGY AND THERAPEUTICS, 2021, 59 (02) : 138 - 146
  • [5] Predicting Short-term Traffic Flow by Long Short-Term Memory Recurrent Neural Network
    Tian, Yongxue
    Pan, Li
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SMART CITY/SOCIALCOM/SUSTAINCOM (SMARTCITY), 2015, : 153 - 158
  • [6] APPLICATION OF LONG SHORT-TERM MEMORY RECURRENT NEURAL NETWORK IN POPULATION PHARMACOKINETIC MODELING.
    Davydov, S.
    Tan, W.
    [J]. CLINICAL PHARMACOLOGY & THERAPEUTICS, 2022, 111 : S18 - S18
  • [8] On extended long short-term memory and dependent bidirectional recurrent neural network
    Su, Yuanhang
    Kuo, C-C Jay
    [J]. NEUROCOMPUTING, 2019, 356 : 151 - 161
  • [9] Stock Price Prediction With Long Short-Term Memory Recurrent Neural Network
    Jeenanunta, Chawalit
    Chaysiri, Rujira
    Thong, Laksmey
    [J]. 2018 INTERNATIONAL CONFERENCE ON EMBEDDED SYSTEMS AND INTELLIGENT TECHNOLOGY & INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY FOR EMBEDDED SYSTEMS (ICESIT-ICICTES), 2018,
  • [10] Long Short-Term Memory Recurrent Neural Network for Tidal Level Forecasting
    Yang, Cheng-Hong
    Wu, Chih-Hsien
    Hsieh, Chih-Min
    [J]. IEEE ACCESS, 2020, 8 : 159389 - 159401