Recurrent neural network with backpropagation through time for speech recognition

被引:0
|
作者
Ahmad, AM [1 ]
Ismail, S [1 ]
Samaon, DF [1 ]
机构
[1] Univ Teknol Malaysia, George Town, Malaysia
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The study on speech recognition and understanding has been done for many years. In this paper, we propose a fully-connected hidden layer between the input and state nodes and the output. Besides that, we also investigate and show that this hidden layer makes the learning of complex classification tasks more efficient. We also investigate difference between LPCC and MFCC in feature extraction process. The aim of the study was to observe the difference of Arabic's alphabet like "alif" until "ya". The purpose of this research is to upgrade the people's knowledge and understanding on Arabic's alphabet or word by using Fully-Connected Recurrent Neural Network (FCRNN) and Backpropagation through Time (BPTT) learning algorithm. 6 speakers (a mixture of male and female) are trained in quiet environment. Neural Network is well-known as a technique that has the ability to classified nonlinear problem. Today, lots of researches have been done in applying Neural Network towards the solution of speech recognition [1] such as Arabic. The Arabic language offers a number of challenges for speech recognition [2]. Even though positive results have been obtained from the continuous study, research on minimizing the error rate is still gaining lots of attention. This research utilizes Recurrent Neural Network, one of Neural Network technique to observe the difference of alphabet "alif" until "ya".
引用
收藏
页码:98 / 102
页数:5
相关论文
共 50 条
  • [1] Recurrent neural network with backpropagation through time algorithm for arabic recognition
    Ismail, S
    bin Ahmad, AM
    [J]. ESM'2004: 18TH EUROPEAN SIMULATION MULTICONFERENCE: NETWORKED SIMULATIONS AND SIMULATED NETWORKS, 2004, : 29 - 33
  • [2] Hardware Architecture of Emotion Recognition from Speech Features using Recurrent Neural Network and Backpropagation Through Time
    Gunawan, Joshua
    Putri, Teresia R. S.
    Arthanto, Yashael F.
    Adiono, Trio
    [J]. 2019 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS), 2019,
  • [3] ON TRAINING RECURRENT NETWORKS WITH TRUNCATED BACKPROPAGATION THROUGH TIME IN SPEECH RECOGNITION
    Tang, Hao
    Glass, James
    [J]. 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 48 - 55
  • [4] Time Delay Recurrent Neural Network for Speech Recognition
    Liu, Boji
    Zhang, Weibin
    Xu, Xiangming
    Chen, Dongpeng
    [J]. 2019 3RD INTERNATIONAL CONFERENCE ON MACHINE VISION AND INFORMATION TECHNOLOGY (CMVIT 2019), 2019, 1229
  • [5] Suitable Recurrent Neural Network for Air Quality Prediction With Backpropagation Through Time
    Septiawan, Widya Mas
    Endah, Sukmawati Nur
    [J]. 2018 2ND INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS), 2018, : 196 - 201
  • [6] Backpropagation through time for a general class of recurrent network
    De Jesús, O
    Hagan, MT
    [J]. IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 2638 - 2643
  • [7] Stochastic Recurrent Neural Network for Speech Recognition
    Chien, Jen-Tzung
    Shen, Chen
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1313 - 1317
  • [8] Remarks on System Identification Using a Quaternion Recurrent Neural Network Trained by Backpropagation through Time
    Takahashi, Kazuhiko
    Shibata, Sora
    Hashimoto, Masafumi
    [J]. 2021 AUSTRALIAN & NEW ZEALAND CONTROL CONFERENCE (ANZCC), 2021, : 122 - 125
  • [9] DEEP RECURRENT REGULARIZATION NEURAL NETWORK FOR SPEECH RECOGNITION
    Chien, Jen-Tzung
    Lu, Tsai-Wei
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4560 - 4564
  • [10] Implementation of an autoassociative Recurrent Neural Network for speech recognition
    Cocchiglia, A
    Paplinski, A
    [J]. IEEE TENCON'97 - IEEE REGIONAL 10 ANNUAL CONFERENCE, PROCEEDINGS, VOLS 1 AND 2: SPEECH AND IMAGE TECHNOLOGIES FOR COMPUTING AND TELECOMMUNICATIONS, 1997, : 245 - 248