Recurrent neural network with backpropagation through time for speech recognition

被引：0

作者：

Ahmad, AM ^{[1
]}

Ismail, S ^{[1
]}

Samaon, DF ^{[1
]}

机构：

[1] Univ Teknol Malaysia, George Town, Malaysia

来源：

IEEE INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES 2004 (ISCIT 2004), PROCEEDINGS, VOLS 1 AND 2: SMART INFO-MEDIA SYSTEMS | 2004年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The study on speech recognition and understanding has been done for many years. In this paper, we propose a fully-connected hidden layer between the input and state nodes and the output. Besides that, we also investigate and show that this hidden layer makes the learning of complex classification tasks more efficient. We also investigate difference between LPCC and MFCC in feature extraction process. The aim of the study was to observe the difference of Arabic's alphabet like "alif" until "ya". The purpose of this research is to upgrade the people's knowledge and understanding on Arabic's alphabet or word by using Fully-Connected Recurrent Neural Network (FCRNN) and Backpropagation through Time (BPTT) learning algorithm. 6 speakers (a mixture of male and female) are trained in quiet environment. Neural Network is well-known as a technique that has the ability to classified nonlinear problem. Today, lots of researches have been done in applying Neural Network towards the solution of speech recognition [1] such as Arabic. The Arabic language offers a number of challenges for speech recognition [2]. Even though positive results have been obtained from the continuous study, research on minimizing the error rate is still gaining lots of attention. This research utilizes Recurrent Neural Network, one of Neural Network technique to observe the difference of alphabet "alif" until "ya".

引用

页码：98 / 102

页数：5

共 50 条

[1] Recurrent neural network with backpropagation through time algorithm for arabic recognition
Ismail, S
bin Ahmad, AM
[J]. ESM'2004: 18TH EUROPEAN SIMULATION MULTICONFERENCE: NETWORKED SIMULATIONS AND SIMULATED NETWORKS, 2004, : 29 - 33
[2] Hardware Architecture of Emotion Recognition from Speech Features using Recurrent Neural Network and Backpropagation Through Time
Gunawan, Joshua
Putri, Teresia R. S.
Arthanto, Yashael F.
Adiono, Trio
[J]. 2019 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS), 2019,
[3] ON TRAINING RECURRENT NETWORKS WITH TRUNCATED BACKPROPAGATION THROUGH TIME IN SPEECH RECOGNITION
Tang, Hao
Glass, James
[J]. 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 48 - 55
[4] Time Delay Recurrent Neural Network for Speech Recognition
Liu, Boji
Zhang, Weibin
Xu, Xiangming
Chen, Dongpeng
[J]. 2019 3RD INTERNATIONAL CONFERENCE ON MACHINE VISION AND INFORMATION TECHNOLOGY (CMVIT 2019), 2019, 1229
[5] Suitable Recurrent Neural Network for Air Quality Prediction With Backpropagation Through Time
Septiawan, Widya Mas
Endah, Sukmawati Nur
[J]. 2018 2ND INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS), 2018, : 196 - 201
[6] Backpropagation through time for a general class of recurrent network
De Jesús, O
Hagan, MT
[J]. IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 2638 - 2643
[7] Stochastic Recurrent Neural Network for Speech Recognition
Chien, Jen-Tzung
Shen, Chen
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1313 - 1317
[8] Remarks on System Identification Using a Quaternion Recurrent Neural Network Trained by Backpropagation through Time
Takahashi, Kazuhiko
Shibata, Sora
Hashimoto, Masafumi
[J]. 2021 AUSTRALIAN & NEW ZEALAND CONTROL CONFERENCE (ANZCC), 2021, : 122 - 125
[9] DEEP RECURRENT REGULARIZATION NEURAL NETWORK FOR SPEECH RECOGNITION
Chien, Jen-Tzung
Lu, Tsai-Wei
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4560 - 4564
[10] Implementation of an autoassociative Recurrent Neural Network for speech recognition
Cocchiglia, A
Paplinski, A
[J]. IEEE TENCON'97 - IEEE REGIONAL 10 ANNUAL CONFERENCE, PROCEEDINGS, VOLS 1 AND 2: SPEECH AND IMAGE TECHNOLOGIES FOR COMPUTING AND TELECOMMUNICATIONS, 1997, : 245 - 248

← 1 2 3 4 5 →