Speech Recognition Using Principal Components Analysis and Neural Networks

Cited by: 0

Authors:

Shabani, Shaham [1]

Norouzi, Yaser [2]

Affiliations:

[1] Univ Bologna, DEI Dept, Bologna, Italy

[2] Amirkabir Univ Technol, Dept Elect Engn, Tehran, Iran

Source:

2016 IEEE 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS (IS) | 2016

Keywords:

component; speech recognition; feature extraction; principal components analysis (PCA); Mel frequency cepstral coefficient (MFCC); neural network;

DOI: Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

In this paper, we introduce a new approach to recognizing discrete speech, specifically words from a predefined vocabulary. Our approach is based mainly on Principal Components Analysis (PCA) and Neural Networks (NN). We first build a database from 20 speakers, each of whom uttered each of 10 predefined Persian words 5 times. We then apply Voice Activity Detection (VAD) to eliminate the useless portions of each frame, compute Mel Frequency Cepstral Coefficients (MFCCs), which serve as the features for recognition, and apply PCA to reduce the size of the data set; the result is the input to the NN block. Using PCA lets us feed smaller inputs to the recognition system, which is an important feature of our approach because it speeds up training while keeping accuracy as high as possible. In other words, PCA reduces the amount of computation that most recognition systems usually require. We use 90% of the data set to train the algorithm and the remaining 10% to test it and measure recognition accuracy.
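The data flow described in the abstract (feature vectors, PCA reduction, 90%/10% split) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: random vectors stand in for MFCC features, the VAD and NN stages are omitted, PCA is done with plain power iteration, and `N_FEATURES`, `N_COMPONENTS`, `pca_fit`, and `pca_transform` are hypothetical names and values not taken from the paper. Fitting PCA on the training portion only is also an assumption; the abstract does not say where the split happens.

```python
import random

N_SPEAKERS, N_WORDS, N_REPS = 20, 10, 5  # counts stated in the abstract
N_FEATURES = 12     # assumed feature-vector length (not from the paper)
N_COMPONENTS = 3    # assumed number of retained principal components


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def pca_fit(rows, k, iters=200, seed=0):
    """Return (mean, components) for the top-k principal directions.

    Power iteration with deflation on the sample covariance matrix.
    Assumes k does not exceed the rank of the data.
    """
    n, d = len(rows), len(rows[0])
    mean = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - mean[j] for j in range(d)] for r in rows]
    cov = [[sum(c[a] * c[b] for c in centered) / (n - 1) for b in range(d)]
           for a in range(d)]
    rng = random.Random(seed)
    components = []
    for _ in range(k):
        v = [rng.uniform(-1.0, 1.0) for _ in range(d)]
        for _ in range(iters):
            w = [dot(cov[a], v) for a in range(d)]
            norm = dot(w, w) ** 0.5
            v = [x / norm for x in w]
        eigval = dot(v, [dot(cov[a], v) for a in range(d)])
        components.append(v)
        # Deflate: subtract the direction just found from the covariance.
        cov = [[cov[a][b] - eigval * v[a] * v[b] for b in range(d)]
               for a in range(d)]
    return mean, components


def pca_transform(rows, mean, components):
    """Project centered rows onto the principal directions."""
    return [[dot([x - m for x, m in zip(r, mean)], comp)
             for comp in components] for r in rows]


rng = random.Random(42)
# One entry per (speaker, word, repetition): 20 * 10 * 5 utterances.
utterances = [(s, w, r) for s in range(N_SPEAKERS)
              for w in range(N_WORDS) for r in range(N_REPS)]
# Synthetic stand-in for per-utterance MFCC features (assumption).
scales = [8.0, 4.0, 2.0] + [0.5] * (N_FEATURES - 3)
features = [[sc * rng.gauss(0.0, 1.0) for sc in scales] for _ in utterances]

# 90% of the data for training, the remaining 10% for testing.
indices = list(range(len(utterances)))
rng.shuffle(indices)
n_train = len(indices) * 9 // 10
train_x = [features[i] for i in indices[:n_train]]
test_x = [features[i] for i in indices[n_train:]]

# Fit PCA on the training portion, then reduce both portions.
mean, components = pca_fit(train_x, N_COMPONENTS)
train_z = pca_transform(train_x, mean, components)
test_z = pca_transform(test_x, mean, components)
print(len(train_z), len(test_z), len(train_z[0]))
```

In this sketch, `train_z` and `test_z` play the role of the reduced inputs that would be passed to the NN block; each vector shrinks from `N_FEATURES` to `N_COMPONENTS` values, which is the source of the training speed-up the abstract refers to.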


Pages: 90-95

Page count: 6
