Speech Recognition Using Principal Components Analysis and Neural Networks

被引:0
|
作者
Shabani, Shaham [1 ]
Norouzi, Yaser [2 ]
机构
[1] Univ Bologna, DEI Dept, Bologna, Italy
[2] Amirkabir Univ Technol, Dept Elect Engn, Tehran, Iran
关键词
component; speech recognition; feature extraction; principal components analysis (PCA); Mel frequency cepstral coefficient (MFCC); neural network;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we intend to introduce a new approach to recognize discrete speeches, specifically pre-assumed words. Our approach is mainly based on Principal Components Analysis (PCA) and Neural Networks (NN). To do so, initially we build a data base which is provided by 20 speakers who uttered each predefined word 5 times and overall 10 Persian words. Then we apply Voice Activity Detection (VAD) and eliminate the useless portions of each frame and then by computing Mel Frequency Cepstral Coefficients (MFCCs), which are our useful features in the recognition process, and then applying PCA to reduce the size of our data set, we will successfully provide the inputs of the NN block. Using PCA will enable us to provide inputs with lower size to our recognition system which is an important feature of our approach by speeding up the training procedure while keeping the accuracy as high as possible. In another words, PCA will decrease the amount of computations we have to deal with usually in most recognition systems. We use 90% of our data set to train our algorithm and the remained 10% to test our algorithm and measure the accuracy of recognition process.
引用
收藏
页码:90 / 95
页数:6
相关论文
共 50 条
  • [41] EXPERIMENTS IN DYSARTHRIC SPEECH RECOGNITION USING ARTIFICIAL NEURAL NETWORKS
    JAYARAM, G
    ABDELHAMIED, K
    JOURNAL OF REHABILITATION RESEARCH AND DEVELOPMENT, 1995, 32 (02): : 162 - 169
  • [42] A comprehensive survey on automatic speech recognition using neural networks
    Amandeep Singh Dhanjal
    Williamjeet Singh
    Multimedia Tools and Applications, 2024, 83 : 23367 - 23412
  • [43] SPEECH EMOTION RECOGNITION USING QUATERNION CONVOLUTIONAL NEURAL NETWORKS
    Muppidi, Aneesh
    Radfar, Martin
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6309 - 6313
  • [44] Implementation of Tamil speech recognition system using neural networks
    Saraswathi, S
    Geetha, TV
    APPLIED COMPUTING, PROCEEDINGS, 2004, 3285 : 169 - 176
  • [45] Speech Emotion Recognition using MFCC and Hybrid Neural Networks
    Badr, Youakim
    Mukherjee, Partha
    Thumati, Sindhu
    PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL INTELLIGENCE (IJCCI), 2021, : 366 - 373
  • [46] Speech recognition using cluster monitoring scheme and neural networks
    Yadav, Munshi
    Singh, Amit Prakash
    Singh, Tanya
    3RD INT CONF ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS, AND APPLICAT/4TH INT CONF ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL 2, 2006, : 21 - +
  • [47] Speech Recognition System Based On Phonemes Using Neural Networks
    Maheswari, N. Uma
    Kabilan, A. P.
    Venkatesh, R.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2009, 9 (07): : 148 - 153
  • [48] A comprehensive survey on automatic speech recognition using neural networks
    Dhanjal, Amandeep Singh
    Singh, Williamjeet
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (8) : 23367 - 23412
  • [49] Yoruba Gender Recognition from Speech Using Neural Networks
    Sefara, Tshephisho Joseph
    Modupe, Abiodun
    2019 6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE (ISCMI 2019), 2019, : 50 - 55
  • [50] An Effective Speech Emotion Recognition Using Artificial Neural Networks
    Anoop, V.
    Rao, P. V.
    Aruna, S.
    INTERNATIONAL PROCEEDINGS ON ADVANCES IN SOFT COMPUTING, INTELLIGENT SYSTEMS AND APPLICATIONS, ASISA 2016, 2018, 628 : 393 - 401