Speech Recognition Using Principal Components Analysis and Neural Networks

Authors
Shabani, Shaham [1]
Norouzi, Yaser [2]
Affiliations
[1] Univ Bologna, DEI Dept, Bologna, Italy
[2] Amirkabir Univ Technol, Dept Elect Engn, Tehran, Iran
Keywords
speech recognition; feature extraction; principal components analysis (PCA); Mel frequency cepstral coefficient (MFCC); neural network
DOI
Not available
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we introduce a new approach to recognizing discrete speech, specifically predefined words. The approach is based mainly on Principal Components Analysis (PCA) and Neural Networks (NN). We first build a database of 10 Persian words, each uttered 5 times by each of 20 speakers. We then apply Voice Activity Detection (VAD) to eliminate the useless portions of each frame, compute Mel Frequency Cepstral Coefficients (MFCCs), which serve as the features in the recognition process, and apply PCA to reduce the size of the data set. The result is the input to the NN block. PCA lets us feed smaller inputs to the recognition system, an important feature of our approach because it speeds up training while keeping accuracy as high as possible. In other words, PCA reduces the amount of computation that most recognition systems usually require. We use 90% of the data set to train the algorithm and the remaining 10% to test it and measure the accuracy of the recognition process.
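The data set arithmetic in the abstract can be sketched as follows. This is a minimal illustration, not the authors' code: the counts (20 speakers, 10 words, 5 repetitions, 90% training share) come from the abstract, while the function names `build_index` and `split_index`, the random shuffle, and the seed are assumptions, since the abstract does not say how the test portion was chosen.

```python
import random
from itertools import product

N_SPEAKERS = 20        # from the abstract
N_WORDS = 10           # from the abstract
N_REPETITIONS = 5      # from the abstract
TRAIN_FRACTION = 0.9   # from the abstract


def build_index():
    """Enumerate one (speaker, word, repetition) tuple per recorded utterance."""
    return list(product(range(N_SPEAKERS), range(N_WORDS), range(N_REPETITIONS)))


def split_index(index, train_fraction=TRAIN_FRACTION, seed=0):
    """Shuffle the utterance index and cut it into train and test parts.

    The random shuffle and the seed are assumptions made for this sketch;
    the abstract does not describe the selection procedure.
    """
    shuffled = list(index)
    random.Random(seed).shuffle(shuffled)
    n_train = round(len(shuffled) * train_fraction)
    return shuffled[:n_train], shuffled[n_train:]


index = build_index()
train, test = split_index(index)
print(len(index), len(train), len(test))  # prints: 1000 900 100
```

Under these assumptions the data set holds 1000 utterances, of which 900 are used for training and 100 for testing. A split by speaker rather than by utterance would be an equally plausible reading of the abstract and would measure speaker-independent accuracy instead.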
Pages: 90-95
Page count: 6