Speech Recognition Using Principal Components Analysis and Neural Networks

Cited by: 0
Authors
Shabani, Shaham [1]
Norouzi, Yaser [2]
Affiliations
[1] Univ Bologna, DEI Dept, Bologna, Italy
[2] Amirkabir Univ Technol, Dept Elect Engn, Tehran, Iran
Keywords
component; speech recognition; feature extraction; principal components analysis (PCA); Mel frequency cepstral coefficient (MFCC); neural network
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper, we introduce a new approach to recognizing discrete speech, specifically words from a predefined vocabulary. The approach is based mainly on Principal Components Analysis (PCA) and Neural Networks (NN). We first build a database of 10 Persian words, recorded by 20 speakers who each uttered every predefined word 5 times. We then apply Voice Activity Detection (VAD) to eliminate the useless portions of each frame, compute Mel Frequency Cepstral Coefficients (MFCCs), which serve as the features for recognition, and apply PCA to reduce the size of the data set; the result provides the inputs to the NN block. PCA lets us feed lower-dimensional inputs to the recognition system, an important feature of our approach: it speeds up training while keeping accuracy as high as possible. In other words, PCA decreases the amount of computation that most recognition systems usually require. We use 90% of the data set to train the algorithm and the remaining 10% to test it and measure the recognition accuracy.
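The PCA reduction and the 90%/10% split described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the utterance count (20 speakers x 10 words x 5 repetitions) comes from the abstract, while the feature length `N_FEATURES`, the number of retained components `N_COMPONENTS`, the helper names, and the random placeholder data standing in for MFCC vectors are all assumptions made for the example. VAD, MFCC extraction, and the neural network itself are omitted.

```python
import numpy as np

# Counts taken from the abstract: 20 speakers x 10 words x 5 repetitions.
N_SPEAKERS, N_WORDS, N_REPS = 20, 10, 5
N_UTTERANCES = N_SPEAKERS * N_WORDS * N_REPS  # 1000 utterances

# Assumptions (not given in the abstract): length of the per-utterance
# feature vector and the number of principal components kept.
N_FEATURES = 260
N_COMPONENTS = 30


def fit_pca(X, n_components):
    """Estimate the mean and top principal directions from the rows of X."""
    mean = X.mean(axis=0)
    # Rows of vt are principal directions, ordered by decreasing variance.
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt[:n_components]


def apply_pca(X, mean, components):
    """Project the rows of X onto the principal directions."""
    return (X - mean) @ components.T


def split_90_10(n, rng):
    """Shuffle indices 0..n-1 and split them 90% / 10%."""
    idx = rng.permutation(n)
    cut = n * 9 // 10
    return idx[:cut], idx[cut:]


rng = np.random.default_rng(0)
# Placeholder features: random numbers standing in for MFCC vectors.
X = rng.normal(size=(N_UTTERANCES, N_FEATURES))

train_idx, test_idx = split_90_10(N_UTTERANCES, rng)

# Fit PCA on the training portion only, then project both portions.
mean, components = fit_pca(X[train_idx], N_COMPONENTS)
Z_train = apply_pca(X[train_idx], mean, components)
Z_test = apply_pca(X[test_idx], mean, components)

print(Z_train.shape, Z_test.shape)
```

In this sketch the reduced matrices `Z_train` and `Z_test` would play the role of the lower-dimensional inputs to the NN block; fitting PCA on the training portion alone is a design choice of the example, made so that the held-out 10% does not influence the projection.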
Pages: 90-95
Number of pages: 6
Related papers
  • [21] Recognition and Processing of Speech Signals Using Neural Networks
    Douglas O’Shaughnessy
    Circuits, Systems, and Signal Processing, 2019, 38 : 3454 - 3481
  • [22] Speech emotion recognition using spiking neural networks
    Buscicchio, Cosimo A.
    Gorecki, Przemyslaw
    Caponetti, Laura
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2006, 4203 : 38 - 46
  • [23] Arabic speech recognition using recurrent neural networks
    El Choubassi, MM
    El Khoury, HE
    Alagha, CEJ
    Skaf, JA
    Al-Alaoui, MA
    PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, 2003, : 543 - 547
  • [24] Using Neural Networks for a Discriminant Speech Recognition System
    Schiopu, Daniela
    Oprea, Mihaela
    2014 INTERNATIONAL CONFERENCE ON DEVELOPMENT AND APPLICATION SYSTEMS (DAS), 2014, : 165 - 169
  • [25] Using neural networks and LPCC to improve speech recognition
    Zbancioc, M
    Costin, M
    SCS 2003: INTERNATIONAL SYMPOSIUM ON SIGNALS, CIRCUITS AND SYSTEMS, VOLS 1 AND 2, PROCEEDINGS, 2003, : 445 - 448
  • [26] Remarks on emotional speech recognition using neural networks
    Takahashi, Kazuhiko
    Nakatsu, Ryohei
    Nippon Kikai Gakkai Ronbunshu, C Hen/Transactions of the Japan Society of Mechanical Engineers, Part C, 2002, 68 (08): : 2339 - 2345
  • [27] Isolated speech recognition using artificial neural networks
    Polur, PD
    Zhou, RB
    Yang, J
    Adnani, F
    Hobson, RS
    PROCEEDINGS OF THE 23RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-4: BUILDING NEW BRIDGES AT THE FRONTIERS OF ENGINEERING AND MEDICINE, 2001, 23 : 1731 - 1734
  • [28] Recognition and Processing of Speech Signals Using Neural Networks
    O'Shaughnessy, Douglas
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2019, 38 (08) : 3454 - 3481
  • [29] A comparison of different spectral analysis models for speech recognition using neural networks
    Zebulum, RS
    Vellasco, M
    Perelmuter, G
    Pacheco, MA
    PROCEEDINGS OF THE 39TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS I-III, 1996, : 1428 - 1431
  • [30] Irish Sign Language Recognition Using Principal Component Analysis and Convolutional Neural Networks
    Oliveira, Marlon
    Chatbri, Houssem
    Little, Suzanne
    Ferstl, Ylva
    O'Connor, Noel E.
    Sutherland, Alistair
    2017 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING - TECHNIQUES AND APPLICATIONS (DICTA), 2017, : 749 - 756