Arabic Speech Recognition by Bionic Wavelet Transform and MFCC using a Multi Layer Perceptron

被引:0
|
作者
Ben Nasr, Mohammed [1 ]
Talbi, Mourad [1 ]
Cherif, Adnane [1 ]
机构
[1] Fac Sci Tunis, Dept Elect, Tunis 1060, Tunisia
关键词
Speech Recognition; Feature Extraction; Bionic Wavelet Transforms (BWT); Mel-Frequency Cepstral Coefficients (MFCCs); Multi-Layer Perceptron (MLP);
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we have proposed a new technique of Arabic Speech Recognition (ASR) with monolocutor and a reduced vocabulary. This technique consists at first step in using our proper speech database containing Arabic speech words which are recorded by a mono-locutor. The second step consists in features extracting from those recorded words. The third step is to classify those extracted features. This extraction is performed by computing at first step, the Mel Frequency Cepstral Coefficients (MFCCs) from each recorded word, then the Bionic Wavelet Transform (BWT) is applied to the vector obtained from the concatenation of the computed MFCCs. The obtained bionic wavelet coefficients are then concatenated to construct one input of a Multi-Layer Perceptual (MLP) used for features classification. In the MLP learning and test phases, we have used eleven Arabic words and each of them is repeated twenty five times by the same locutor. A simulation program is performed to test the performance of the proposed technique and shows a classification rate equals to 99.39%. We have also introduced a module of denoising as a phase of preprocessing. In this denoising module, we have treated the case of white noise and we have used the Wiener filtering. In case of SNR=5dB, the obtained recognition rate is equals to 78.7% and in case of SNR=10dB, it is equals to 93.9%.
引用
收藏
页码:803 / 808
页数:6
相关论文
共 50 条
  • [1] Emotion Recognition in Speech Using MFCC and Wavelet Features
    Kishore, K. V. Krishna
    Satish, P. Krishna
    PROCEEDINGS OF THE 2013 3RD IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2013, : 842 - 847
  • [2] Speech Emotion Recognition Using Multi-Layer Perceptron Classifier
    Yuan, Xiaochen
    Wong, Wai Pang
    Lam, Chan Tong
    2022 IEEE 10TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND NETWORKS (ICICN 2022), 2022, : 644 - 648
  • [3] Improving speech recognition using bionic wavelet features
    Vani H.Y.
    Anusuya M.A.
    AIMS Electronics and Electrical Engineering, 2020, 4 (02): : 200 - 215
  • [4] Arabic Speech Recognition Using MFCC Feature Extraction and ANN Classification
    Wahyuni, Elvira Sukma
    2017 2ND INTERNATIONAL CONFERENCES ON INFORMATION TECHNOLOGY, INFORMATION SYSTEMS AND ELECTRICAL ENGINEERING (ICITISEE): OPPORTUNITIES AND CHALLENGES ON BIG DATA FUTURE INNOVATION, 2017, : 22 - 25
  • [5] Multitapering and a wavelet variant of MFCC in speech recognition
    Ricotti, LP
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2005, 152 (01): : 29 - 35
  • [6] Denoising Speech for MFCC Feature Extraction Using Wavelet Transformation in Speech Recognition System
    Hidayat, Risanuri
    Bejo, Agus
    Sumaryono, Sujoko
    Winursito, Anggun
    PROCEEDINGS OF 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND ELECTRICAL ENGINEERING (ICITEE), 2018, : 280 - 284
  • [7] Multi-band based recognition of spoken Arabic numerals using wavelet transform
    Alkhaldi, W
    Fakhr, W
    Hamdy, N
    2002 IEEE PROCEEDINGS OF THE NINETEENTH NATIONAL RADIO SCIENCE CONFERENCE, VOLS 1 AND 2, 2002, : 224 - 229
  • [8] The speech recognition system based on bark wavelet MFCC
    Zhang, Xue-ying
    Bai, Jing
    Liang, Wu-zhou
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 780 - +
  • [9] MFCC and vector quantization for Arabic fricatives Speech/Speaker recognition
    Chelali, Fatma Zohra
    Djeradi, Amar
    2012 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2012, : 284 - 289
  • [10] Arabic Handwriting Recognition Using Gabor Wavelet Transform and SVM
    Elzobi, Moftah
    Al-Hamadi, Ayoub
    Saeed, Anwar
    Dings, Laslo
    PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 2154 - 2158