Arabic Speech Recognition by Bionic Wavelet Transform and MFCC using a Multi Layer Perceptron

Cited by: 0

Authors:

Ben Nasr, Mohammed [1]

Talbi, Mourad [1]

Cherif, Adnane [1]

Affiliations:

[1] Fac Sci Tunis, Dept Elect, Tunis 1060, Tunisia

Source:

2012 6TH INTERNATIONAL CONFERENCE ON SCIENCES OF ELECTRONICS, TECHNOLOGIES OF INFORMATION AND TELECOMMUNICATIONS (SETIT) | 2012

Keywords:

Speech Recognition; Feature Extraction; Bionic Wavelet Transforms (BWT); Mel-Frequency Cepstral Coefficients (MFCCs); Multi-Layer Perceptron (MLP);

DOI:

Not available

Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

In this paper, we propose a new technique for Arabic Speech Recognition (ASR) with a single speaker and a reduced vocabulary. The first step uses our own speech database of Arabic words recorded by a single speaker. The second step extracts features from the recorded words, and the third step classifies those features. Feature extraction first computes the Mel-Frequency Cepstral Coefficients (MFCCs) of each recorded word; the Bionic Wavelet Transform (BWT) is then applied to the vector obtained by concatenating the computed MFCCs. The resulting bionic wavelet coefficients are concatenated to form one input of a Multi-Layer Perceptron (MLP) used for feature classification. In the MLP training and test phases, we use eleven Arabic words, each repeated twenty-five times by the same speaker. A simulation program written to evaluate the proposed technique shows a classification rate of 99.39%. We also introduce a denoising module as a preprocessing phase. In this module, we treat the case of white noise and use Wiener filtering. The recognition rate obtained is 78.7% at SNR = 5 dB and 93.9% at SNR = 10 dB.
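The abstract reports recognition rates for white noise at SNR = 5 dB and SNR = 10 dB but does not say how the noisy test signals were produced. As a minimal sketch under stated assumptions, the Python below shows one common way to add white Gaussian noise to a signal at a target SNR. The function names, the 440 Hz test tone, and the 8 kHz sampling rate are illustrative choices, not details from the paper, and the Wiener filter, MFCC, and BWT stages are not implemented here.

```python
import math
import random


def signal_power(x):
    """Mean squared amplitude of a sequence of samples."""
    return sum(v * v for v in x) / len(x)


def add_white_noise(signal, snr_db, rng):
    """Return (noisy, noise), where noise is white Gaussian noise scaled
    so that signal power / expected noise power matches snr_db.
    Assumption: this is a generic recipe, not the authors' procedure."""
    p_noise = signal_power(signal) / (10 ** (snr_db / 10))
    sigma = math.sqrt(p_noise)
    noise = [rng.gauss(0.0, sigma) for _ in signal]
    noisy = [s + n for s, n in zip(signal, noise)]
    return noisy, noise


def measured_snr_db(signal, noise):
    """SNR in dB computed from the clean signal and the noise actually added."""
    return 10 * math.log10(signal_power(signal) / signal_power(noise))


if __name__ == "__main__":
    rng = random.Random(0)   # fixed seed so the run is repeatable
    fs = 8000                # hypothetical sampling rate in Hz
    tone = [math.sin(2 * math.pi * 440 * n / fs) for n in range(fs)]
    for target in (5, 10):   # the two SNR conditions named in the abstract
        noisy, noise = add_white_noise(tone, target, rng)
        print(target, round(measured_snr_db(tone, noise), 2))
```

The measured SNR only approximates the target because the noise is a finite random sample; longer signals bring it closer.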


Pages: 803-808

Page count: 6
