Noise Robust Speaker Identification Using RASTA-MFCC Feature with Quadrilateral Filter Bank Structure

被引：11

作者：

Nidhyananthan, S. Selva ^{[1
]}

Kumari, R. Shantha Selva ^{[1
]}

Selvi, T. Senthur ^{[1
]}

机构：

[1] Mepco Schlenk Engn Coll, Dept ECE, Sivakasi, Tamil Nadu, India

来源：

WIRELESS PERSONAL COMMUNICATIONS | 2016年 / 91卷 / 03期

关键词：

Cepstral mean normalization (CMN); Equivalent rectangular bandwidth (ERB); Gaussian Mixture Model-Universal Background Model (GMM-UBM); Mel Frequency Cepstral Coefficients (MFCC); Relative Spectra processing (RASTA); SPEECH;

D O I：

10.1007/s11277-016-3530-3

中图分类号：

TN [电子技术、通信技术];

学科分类号：

0809 ;

摘要：

This paper motivates the use of Relative Spectra-Mel Frequency Cepstral Coefficients (RASTA-MFCC) feature extracted from the newly designed Quadrilateral filter bank structure and Gaussian Mixture Model-Universal Background Model (GMM-UBM) for improved text independent speaker identification under noisy environment. Unlike neural network model which requires retraining of entire database when a new sample is added to it, GMM-UBM model does not require retraining of entire database which leads to easier and faster processing. RASTA-MFCC is found to be more robust to noisy environment compared with traditional MFCC method. MFCC is an efficient feature for identifying the speaker as it has speaker specific information capturing ability. RASTA processing of speech improves the performance of recognizer in the presence of convolution and additive noise. This work combines the better of these two processes to yield RASTA-MFCC feature which is robust to noise and also proposes a new Quadrilateral filter bank structure which approximates the response of cochlear membrane of human ear to effectively capture the feature vectors. The proposed Quadrilateral filter bank structure with RASTA-MFCC feature and GMM-UBM modeling for speaker identification demonstrates supremacy over triangular and Gaussian filter banks and offers a speaker identification accuracy of 97.67 % for the MEPCO noisy speech database with 50 speakers.

引用

页码：1321 / 1333

页数：13

共 35 条

[1] Noise Robust Speaker Identification Using RASTA–MFCC Feature with Quadrilateral Filter Bank Structure
S. Selva Nidhyananthan
R. Shantha Selva Kumari
T. Senthur Selvi
Wireless Personal Communications, 2016, 91 : 1321 - 1333
[2] Noise Robust Speaker Identification by Dividing MFCC
Matsumoto, Kizuki
Hayasaka, Noboru
Iiguni, Youji
2014 6TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING (ISCCSP), 2014, : 652 - 655
[3] Text Independent Voice Based Students Attendance System under Noisy Environment using RASTA-MFCC Feature
Nidhyananthan, S. Selva
Kumari, R. Shantha Selva
2014 INTERNATIONAL CONFERENCE ON COMMUNICATION AND NETWORK TECHNOLOGIES (ICCNT), 2014, : 182 - 187
[4] Improved MFCC-Based Feature for Robust Speaker Identification
吴尊敬
曹志刚
TsinghuaScienceandTechnology, 2005, (02) : 158 - 161
[5] Speaker Identification Using MFCC Feature Extraction ANN Classification Technique
Singh, Mahesh K.
WIRELESS PERSONAL COMMUNICATIONS, 2024, 136 (01) : 453 - 467
[6] Robust Automatic Speaker Identification System Using Shuffled MFCC Features
Barhoush, Mahdi
Hallawa, Ahmed
Schmeink, Anke
2021 IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLIED NETWORK TECHNOLOGIES (ICMLANT II), 2021, : 28 - 33
[7] A Framework for Robust MFCC Feature Extraction Using SNR-Dependent Compression of Enhanced Mel Filter Bank Energies
Nasersharif, Babak
Akbari, Ahmad
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 33 - 36
[8] Robust feature based on speech harmonic structure for speaker identification
College of Communication and Information Engineering, Nanjing Univ. of Posts and Telecom., Nanjing 210003, China
Dianzi Yu Xinxi Xuebao, 2006, 10 (1786-1789):
[9] ROBUST SPEAKER IDENTIFICATION USING AN AUDITORY-BASED FEATURE
Li, Qi
Huang, Yan
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4514 - 4517
[10] Speaker identification in mismatch condition using warped filter bank features
Chavan, Mahesh S
Chougule, Sharada V
International Journal of Circuits, Systems and Signal Processing, 2015, 9 : 88 - 93

← 1 2 3 4 →