Noise Robust Speaker Identification Using RASTA-MFCC Feature with Quadrilateral Filter Bank Structure

被引:11
|
作者
Nidhyananthan, S. Selva [1 ]
Kumari, R. Shantha Selva [1 ]
Selvi, T. Senthur [1 ]
机构
[1] Mepco Schlenk Engn Coll, Dept ECE, Sivakasi, Tamil Nadu, India
关键词
Cepstral mean normalization (CMN); Equivalent rectangular bandwidth (ERB); Gaussian Mixture Model-Universal Background Model (GMM-UBM); Mel Frequency Cepstral Coefficients (MFCC); Relative Spectra processing (RASTA); SPEECH;
D O I
10.1007/s11277-016-3530-3
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
This paper motivates the use of Relative Spectra-Mel Frequency Cepstral Coefficients (RASTA-MFCC) feature extracted from the newly designed Quadrilateral filter bank structure and Gaussian Mixture Model-Universal Background Model (GMM-UBM) for improved text independent speaker identification under noisy environment. Unlike neural network model which requires retraining of entire database when a new sample is added to it, GMM-UBM model does not require retraining of entire database which leads to easier and faster processing. RASTA-MFCC is found to be more robust to noisy environment compared with traditional MFCC method. MFCC is an efficient feature for identifying the speaker as it has speaker specific information capturing ability. RASTA processing of speech improves the performance of recognizer in the presence of convolution and additive noise. This work combines the better of these two processes to yield RASTA-MFCC feature which is robust to noise and also proposes a new Quadrilateral filter bank structure which approximates the response of cochlear membrane of human ear to effectively capture the feature vectors. The proposed Quadrilateral filter bank structure with RASTA-MFCC feature and GMM-UBM modeling for speaker identification demonstrates supremacy over triangular and Gaussian filter banks and offers a speaker identification accuracy of 97.67 % for the MEPCO noisy speech database with 50 speakers.
引用
收藏
页码:1321 / 1333
页数:13
相关论文
共 35 条
  • [1] Noise Robust Speaker Identification Using RASTA–MFCC Feature with Quadrilateral Filter Bank Structure
    S. Selva Nidhyananthan
    R. Shantha Selva Kumari
    T. Senthur Selvi
    Wireless Personal Communications, 2016, 91 : 1321 - 1333
  • [2] Noise Robust Speaker Identification by Dividing MFCC
    Matsumoto, Kizuki
    Hayasaka, Noboru
    Iiguni, Youji
    2014 6TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING (ISCCSP), 2014, : 652 - 655
  • [3] Text Independent Voice Based Students Attendance System under Noisy Environment using RASTA-MFCC Feature
    Nidhyananthan, S. Selva
    Kumari, R. Shantha Selva
    2014 INTERNATIONAL CONFERENCE ON COMMUNICATION AND NETWORK TECHNOLOGIES (ICCNT), 2014, : 182 - 187
  • [4] Improved MFCC-Based Feature for Robust Speaker Identification
    吴尊敬
    曹志刚
    TsinghuaScienceandTechnology, 2005, (02) : 158 - 161
  • [5] Speaker Identification Using MFCC Feature Extraction ANN Classification Technique
    Singh, Mahesh K.
    WIRELESS PERSONAL COMMUNICATIONS, 2024, 136 (01) : 453 - 467
  • [6] Robust Automatic Speaker Identification System Using Shuffled MFCC Features
    Barhoush, Mahdi
    Hallawa, Ahmed
    Schmeink, Anke
    2021 IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLIED NETWORK TECHNOLOGIES (ICMLANT II), 2021, : 28 - 33
  • [7] A Framework for Robust MFCC Feature Extraction Using SNR-Dependent Compression of Enhanced Mel Filter Bank Energies
    Nasersharif, Babak
    Akbari, Ahmad
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 33 - 36
  • [8] Robust feature based on speech harmonic structure for speaker identification
    College of Communication and Information Engineering, Nanjing Univ. of Posts and Telecom., Nanjing 210003, China
    Dianzi Yu Xinxi Xuebao, 2006, 10 (1786-1789):
  • [9] ROBUST SPEAKER IDENTIFICATION USING AN AUDITORY-BASED FEATURE
    Li, Qi
    Huang, Yan
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4514 - 4517
  • [10] Speaker identification in mismatch condition using warped filter bank features
    Chavan, Mahesh S
    Chougule, Sharada V
    International Journal of Circuits, Systems and Signal Processing, 2015, 9 : 88 - 93