Non-Linear Filtering for Feature Enhancement of Reverberant Speech

被引：0

作者：

Verma, Amit Kumar ^{[1
]}

Tomar, Hemendra ^{[2
]}

Chetupalli, Srikanth Raj ^{[3
]}

Sreenivas, T. V. ^{[3
]}

机构：

[1] Elect & Radar Dev Estab LRDE, Bangalore, Karnataka, India

[2] Indian Air Force, New Delhi, India

[3] Indian Inst Sci, Dept Elect Commun Engn, Bangalore, Karnataka, India

来源：

TENCON 2017 - 2017 IEEE REGION 10 CONFERENCE | 2017年

关键词：

Gaussian Mixture Model(GMM); universal background model (UBM); mel frequency cepstral coefficients (MFCC); non-linear filtering; IDENTIFICATION; MODELS;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Speaker identification implemented on a mobile robot is a challenging problem because of varying reverberant environments which the robot encounters while in motion. The performance of a typical speaker identification system degrades significantly in reverberant environments. The degradation in performance is mainly due to the conventional feature being not robust to change in reverberant condition. In this paper, we present a non-linear filter based mel frequency cepstral coefficient (MFCC) feature extraction, which is more robust to changes in reverberant conditions. This feature extraction method is a two stage operation and is applied on the spectrogram of the speech signal. The first stage suppresses the frequency spread due to reverberation within each frame and in the second stage, reverberation effect across the frames is suppressed. The performance is evaluated by the GMM-UBM based identifier built and tested with conventional MFCC feature vectors and with the non-linear filter based MFCC feature vectors. We show that, the identification accuracy of GMM-UBM based identifier with non-linear filter based MFCC feature vectors is better than that of conventional MFCC feature vectors.

引用

页码：1800 / 1805

页数：6

共 50 条

[1] A speech enhancement algorithm based on non-linear filtering and noise masking
Zhang, JJ
Cao, ZG
[J]. CHINESE JOURNAL OF ELECTRONICS, 2000, 9 (03) : 296 - 300
[2] LINEAR AND NON-LINEAR FILTERING FOR IMAGE-ENHANCEMENT
KEKRE, HB
SOLANKI, JK
[J]. COMPUTERS & ELECTRICAL ENGINEERING, 1978, 5 (03) : 283 - 288
[3] Feature enhancement of reverberant speech by distribution matching and non-negative matrix factorization
Sami Keronen
Heikki Kallasjoki
Kalle J. Palomäki
Guy J. Brown
Jort F. Gemmeke
[J]. EURASIP Journal on Advances in Signal Processing, 2015
[4] Feature enhancement of reverberant speech by distribution matching and non-negative matrix factorization
Keronen, Sami
Kallasjoki, Heikki
Palomaki, Kalle J.
Brown, Guy J.
Gemmeke, Jort F.
[J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2015,
[5] Contrast Enhancement and Detailed Enhancement Method Based on Non-linear Filtering
Goto, Tomio
Ikeyama, Miho
Hirano, Satoshi
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN (ICCE-TW), 2018,
[6] Model-Based Feature Enhancement for Reverberant Speech Recognition
Krueger, Alexander
Haeb-Umbach, Reinhold
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07): : 1692 - 1707
[7] MULTICHANNEL SPEECH DEREVERBERATION AND SEPARATION WITH OPTIMIZED COMBINATION OF LINEAR AND NON-LINEAR FILTERING
Togami, Masahito
Kawaguchi, Yohei
Takeda, Ryu
Obuchi, Yasunari
Nukaga, Nobuo
[J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4057 - 4060
[8] Non-linear transformations of the feature space for robust speech recognition
de la Torre, A
Segura, JC
Benítez, C
Peinado, AM
Rubio, AJ
[J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 401 - 404
[9] Speech intelligibility improvement in noisy reverberant environments based on speech enhancement and inverse filtering
Dong, Huan-Yu
Lee, Chang-Myung
[J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2018,
[10] Speech intelligibility improvement in noisy reverberant environments based on speech enhancement and inverse filtering
Huan-Yu Dong
Chang-Myung Lee
[J]. EURASIP Journal on Audio, Speech, and Music Processing, 2018

← 1 2 3 4 5 →