Noise robust estimate of speech dynamics for speaker recognition

被引：0

作者：

Openshaw, JP

Mason, JS

机构：

来源：

ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4 | 1996年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper investigates the robustness of cepstral based features with respect to additive noise, and details two methods of increasing the robustness with minimal need for o-priori knowledge of the noise statistics. The first approach is a form of noise masking which adds a fixed offset to the linear spectral estimate. The second is a form of sub-band filtering, again in the linear domain, which estimates the dynamic content of the speech using Fourier transforms. This avoids negative values normally inherent in such filtering and which presents difficulties in deriving log estimates. Both methods are shown to provide useful levels of robustness to additive noise, for example, speaker identification error rates in SNR mis-matched conditions of 15 dB are reduced from 60.5% for standard mel cepstra to 13.8% and 24.1% for the two approaches respectively.

引用

页码：925 / 928

页数：4

共 50 条

[21] Robust Speaker Authentication Based on Combined Speech and Voiceprint Recognition
Malcangi, Mario
COMPUTATIONAL METHODS IN SCIENCE AND ENGINEERING, VOL 2: ADVANCES IN COMPUTATIONAL SCIENCE, 2009, 1148 : 872 - 875
[22] ROBUST SPEECH RECOGNITION THROUGH SELECTION OF SPEAKER AND ENVIRONMENT TRANSFORMS
Bilgi, Raghavendra
Joshi, Vikas
Umesh, S.
Garcia, L.
Benitez, C.
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4333 - 4336
[23] Emotional Speech Clustering based Robust Speaker Recognition System
Li, Dongdong
Yang, Yingchun
PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 4576 - +
[24] A posterior union model with applications to robust speech and speaker recognition
Ming, Ji
Lin, Jie
Smith, F. Jack
EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2006, 2006 (1)
[25] A Posterior Union Model with Applications to Robust Speech and Speaker Recognition
Ji Ming
Jie Lin
F. Jack Smith
EURASIP Journal on Advances in Signal Processing, 2006
[26] Analysis of DNN Speech Signal Enhancement for Robust Speaker Recognition
Novotny, Ondrej
Plchot, Oldrich
Glembek, Ondrej
Cernocky, Jan ''Honza''
Burget, Lukas
COMPUTER SPEECH AND LANGUAGE, 2019, 58 : 403 - 421
[27] Articulatory Information for Noise Robust Speech Recognition
Mitra, Vikramjit
Nam, Hosung
Espy-Wilson, Carol Y.
Saltzman, Elliot
Goldstein, Louis
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (07): : 1913 - 1924
[28] Robust speech recognition for car environment noise
Kokubo, H
Amano, A
Hataoka, N
ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2002, 85 (11): : 65 - 73
[29] Noise and speaker robustness in a Persian continuous speech recognition system
Veisi, Hadi
Sameti, Hossein
2007 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1-3, 2007, : 73 - 76
[30] Robust noise suppression methods in speech recognition
Cui, Yi
Zhang, Dong
Shi, Liangping
Chen, Liyuan
Beijing Youdian Xueyuan Xuebao/Journal of Beijing University of Posts And Telecommunications, 1998, 21 (02): : 10 - 14

← 1 2 3 4 5 →