Blind Signal-to-Noise Ratio Estimation of Speech Based on Vector Quantizer Classifiers and Decision Level Fusion

Authors
Russell Ondusko
Matthew Marbach
Ravi P. Ramachandran
Linda M. Head
Affiliations
[1] Navsea, Department of Electrical and Computer Engineering
[2] Lockheed Martin
[3] Rowan University
Keywords
Blind estimation; Linear predictive features; Vector quantizer classifier; Estimation combination; Overall average absolute error
Abstract
A blind approach is proposed for estimating the signal-to-noise ratio (SNR) of a speech signal corrupted by additive noise. The method follows a pattern recognition paradigm that uses several linear predictive features, a vector quantizer classifier, and estimation combination. Blind SNR estimation is useful in speaker identification systems in which a confidence metric is determined along with the speaker identity. The confidence metric is partially based on the mismatch between the training and testing conditions of the speaker identification system, and SNR estimation is important in evaluating the degree of this mismatch. The aim is to correctly estimate SNR values from 0 to 30 dB, a range that is both practical and crucial for speaker identification systems. Experiments consider (1) artificially generated additive white Gaussian noise, pink noise, and bandpass noise and (2) fifteen noise types from the NOISEX database. Four features are combined to get the best results. The average SNR estimation error depends on the type of noise: a relatively low error results for pink noise and jet cockpit noise, and a high error results for destroyer operations room noise and military vehicle noise. For both the artificially generated noise and the NOISEX data, the error is lower than that achieved by the IMCRA method, which uses SNR estimation for speech enhancement. Combining the four features with IMCRA lowers the error for 8 of the 15 noise types from NOISEX.
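The pipeline named in the abstract (a vector quantizer classifier per feature, followed by combination of the per-feature estimates and scoring by average absolute error) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the authors' implementation: the k-means codebook training, the minimum-distortion decision rule, the use of a plain mean as the fusion rule, and all function names are hypothetical, and the feature vectors are taken as already extracted.

```python
import numpy as np


def train_codebook(vectors, size, iters=20, seed=0):
    """Train a VQ codebook with plain k-means (assumed; any VQ design method would do)."""
    rng = np.random.default_rng(seed)
    # Initialise the codewords from randomly chosen training vectors.
    codebook = vectors[rng.choice(len(vectors), size, replace=False)].copy()
    for _ in range(iters):
        # Squared Euclidean distance from every vector to every codeword.
        dists = ((vectors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
        nearest = dists.argmin(axis=1)
        for k in range(size):
            members = vectors[nearest == k]
            if len(members):  # leave empty cells unchanged
                codebook[k] = members.mean(axis=0)
    return codebook


def distortion(vectors, codebook):
    """Average squared distance from each vector to its nearest codeword."""
    dists = ((vectors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    return dists.min(axis=1).mean()


def vq_estimate(vectors, codebooks):
    """Return the SNR label (dB) whose codebook quantizes the vectors with least distortion.

    codebooks: dict mapping an SNR value in dB to a codebook trained at that SNR.
    """
    return min(codebooks, key=lambda snr: distortion(vectors, codebooks[snr]))


def fuse(estimates):
    """Combine per-feature SNR estimates; a simple mean is assumed here."""
    return float(np.mean(estimates))


def average_absolute_error(estimated, true):
    """Mean of |estimated - true| over a set of test utterances, in dB."""
    return float(np.mean(np.abs(np.asarray(estimated) - np.asarray(true))))


if __name__ == "__main__":
    # Synthetic stand-in features whose mean shifts with SNR (purely illustrative).
    rng = np.random.default_rng(1)
    books = {snr: train_codebook(rng.normal(snr / 10, 0.1, (200, 4)), 8)
             for snr in (0, 10, 20, 30)}
    test = rng.normal(2.0, 0.1, (50, 4))
    print(vq_estimate(test, books))
```

In this sketch one set of codebooks would be trained per feature type, `vq_estimate` would be called once per feature, and `fuse` would merge the resulting estimates into a single SNR value for the utterance.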
Pages: 335-345
Page count: 10