Signal to Noise Ratio Estimation Based on An Optimal Design of Subband Voice Activity Detection

被引:0
|
作者
Morita, Shota [1 ]
Lu, Xugang [2 ]
Unoki, Masashi [1 ]
机构
[1] Japan Adv Inst Sci & Technol, Sch Informat Sci, Tokyo, Japan
[2] Natl Inst Informat & Commun Technol, Universal Commun Res Inst, Tokyo, Japan
关键词
Signal to noise ratio; voice activity detection; subband processing; decision of threshold;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Estimates of the signal to noise ratio (SNR) of speech play an important role in noise reduction and predictions of speech intelligibility based on the speech transmission index (STI). Techniques of voice activity detection (VAD) must be used explicitly or implicitly during estimates of SNR to detect speech and non-speech sections. The decision of threshold in most studies has been fixed for VAD to speech and non-speech classications during SNR estimates. We argue that xing the decision of the threshold for all testing conditions is not optimal in controlling the false acceptance and miss detection rates of speech. We propose SNR estimates in this paper using a speech and non-speech detection algorithm based on optimizing the trade-off between false speech acceptance and miss detection rates on a receiver operating characteristic (ROC) curve. Rather than xing the decision threshold in VAD for all SNR conditions, we optimally estimate the decision threshold using an ROC curve for each SNR condition. Thresholds are optimized in subband signals on a large training data set composed of various SNR conditions and noise types. After speech and non-speech are detected, SNR is estimated by summarizing the subband powers of speech and noise from all subbands. We applied the proposed method of estimating SNR based on AURORA2J and NOISEX-92 data corpora. The experimental results demonstrated that the proposed method was more accurate than the classical method of estimating SNR. The proposed approach could be used in robust VAD and STI estimates.
引用
收藏
页码:560 / +
页数:2
相关论文
共 50 条
  • [21] Evaluation of Noise Estimation Algorithms Based on Minimum Statistics and Signal to Noise Ratio
    Jakovljevic, Niksa
    Miskovic, Dragisa
    Trpovski, Zeljen
    2016 24TH TELECOMMUNICATIONS FORUM (TELFOR), 2016, : 419 - 422
  • [22] Noise estimation for speech enhancement by the estimated degree of noise without voice activity detection
    Hamid, M. Ekramul
    Ogawa, Keita
    Fukabayashi, Takeshi
    PROCEEDINGS OF THE EIGHTH IASTED INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, 2006, : 420 - +
  • [23] Signal Detection Algorithm Design Based on Stochastic Resonance Technology Under Low Signal-to-Noise Ratio
    Jiang X.
    Diao M.
    Qu S.
    Journal of Shanghai Jiaotong University (Science), 2019, 24 (03) : 328 - 334
  • [24] Signal Detection Algorithm Design Based on Stochastic Resonance Technology Under Low Signal-to-Noise Ratio
    江晓林
    刁鸣
    渠苏苏
    JournalofShanghaiJiaotongUniversity(Science), 2019, 24 (03) : 328 - 334
  • [25] A subband space constrained beamformer incorporating voice activity detection
    Davis, A
    Low, SY
    Nordholm, S
    Grbic, N
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 65 - 68
  • [26] Signal-to-noise ratio (SNR) as a measure of reproducibility: Design, estimation, and application
    Elkum N.
    Shoukri M.M.
    Health Services and Outcomes Research Methodology, 2008, 8 (3) : 119 - 133
  • [27] Signal-to-noise ratio estimation for FFT-based system
    Lee, BK
    Kim, MJ
    Song, HK
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2005, E88B (03) : 1279 - 1281
  • [28] Signal to Noise Ratio Estimation in OFDM Based Cooperative Communication System
    Manzoor, Shahid
    Othman, Noor Shamsiah
    2017 IEEE 13TH MALAYSIA INTERNATIONAL CONFERENCE ON COMMUNICATIONS (MICC), 2017, : 84 - 89
  • [29] A robust voice activity detection based on noise eigenspace projection
    Ying, Dongwen
    Shi, Yu
    Soong, Frank
    Dang, Jianwu
    Lu, Xugang
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 76 - +
  • [30] Noise robust model-based Voice Activity Detection
    de la Torre, Angel
    Ramirez, Javier
    Benitez, Carmen
    Segura, Jose C.
    Garcia, Luz
    Rubio, Antonio J.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1954 - 1957