Signal to Noise Ratio Estimation Based on An Optimal Design of Subband Voice Activity Detection

被引:0
|
作者
Morita, Shota [1 ]
Lu, Xugang [2 ]
Unoki, Masashi [1 ]
机构
[1] Japan Adv Inst Sci & Technol, Sch Informat Sci, Tokyo, Japan
[2] Natl Inst Informat & Commun Technol, Universal Commun Res Inst, Tokyo, Japan
关键词
Signal to noise ratio; voice activity detection; subband processing; decision of threshold;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Estimates of the signal to noise ratio (SNR) of speech play an important role in noise reduction and predictions of speech intelligibility based on the speech transmission index (STI). Techniques of voice activity detection (VAD) must be used explicitly or implicitly during estimates of SNR to detect speech and non-speech sections. The decision of threshold in most studies has been fixed for VAD to speech and non-speech classications during SNR estimates. We argue that xing the decision of the threshold for all testing conditions is not optimal in controlling the false acceptance and miss detection rates of speech. We propose SNR estimates in this paper using a speech and non-speech detection algorithm based on optimizing the trade-off between false speech acceptance and miss detection rates on a receiver operating characteristic (ROC) curve. Rather than xing the decision threshold in VAD for all SNR conditions, we optimally estimate the decision threshold using an ROC curve for each SNR condition. Thresholds are optimized in subband signals on a large training data set composed of various SNR conditions and noise types. After speech and non-speech are detected, SNR is estimated by summarizing the subband powers of speech and noise from all subbands. We applied the proposed method of estimating SNR based on AURORA2J and NOISEX-92 data corpora. The experimental results demonstrated that the proposed method was more accurate than the classical method of estimating SNR. The proposed approach could be used in robust VAD and STI estimates.
引用
收藏
页码:560 / +
页数:2
相关论文
共 50 条
  • [1] Voice activity detection using density ratio estimation of speech and noise
    Tachioka, Yuuki
    Hanazawa, Toshiyuki
    Narita, Tomohiro
    Ishii, Jun
    IEEJ Transactions on Electronics, Information and Systems, 2013, 133 (08) : 1549 - 1555
  • [2] Unsupervised voice activity detection with improved signal-to-noise ratio in noisy environment
    Sharma, Shilpa
    Malhotra, Rahul
    Sharma, Anurag
    Bala, Jeevan
    Rattan, Punam
    Vashisht, Sheveta
    INTERNATIONAL JOURNAL OF NANOTECHNOLOGY, 2023, 20 (1-4) : 421 - 432
  • [3] Voice activity detection based on density ratio estimation and system combination
    Tachioka, Yuuki
    Hanazawa, Toshiyuki
    Narita, Tomohiro
    Ishii, Jun
    2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
  • [4] Single-Channel Mixed Signal Detection Based on Signal Noise Ratio Estimation
    Fu, Jun
    Huan, Qiang
    Peng, Hua
    PROCEEDINGS OF 2017 IEEE 2ND INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC), 2017, : 705 - 711
  • [5] Photoacoustics Waveform Design for Optimal Signal to Noise Ratio
    Baddour, Natalie
    Sun, Zuwen
    SYMMETRY-BASEL, 2022, 14 (11):
  • [6] Noise robust voice activity detection based on periodic to aperiodic component ratio
    Ishizuka, Kentaro
    Nakatani, Tomohiro
    Fujimoto, Masakiyo
    Miyazaki, Noboru
    SPEECH COMMUNICATION, 2010, 52 (01) : 41 - 60
  • [7] SEQUENTIAL JOINT SIGNAL DETECTION AND SIGNAL-TO-NOISE RATIO ESTIMATION
    Fauss, M.
    Nagananda, K. G.
    Zoubir, A. M.
    Poor, H. V.
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 4606 - 4610
  • [8] Pitch Detection Based on Signal-to-Noise-Ratio Estimation and Compensation for Continuous Speech Signal
    Park, Hyung-Woo
    Khil, A-Ra
    Bae, Myung-Jin
    CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, 2012, 310 : 767 - 774
  • [9] Optimal Simultaneous Detection and Signal and Noise Power Estimation
    Le, Long
    Jones, Douglas L.
    2014 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2014, : 571 - 575
  • [10] An impulse noise robust voice activity detection algorithm applied for low signal-to-noise ratio digital communication
    Wang, Tong
    Cui, Hui-juan
    Tang, Kun
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 2225 - +