Reliable likelihood ratios for statistical model-based voice activity detector with low false-alarm rate

被引:8
|
作者
Kim, Younggwan [1 ]
Suh, Youngjoo [1 ]
Kim, Hoirin [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Dept Elect Engn, Taejon 305701, South Korea
关键词
voice activity detector; statistical model; reliability of likelihood ratio;
D O I
10.1186/1687-6180-2011-31
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The role of the statistical model-based voice activity detector (SMVAD) is to detect speech regions from input signals using the statistical models of noise and noisy speech. The decision rule of SMVAD is based on the likelihood ratio test (LRT). The LRT-based decision rule may cause detection errors because of statistical properties of noise and speech signals. In this article, we first analyze the reasons why the detection errors occur and then propose two modified decision rules using reliable likelihood ratios (LRs). We also propose an effective weighting scheme considering spectral characteristics of noise and speech signals. In the experiments proposed in this study, with almost no additional computations, the proposed methods show significant performance improvement in various noise conditions. Experimental results also show that the proposed weighting scheme provides additional performance improvement over the two proposed SMVADs.
引用
收藏
页数:12
相关论文
共 27 条
  • [1] Reliable likelihood ratios for statistical model-based voice activity detector with low false-alarm rate
    Younggwan Kim
    Youngjoo Suh
    Hoirin Kim
    EURASIP Journal on Advances in Signal Processing, 2011
  • [2] Analysis and improvement of a statistical model-based voice activity detector
    Cho, YD
    Kondoz, A
    IEEE SIGNAL PROCESSING LETTERS, 2001, 8 (10) : 276 - 278
  • [3] A statistical model-based voice activity detection
    Sohn, J
    Kim, NS
    Sung, W
    IEEE SIGNAL PROCESSING LETTERS, 1999, 6 (01) : 1 - 3
  • [4] Statistical Voice Activity Detector Based on Signal Subspace Model
    Ryu, Kwang-Chun
    Kim, Dong-Kook
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2008, 27 (07): : 372 - 378
  • [5] A Model-Based Approach to Anomaly Detection Trading Detection Time and False Alarm Rate
    Goncalves, Charles F.
    Menasche, Daniel S.
    Avritzer, Alberto
    Antunes, Nuno
    Vieira, Marco
    2020 MEDITERRANEAN COMMUNICATION AND COMPUTER NETWORKING CONFERENCE (MEDCOMNET), 2020,
  • [6] Discriminative Weight Training for a Statistical Model-Based Voice Activity Detection
    Kang, Sang-Ick
    Jo, Q-Haing
    Chang, Joon-Hyuk
    Park, Seung Seop
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2007, 26 (05): : 194 - 198
  • [7] Discriminative weight training for a statistical model-based voice activity detection
    Kang, Sang-Ick
    Jo, Q-Haing
    Chang, Joon-Hyuk
    IEEE SIGNAL PROCESSING LETTERS, 2008, 15 : 170 - 173
  • [8] Speech recognition enhancement with statistical model-based voice activity detection
    Jarc, Bojan
    Babič, Rudolf
    Elektrotehniski Vestnik/Electrotechnical Review, 2002, 69 (01): : 75 - 81
  • [9] Detection method of electricity theft with low false alarm rate based on an XGBoost model
    Chen G.
    Li D.
    Chen X.
    Dianli Xitong Baohu yu Kongzhi/Power System Protection and Control, 2021, 49 (23): : 178 - 186
  • [10] Multiple Acoustic Model-Based Discriminative Likelihood Ratio Weighting for Voice Activity Detection
    Suh, Youngjoo
    Kim, Hoirin
    IEEE SIGNAL PROCESSING LETTERS, 2012, 19 (08) : 507 - 510