Noise robust voice activity detection based on statistical model and parallel non-linear Kalman filtering

被引:0
|
作者
Fujimoto, Masakiyo [1 ]
Ishizuka, Kentaro [1 ]
Kato, Hiroko [1 ]
机构
[1] NTT Corp, NTT Commun Sci Labs, 2-4,Hikari Dai,Seika Cho, Kyoto 6190288, Japan
关键词
speech processing; state space methods; Kalman filtering; multiplied estimator; forward-backward estimation;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper addresses the problem of voice activity detection in noise environments. The proposed voice activity detection technique described in this paper is based on a statistical model approach, and estimates the statistical models sequentially without a prior knowledge of noise. The crucial factor as regards the statistical model-based approach is noise parameter estimation, especially non-stationary noise. To deal with this problem, a parallel non-linear Kalman filter, that is a multiplied estimator, is used for sequential noise estimation. Also, a backward estimation is used for noise estimation and likelihood calculation for speech / non-speech discrimination. In the evaluation results, we observed that the proposed method significantly outperforms conventional methods as regards voice activity detection accuracy in noisy environments.
引用
下载
收藏
页码:797 / +
页数:2
相关论文
共 50 条
  • [1] Noise Robust Voice Activity Detection Based on Switching Kalman Filter
    Fujimoto, Masakiyo
    Ishizuka, Kentaro
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 965 - 968
  • [2] Noise robust voice activity detection based on switching Kalman filter
    Fujimoto, Masakiyo
    Ishizuka, Kentaro
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (03): : 467 - 477
  • [3] Noise robust model-based Voice Activity Detection
    de la Torre, Angel
    Ramirez, Javier
    Benitez, Carmen
    Segura, Jose C.
    Garcia, Luz
    Rubio, Antonio J.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1954 - 1957
  • [4] NON-LINEAR NOISE ADAPTIVE KALMAN FILTERING VIA VARIATIONAL BAYES
    Sarkka, Simo
    Hartikainen, Jouni
    2013 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2013,
  • [5] Robust extended Kalman filtering for non-linear systems with unknown input: a UBB model approach
    Asgari, Mersad
    Khaloozadeh, Hamid
    IET RADAR SONAR AND NAVIGATION, 2020, 14 (11): : 1837 - 1844
  • [6] Adaptive robust Kalman filtering based on the current statistical model
    Gao, Wei-Guang
    Yang, Yuan-Xi
    Zhang, Shuang-Cheng
    Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2006, 35 (01): : 15 - 18
  • [7] Robust voice activity detection based on noise eigenspace
    Ying, Dongwen
    Shi, Yu
    Lu, Xugang
    Dang, Jianwu
    Soong, Frank
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2007, 28 (06) : 413 - 423
  • [8] Noise Robust Speech Recognition Using Parallel Model Compensation and Voice Activity Detection Methods
    Hizlisoy, Serhat
    Tufekci, Zekeriya
    2016 5TH INTERNATIONAL CONFERENCE ON ELECTRONIC DEVICES, SYSTEMS AND APPLICATIONS (ICEDSA), 2016,
  • [9] Convergence analysis of non-linear filtering based on cubature Kalman filter
    Zarei, Jafar
    Shokri, Ehsan
    IET SCIENCE MEASUREMENT & TECHNOLOGY, 2015, 9 (03) : 294 - 305
  • [10] On Noise Robust Voice Activity Detection
    Dekens, Tomas
    Verhelst, Werner
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2660 - 2663