Noise robust voice activity detection based on statistical model and parallel non-linear Kalman filtering

被引:0
|
作者
Fujimoto, Masakiyo [1 ]
Ishizuka, Kentaro [1 ]
Kato, Hiroko [1 ]
机构
[1] NTT Corp, NTT Commun Sci Labs, 2-4,Hikari Dai,Seika Cho, Kyoto 6190288, Japan
关键词
speech processing; state space methods; Kalman filtering; multiplied estimator; forward-backward estimation;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper addresses the problem of voice activity detection in noise environments. The proposed voice activity detection technique described in this paper is based on a statistical model approach, and estimates the statistical models sequentially without a prior knowledge of noise. The crucial factor as regards the statistical model-based approach is noise parameter estimation, especially non-stationary noise. To deal with this problem, a parallel non-linear Kalman filter, that is a multiplied estimator, is used for sequential noise estimation. Also, a backward estimation is used for noise estimation and likelihood calculation for speech / non-speech discrimination. In the evaluation results, we observed that the proposed method significantly outperforms conventional methods as regards voice activity detection accuracy in noisy environments.
引用
收藏
页码:797 / +
页数:2
相关论文
共 50 条
  • [31] Noise robust voice activity detection based on periodic to aperiodic component ratio
    Ishizuka, Kentaro
    Nakatani, Tomohiro
    Fujimoto, Masakiyo
    Miyazaki, Noboru
    SPEECH COMMUNICATION, 2010, 52 (01) : 41 - 60
  • [32] Entropy-based Extended Kalman Filtering for Stochastic Non-linear Systems with Polynomial Compensation
    Zhang, Qichun
    Hu, Zixiang
    Hu, Liang
    2019 25TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING (ICAC), 2019, : 417 - 422
  • [33] CONCEPT OF PARALLEL AND CONVERGENT ALGEBRAS OF RECURSIVE NON-LINEAR FILTERING
    LEVIEUX, F
    APPLIED MATHEMATICS AND OPTIMIZATION, 1977, 4 (01): : 61 - 95
  • [34] Linear and non-linear model for statistical localization of landmarks
    Romaniuk, B
    Desvignes, M
    Revenu, M
    Deshayes, MJ
    16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITON, VOL IV, PROCEEDINGS, 2002, : 393 - 396
  • [35] Comparison Among ECG Filtering Methods for Non-linear Noise
    Patwary, Adnan Basir
    Chowdhury, Md. Thouhidul Islam
    Mamun, Nursadul
    2018 INTERNATIONAL CONFERENCE ON ADVANCEMENT IN ELECTRICAL AND ELECTRONIC ENGINEERING (ICAEEE), 2018,
  • [36] A FINITELY ADDITIVE WHITE NOISE APPROACH TO NON-LINEAR FILTERING
    KALLIANPUR, G
    KARANDIKAR, RL
    APPLIED MATHEMATICS AND OPTIMIZATION, 1983, 10 (02): : 159 - 185
  • [37] A new Multiuser Detection Algorithm Based on Robust Kalman Filtering
    Li Yanping
    Zong Hengshan
    Chang Xiaoming
    Chen Xiangnan
    2012 INTERNATIONAL CONFERENCE ON FUTURE ELECTRICAL POWER AND ENERGY SYSTEM, PT A, 2012, 17 : 615 - 622
  • [38] A Fusion Model for Robust Voice Activity Detection
    Wang, Guan-Bo
    Zhang, Wei-Qiang
    2019 IEEE 19TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT 2019), 2019,
  • [39] FAULT-DETECTION BY ADAPTIVE NON-LINEAR FILTERING
    KRISHNAN, V
    JOURNAL OF THE INDIAN INSTITUTE OF SCIENCE SECTION A-ENGINEERING & TECHNOLOGY, 1981, 63 (11): : 249 - 262
  • [40] Non-linear matched filtering for object detection and tracking
    Noyer, JC
    Lanvin, P
    Benjelloun, M
    PATTERN RECOGNITION LETTERS, 2004, 25 (06) : 655 - 668