Voice Activity Detector Based on Enhanced Cumulant of LPC Residual and On-line EM Algorithm

Cited by: 0
Authors
Cournapeau, David [1]
Kawahara, Tatsuya [1]
Mase, Kenji [2, 3]
Toriyama, Tomoji [3]
Affiliations
[1] Kyoto Univ, Grad Sch Informat, Kyoto, Japan
[2] Nagoya Univ, Grad Sch Informat Sci, Nagoya, Aichi, Japan
[3] ATR, Media Informat Sci Lab, Kyoto, Japan
Keywords
voice activity detection; high-order statistics; on-line EM
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper addresses the problem of segmenting audio data recorded with embedded devices for intelligent sensing in multi-modal interactions. We propose a real-time method for robust speech detection in natural, noisy environments. It is based on a fusion of high-order statistics of the LPC residual and autocorrelation, and it uses an on-line version of the Expectation-Maximization (EM) algorithm for classification. Experimental evaluations show that the proposed method provides better detection performance under different types of natural noise and works robustly against other voices in multi-speaker interactive situations. Because the method relies on features with a low computational cost and has a small latency, it is suitable for real-time tracking applications.
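The two ingredients named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the record gives no details of the "enhanced" cumulant or of the fusion with autocorrelation, so the feature below is the plain excess kurtosis (normalized fourth-order cumulant) of the LPC residual, and the classifier is a generic stepwise on-line EM for a two-component one-dimensional Gaussian mixture. The names `lpc_error_filter`, `residual_kurtosis`, and `OnlineGmm2`, the LPC order, the step-size schedule, and the initial means are all assumptions made for illustration.

```python
import math
import random


def lpc_error_filter(frame, order):
    """Levinson-Durbin recursion; returns A(z) coefficients with a[0] == 1."""
    n = len(frame)
    r = [sum(frame[i] * frame[i + k] for i in range(n - k)) for k in range(order + 1)]
    a = [1.0] + [0.0] * order
    err = r[0] + 1e-12  # floor keeps silent frames from dividing by zero
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err
        prev = a[:]
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err = max(err * (1.0 - k * k), 1e-12)
    return a


def residual_kurtosis(frame, order=10):
    """Excess kurtosis (normalized 4th-order cumulant) of the LPC residual."""
    a = lpc_error_filter(frame, order)
    res = [sum(a[j] * frame[t - j] for j in range(order + 1))
           for t in range(order, len(frame))]
    mean = sum(res) / len(res)
    m2 = sum((v - mean) ** 2 for v in res) / len(res)
    m4 = sum((v - mean) ** 4 for v in res) / len(res)
    return m4 / (m2 * m2 + 1e-24) - 3.0


class OnlineGmm2:
    """Two-component 1-D Gaussian mixture fitted by stepwise (on-line) EM."""

    def __init__(self, means=(0.0, 1.0), var=1.0):
        # Running averages of the sufficient statistics E[z], E[z*x], E[z*x^2].
        self.s0 = [0.5, 0.5]
        self.s1 = [0.5 * m for m in means]
        self.s2 = [0.5 * (var + m * m) for m in means]
        self.t = 0

    def params(self):
        """Return [(weight, mean, variance), ...] implied by the statistics."""
        out = []
        for k in range(2):
            mu = self.s1[k] / self.s0[k]
            out.append((self.s0[k], mu, max(self.s2[k] / self.s0[k] - mu * mu, 1e-6)))
        return out

    def update(self, x):
        """Consume one sample; return the posterior of component 1."""
        # E-step on the single new sample, computed in the log domain.
        logp = [math.log(w) - 0.5 * math.log(2.0 * math.pi * v)
                - 0.5 * (x - mu) ** 2 / v for w, mu, v in self.params()]
        top = max(logp)
        e = [math.exp(l - top) for l in logp]
        # Tiny floor keeps either component from vanishing on long streams.
        resp = [(v + 1e-6) / (sum(e) + 2e-6) for v in e]
        # Stochastic-approximation step with a decaying step size (assumed schedule).
        self.t += 1
        eta = (self.t + 10.0) ** -0.6
        for k in range(2):
            self.s0[k] += eta * (resp[k] - self.s0[k])
            self.s1[k] += eta * (resp[k] * x - self.s1[k])
            self.s2[k] += eta * (resp[k] * x * x - self.s2[k])
        return resp[1]


# Usage: stream one feature per frame through the classifier.
rng = random.Random(0)
vad = OnlineGmm2(means=(0.0, 4.0))
for _ in range(200):
    frame = [rng.gauss(0.0, 1.0) for _ in range(160)]
    p_speech = vad.update(residual_kurtosis(frame))
```

Treating component 1 as "speech" is a convention of this sketch. The motivation is that the residual of voiced speech is impulsive and so tends to have high kurtosis, whereas the residual of Gaussian-like noise stays near zero. Each frame costs one low-order LPC analysis and a constant-time parameter update, which is in line with the low computational cost and small latency the abstract emphasizes.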
Pages: 1201 / +
Page count: 2
Related papers
50 in total
  • [1] Reinforcement learning based on on-line EM algorithm
    Sato, M
    Ishii, S
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 11, 1999, 11 : 1052 - 1058
  • [2] On-line EM algorithm for mixture of local experts
    Sato, M
    Ishii, S
    ICONIP'98: THE FIFTH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING JOINTLY WITH JNNS'98: THE 1998 ANNUAL CONFERENCE OF THE JAPANESE NEURAL NETWORK SOCIETY - PROCEEDINGS, VOLS 1-3, 1998, : 1397 - 1401
  • [3] On-line EM algorithm and reconstruction of chaotic dynamics
    Ishii, S
    Sato, M
    NEURAL NETWORKS FOR SIGNAL PROCESSING VIII, 1998, : 360 - 369
  • [4] On-line EM algorithm for the normalized gaussian network
    Sato, M
    Ishii, S
    NEURAL COMPUTATION, 2000, 12 (02) : 407 - 432
  • [5] Reconstruction of chaotic dynamics by on-line EM algorithm
    Ishii, S
    Sato, MA
    NEURAL NETWORKS, 2001, 14 (09) : 1239 - 1256
  • [6] Vowel synthesis by on-line EM algorithm with IIR filter
    Tamakoshi, H
    Ishii, S
    Yoshida, W
    Sato, M
    IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001, : 2821 - 2825
  • [7] New Algorithm for LPC Residual Estimation from LSF Vectors for a Voice Conversion System
    Percybrooks, Winston S.
    Moore, Elliot, II
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2480 - 2483
  • [8] Voice Activity Detection Based on High Order Statistics and Online EM Algorithm
    Cournapeau, David
    Kawahara, Tatsuya
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (12) : 2854 - 2861
  • [9] The Variational EM algorithm for on-line identification of extended AR models
    Smídl, V
    Quinn, A
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 117 - 120
  • [10] Learning chaotic dynamics under noise with on-line EM algorithm
    Yoshida, Wako
    Ishii, Shin
    Sato, Masa-Aki
    Electronics and Communications in Japan, Part III: Fundamental Electronic Science (English translation of Denshi Tsushin Gakkai Ronbunshi), 2001, 84 (06) : 23 - 31