Voice Activity Detector Based on Enhanced Cumulant of LPC Residual and On-line EM Algorithm

Authors
Cournapeau, David [1]
Kawahara, Tatsuya [1]
Mase, Kenji [2, 3]
Toriyama, Tomoji [3]
Affiliations
[1] Kyoto Univ, Grad Sch Informat, Kyoto, Japan
[2] Nagoya Univ, Grad Sch Informat Sci, Nagoya, Aichi, Japan
[3] ATR, Media Informat Sci Lab, Kyoto, Japan
Keywords
voice activity detection; high order statistics; on-line EM;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper addresses the problem of segmenting audio data recorded with embedded devices for intelligent sensing in multi-modal interactions. We propose a real-time method for robust speech detection in natural, noisy environments. The method fuses high-order statistics of the LPC residual with the autocorrelation and uses an on-line version of the Expectation-Maximization (EM) algorithm for classification. Experimental evaluations show that the proposed method provides better detection performance under different types of natural noise and remains robust against other voices in multi-speaker interactive situations. Because the method relies on features with a low computational cost and has low latency, it is suitable for real-time tracking applications.
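The two ingredients named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the record does not define the "enhanced cumulant" or the fusion with autocorrelation, so the sketch assumes a plain excess kurtosis (normalised fourth-order cumulant) of the LPC residual as the feature and a two-component one-dimensional Gaussian mixture as the classifier. The names `lpc_residual`, `excess_kurtosis`, and `OnlineEM2`, the LPC order, and the step-size exponent are all hypothetical choices made for illustration.

```python
import numpy as np

def lpc_residual(frame, order=10):
    """Return the LPC prediction residual of one frame (autocorrelation method)."""
    x = np.asarray(frame, dtype=float)
    n = len(x)
    # Autocorrelation at lags 0..order.
    r = np.array([np.dot(x[: n - k], x[k:]) for k in range(order + 1)])
    # Toeplitz normal equations R a = r[1:], lightly regularised for stability.
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    R += 1e-9 * (r[0] + 1.0) * np.eye(order)
    a = np.linalg.solve(R, r[1:])
    # Predict x[t] from the previous `order` samples and subtract.
    pred = np.zeros(n)
    for k in range(1, order + 1):
        pred[k:] += a[k - 1] * x[: n - k]
    return x - pred

def excess_kurtosis(e):
    """Normalised fourth-order cumulant; close to 0 for Gaussian data."""
    e = np.asarray(e, dtype=float)
    e = e - e.mean()
    var = np.mean(e ** 2)
    return np.mean(e ** 4) / (var ** 2 + 1e-12) - 3.0

class OnlineEM2:
    """Stepwise EM for a two-component 1-D Gaussian mixture (assumed model)."""

    def __init__(self, means=(0.0, 5.0), var=1.0):
        self.w = np.array([0.5, 0.5])
        self.mu = np.array(means, dtype=float)
        self.var = np.array([var, var], dtype=float)
        # Running sufficient statistics: E[z], E[z*y], E[z*y^2].
        self.s0 = self.w.copy()
        self.s1 = self.w * self.mu
        self.s2 = self.w * (self.var + self.mu ** 2)
        self.t = 0

    def update(self, y):
        # E-step for the single new observation.
        lik = self.w * np.exp(-0.5 * (y - self.mu) ** 2 / self.var)
        lik = lik / np.sqrt(2.0 * np.pi * self.var)
        post = lik / (lik.sum() + 1e-300)
        # Stochastic-approximation update with a decaying step size.
        self.t += 1
        g = (self.t + 1) ** -0.6
        self.s0 += g * (post - self.s0)
        self.s1 += g * (post * y - self.s1)
        self.s2 += g * (post * y * y - self.s2)
        # M-step from the running statistics.
        s0 = np.maximum(self.s0, 1e-8)
        self.w = s0 / s0.sum()
        self.mu = self.s1 / s0
        self.var = np.maximum(self.s2 / s0 - self.mu ** 2, 1e-3)
        return post[1]  # posterior of the second component
```

In use, each incoming frame would be reduced to one feature value with `excess_kurtosis(lpc_residual(frame))` and passed to `OnlineEM2.update`, which returns the posterior of the second mixture component and adapts the model one frame at a time. Treating that component as "speech" is an assumption of this sketch.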
Pages: 1201 / +
Page count: 2