Noise-Robust Voice Activity Detector Based On Four States-Based HMM

被引:1
|
作者
Zhou, Bin [1 ]
Liu, Jing [1 ]
Pei, Zheng [1 ]
机构
[1] Xihua Univ, Ctr Radio Adm & Technol Dev, Chengdu, Peoples R China
关键词
Voice activity detection; k-means clustering; left-right hidden Markov model; low signal-to-noise ratio;
D O I
10.4028/www.scientific.net/AMM.411-414.743
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
Voice activity detection (VAD) is more and more essential in the noisy environments to provide an accuracy performance in the speech recognition. In this paper, we provide a method based on left-right hidden Markov model (HMM) to identify the start and end of the speech. The method builds two models of non-speech and speech instead of existed two states, formally, each model could include several states, we also analysis other features, such as pitch index, pitch magnitude and fractal dimension of speech and non-speech.. We compare the VAD results with the proposed algorithm and two states HMM. Experiments show that the proposed method make a better performance than two state HMMs in VAD, especially in the low signal-to-noise ratio (SNR) environment.
引用
收藏
页码:743 / 748
页数:6
相关论文
共 50 条
  • [41] Fast and noise-robust quantum state tomography based on ELM
    Wu, Xiao-Dong
    Cong, Shuang
    INTERNATIONAL JOURNAL OF QUANTUM INFORMATION, 2024, 22 (04)
  • [42] Noise Robust Voice Detector for Speaker Recognition
    Hernandez, Gabriel
    Calvo, Jose R.
    Fernandez, Rafael
    Rodes, Ivis
    Martinez, Rafael
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 2605 - 2608
  • [43] A noise-robust FFT-based spectrum for audio classification
    Chu, Wei
    Champagne, Benoit
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 5071 - 5074
  • [44] Noise-robust feature based on sparse representation for speaker recognition
    Qi, Hongzhuo
    Metallurgical and Mining Industry, 2015, 7 (04): : 64 - 69
  • [45] A Semi-Continuous State-Transition Probability HMM-Based Voice Activity Detector
    H Othman
    T Aboulnasr
    EURASIP Journal on Audio, Speech, and Music Processing, 2007
  • [46] Robust Pathological Voice Detection Based on Component Information from HMM
    Sarria-Paja, M.
    Castellanos-Dominguez, G.
    ADVANCES IN NONLINEAR SPEECH PROCESSING, 2011, 7015 : 254 - 261
  • [47] Noise robust voice activity detection based on periodic to aperiodic component ratio
    Ishizuka, Kentaro
    Nakatani, Tomohiro
    Fujimoto, Masakiyo
    Miyazaki, Noboru
    SPEECH COMMUNICATION, 2010, 52 (01) : 41 - 60
  • [48] AUDIO-VISUAL VOICE CONVERSION USING NOISE-ROBUST FEATURES
    Sawada, Kohei
    Takehara, Masanori
    Tamura, Satoshi
    Hayamizu, Satoru
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [49] Superpixel-Based Noise-Robust Sparse Unmixing of Hyperspectral Image
    Li, Chang
    Sui, Chenhong
    Song, Rencheng
    Cheng, Juan
    Liu, Yu
    Chen, Xun
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [50] Noise-Robust HRRP Target Recognition Based on Residual Scattering Network
    Huang, Pengjun
    Li, Shuai
    Zheng, Muhai
    Xie, Jingyang
    Tian, Biao
    Xu, Shiyou
    2024 9TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, ICSIP, 2024, : 37 - 41