Noise-Robust Voice Activity Detector Based On Four States-Based HMM

Cited by: 1
Authors
Zhou, Bin [1]
Liu, Jing [1]
Pei, Zheng [1]
Affiliations
[1] Xihua Univ, Ctr Radio Adm & Technol Dev, Chengdu, Peoples R China
Keywords
Voice activity detection; k-means clustering; left-right hidden Markov model; low signal-to-noise ratio
DOI
10.4028/www.scientific.net/AMM.411-414.743
Chinese Library Classification (CLC) number
TH [Machinery and instrument industry]
Discipline classification code
0802
Abstract
Voice activity detection (VAD) is increasingly essential in noisy environments for accurate speech recognition. In this paper, we propose a method based on a left-right hidden Markov model (HMM) to identify the start and end of speech. Instead of the existing two-state approach, the method builds two models, one for non-speech and one for speech, each of which may include several states. We also analyze other features of speech and non-speech, such as pitch index, pitch magnitude, and fractal dimension. We compare the VAD results of the proposed algorithm with those of a two-state HMM. Experiments show that the proposed method performs better than the two-state HMM in VAD, especially in low signal-to-noise ratio (SNR) environments.
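As a rough illustration of the idea in the abstract, the sketch below decodes a frame sequence with a four-state left-right HMM using the Viterbi algorithm. Several details are assumptions made for illustration and are not taken from the paper: the four states are split as two non-speech states (0, 1) followed by two speech states (2, 3), each state either stays or advances to the next one (the last state wraps back to the first), emissions are modelled as a single Gaussian over one scalar feature per frame (for example log-energy), and the function names and the `stay` parameter are hypothetical.

```python
import math

NEG_INF = float("-inf")


def left_right_log_transitions(stay=0.9):
    """Log transition matrix for 4 states (assumed layout):
    0, 1 = non-speech model; 2, 3 = speech model.
    Each state stays put or advances to the next state; state 3
    advances back to state 0, so the two models alternate."""
    n = 4
    log_a = [[NEG_INF] * n for _ in range(n)]
    for i in range(n):
        log_a[i][i] = math.log(stay)
        log_a[i][(i + 1) % n] = math.log(1.0 - stay)
    return log_a


def log_gaussian(x, mean, var):
    """Log density of a 1-D Gaussian (assumed emission model)."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)


def viterbi(log_b, log_a, log_pi):
    """Most likely state path, computed in the log domain."""
    n = len(log_pi)
    delta = [log_pi[j] + log_b[0][j] for j in range(n)]
    back = []
    for t in range(1, len(log_b)):
        # Best predecessor for each destination state j.
        prev = [max(range(n), key=lambda i: delta[i] + log_a[i][j])
                for j in range(n)]
        delta = [delta[prev[j]] + log_a[prev[j]][j] + log_b[t][j]
                 for j in range(n)]
        back.append(prev)
    # Backtrack from the best final state.
    state = max(range(n), key=lambda j: delta[j])
    path = [state]
    for prev in reversed(back):
        state = prev[state]
        path.append(state)
    path.reverse()
    return path


def detect_speech(features, means, variances, stay=0.9):
    """Return one boolean per frame: True where the decoded state
    belongs to the (assumed) speech model, i.e. state 2 or 3."""
    log_a = left_right_log_transitions(stay)
    # Assumption: the recording starts in the first non-speech state.
    log_pi = [0.0, NEG_INF, NEG_INF, NEG_INF]
    log_b = [[log_gaussian(x, m, v) for m, v in zip(means, variances)]
             for x in features]
    return [s >= 2 for s in viterbi(log_b, log_a, log_pi)]
```

A call such as `detect_speech(features, [0.0, 0.0, 5.0, 5.0], [1.0] * 4)` returns a per-frame speech mask; the first and last `True` of each run mark the estimated start and end of speech. In practice the per-state means and variances would be estimated from training data (the keywords mention k-means clustering, which could serve for initialization), and a multi-dimensional feature vector would replace the scalar feature used here.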
Pages: 743-748
Number of pages: 6
Related papers
50 in total
  • [31] ADA-VAD: UNPAIRED ADVERSARIAL DOMAIN ADAPTATION FOR NOISE-ROBUST VOICE ACTIVITY DETECTION
    Kim, Taesoo
    Chang, Jiho
    Ko, Jong Hwan
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7327 - 7331
  • [32] CNMF-BASED ACOUSTIC FEATURES FOR NOISE-ROBUST ASR
    Vaz, Colin
    Dimitriadis, Dimitrios
    Thomas, Samuel
    Narayanan, Shrikanth
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5735 - 5739
  • [33] Noise-robust speech recognition based on difference of power spectrum
    Xu, JF
    Wei, G
    ELECTRONICS LETTERS, 2000, 36 (14) : 1247 - 1248
  • [34] A voice activity detector employing soft decision based noise spectrum adaptation
    Sohn, J
    Sung, W
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 365 - 368
  • [35] NOISE-ROBUST VOICE CONVERSION USING A SMALL PARALLEL DATA BASED ON NON-NEGATIVE MATRIX FACTORIZATION
    Aihara, Ryo
    Fujii, Takao
    Nakashika, Toru
    Takiguchi, Tetsuya
    Ariki, Yasuo
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 315 - 319
  • [36] Noise-Robust Speech Signals Processing for the Voice Control System Based on the Complementary Ensemble Empirical Mode Decomposition
    Kazanferovich, Alimuradov Alan
    Pavlovich, Churakov Pyotr
    2015 INTERNATIONAL SIBERIAN CONFERENCE ON CONTROL AND COMMUNICATIONS (SIBCON), 2015,
  • [37] Noise-Robust Voice Conversion Based on Sparse Spectral Mapping Using Non-negative Matrix Factorization
    Aihara, Ryo
    Takashima, Ryoichi
    Takiguchi, Tetsuya
    Ariki, Yasuo
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (06) : 1411 - 1418
  • [38] A Semi-Continuous State-Transition Probability HMM-Based Voice Activity Detector
    Othman, H.
    Aboulnasr, T.
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2007, 2007 (1)
  • [39] Noise-Robust Speech Recognition Based on RBF Neural Network
    Hou, Xuemei
    HIGH PERFORMANCE STRUCTURES AND MATERIALS ENGINEERING, PTS 1 AND 2, 2011, 217-218 : 413 - 418
  • [40] Noise-Robust Speaker Recognition Based on Morphological Component Analysis
    He, Yongjun
    Chen, Chen
    Han, Jiqing
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3001 - 3005