Noise-Robust Voice Activity Detector Based On Four States-Based HMM

被引:1
|
作者
Zhou, Bin [1 ]
Liu, Jing [1 ]
Pei, Zheng [1 ]
机构
[1] Xihua Univ, Ctr Radio Adm & Technol Dev, Chengdu, Peoples R China
关键词
Voice activity detection; k-means clustering; left-right hidden Markov model; low signal-to-noise ratio;
D O I
10.4028/www.scientific.net/AMM.411-414.743
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
Voice activity detection (VAD) is more and more essential in the noisy environments to provide an accuracy performance in the speech recognition. In this paper, we provide a method based on left-right hidden Markov model (HMM) to identify the start and end of the speech. The method builds two models of non-speech and speech instead of existed two states, formally, each model could include several states, we also analysis other features, such as pitch index, pitch magnitude and fractal dimension of speech and non-speech.. We compare the VAD results with the proposed algorithm and two states HMM. Experiments show that the proposed method make a better performance than two state HMMs in VAD, especially in the low signal-to-noise ratio (SNR) environment.
引用
收藏
页码:743 / 748
页数:6
相关论文
共 50 条
  • [1] An improved noise-robust voice activity detector based on hidden semi-Markov models
    Liang, Yuan
    Liu, Xianglong
    Lou, Yihua
    Shan, Baosong
    PATTERN RECOGNITION LETTERS, 2011, 32 (07) : 1044 - 1053
  • [2] HMM-based noise-robust feature compensation
    Sasou, Akira
    Asano, Futoshi
    Nakamura, Satoshi
    Tanaka, Kazuyo
    SPEECH COMMUNICATION, 2006, 48 (09) : 1100 - 1111
  • [3] A New Method Based on HMMs and K-means Algorithms for Noise-Robust Voice Activity Detector
    Luo Bing
    Pei Zheng
    Xu Li
    Hu Deli
    MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION IV, PTS 1 AND 2, 2012, 128-129 : 461 - 464
  • [4] Noise-robust voice conversion based on joint dictionary optimization
    ZHANG Shilei
    JIAN Zhihua
    SUN Minhong
    ZHONG Hua
    LIU Erxiao
    Chinese Journal of Acoustics, 2020, 39 (02) : 259 - 272
  • [5] On training targets for noise-robust voice activity detection
    Braun, Sebastian
    Tashev, Ivan
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 421 - 425
  • [6] A Novel Model Characteristics for Noise-Robust Automatic Speech Recognition Based on HMM
    Rafieee, M. Saadeq
    Khazaei, Ali Akbar
    2010 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND INFORMATION SECURITY (WCNIS), VOL 2, 2010, : 215 - 218
  • [7] HMM-Based Top-Down Attention Tor Noise-Robust Speech Recognition
    Lee, Chang-Hoon
    Lee, Soo-Young
    ADVANCES IN COGNITIVE NEURODYNAMICS, PROCEEDINGS, 2008, : 731 - +
  • [8] Robust voice activity detection based on noise eigenspace
    Ying, Dongwen
    Shi, Yu
    Lu, Xugang
    Dang, Jianwu
    Soong, Frank
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2007, 28 (06) : 413 - 423
  • [9] Noise-Robust Hearing Aid Voice Control
    Lopez-Espejo, Ivan
    Rosello, Eros
    Edraki, Amin
    Harte, Naomi
    Jensen, Jesper
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 241 - 245
  • [10] SPARSE POWER SPECTRUM BASED ROBUST VOICE ACTIVITY DETECTOR
    You, Datao
    Han, Jiqing
    Zheng, Guibin
    Zheng, Tieran
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 289 - 292