Robust Speech Detection for Noisy Environments

Cited by: 5
Authors
Varela, Oscar [1]
Indra, S. A. [1]
San-Segundo, Ruben [1]
Hernandez, Luis A. [1]
Affiliation
[1] Univ Politecn Madrid, E-28040 Madrid, Spain
Keywords
RECOGNITION
DOI
10.1109/MAES.2011.6070277
Chinese Library Classification number
V [Aviation, Aerospace]
Discipline classification code
08; 0825
Abstract
This work presents a robust voice activity detector (VAD) based on Hidden Markov Models (HMMs) for stationary and non-stationary noise environments: inside motor vehicles (such as cars or planes) or inside buildings close to high-traffic areas (such as an air traffic control (ATC) tower). In these environments, vehicle motors produce a high level of stationary noise and, additionally, people speaking at some distance from the main speaker can produce non-stationary noise. The VAD presented here is characterized by a new front-end and a noise level adaptation process that significantly increases its robustness across different signal-to-noise ratios (SNRs). The feature vector used by the VAD includes the most relevant Mel Frequency Cepstral Coefficients (MFCCs), normalized log energy, and delta log energy. The proposed VAD has been evaluated and compared with other well-known VADs on three databases covering different noise conditions: speech in clean environments (SNRs > 20 dB), speech recorded in stationary noise environments (inside or close to motor vehicles), and speech in non-stationary environments (including noise from bars, television, and far-field speakers). In all three cases, the proposed VAD obtains the lowest detection error at all SNRs compared with Acero's VAD (the reference for this work [4]) and other well-known VADs such as AMR, AURORA, and G.729 Annex B.
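As a rough illustration of the two energy terms named in the abstract, the sketch below computes a per-frame normalized log energy and delta log energy in plain Python. It is not the authors' front-end: the abstract does not specify how the energy is normalized or how the delta is computed, so peak normalization and a first difference are assumptions, as are the function names and the `frame_len` and `hop` defaults. The MFCC part of the feature vector is omitted.

```python
import math

def frame_log_energy(samples, frame_len=200, hop=80):
    """Natural-log energy of each frame, floored to avoid log(0)."""
    energies = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = samples[start:start + frame_len]
        energy = sum(x * x for x in frame)
        energies.append(math.log(energy + 1e-10))
    return energies

def normalize_log_energy(log_e):
    """Assumed normalization: subtract the utterance peak so the maximum is 0."""
    peak = max(log_e)
    return [e - peak for e in log_e]

def delta(values):
    """Assumed delta: first difference, with 0.0 for the first frame."""
    return [0.0] + [values[i] - values[i - 1] for i in range(1, len(values))]

def energy_features(samples, frame_len=200, hop=80):
    """Return one (normalized log energy, delta log energy) pair per frame."""
    log_e = frame_log_energy(samples, frame_len, hop)
    return list(zip(normalize_log_energy(log_e), delta(log_e)))
```

Calling `energy_features(samples)` on a list of audio samples yields one pair per frame. In a detector of the kind described, such pairs would be concatenated with the selected MFCCs before being passed to the HMM.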
Pages: 16 / U12
Number of pages: 12