A robust, real-time endpoint detector with energy normalization for ASR in adverse environments

被引:0
|
作者
Li, Q [1 ]
Zheng, JS [1 ]
Zhou, QR [1 ]
Lee, CH [1 ]
机构
[1] Bell Labs, Lucent Technol, Multimedia Commun Res Lab, Murray Hill, NJ 07974 USA
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
When automatic speech recognition (ASR) is applied to hands-free or other adverse acoustic environments, endpoint detection and energy normalization can be crucial to the entire system. In low signal-to-noise (SNR) situations, conventional approaches of endpointing and energy normalization often fail and ASR performances usually degrade dramatically. The goal of this paper is to find a fast, accurate, and robust endpointing algorithm for real-time ASR. We propose a novel approach of using a special filter plus a 3-state decision logic for endpoint detection. The filter has been designed under several criteria to ensure the accuracy and robustness of detection. The detected endpoints are then applied to energy normalization simultaneously. Evaluation results show that the proposed algorithm significantly reduce the string error rates on 7 out of 12 tested databases. The reduction rates even exceeded 50% on two of them. The algorithm only uses one-dimensional energy with 24-frame lookahead; therefore, it has a low complexity and is suitable for real-time ASR.
引用
收藏
页码:233 / 236
页数:4
相关论文
共 50 条
  • [1] Robust endpoint detection and energy normalization for real-time speech and speaker recognition
    Li, Q
    Zheng, JS
    Tsai, A
    Zhou, QR
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (03): : 146 - 157
  • [2] A robust algorithm for real-time endpoint detection in the noisy mobile environments
    Wu, B
    Ren, XL
    Liu, CQ
    Zhang, YX
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2003, 12 (04) : 579 - 582
  • [3] A robust real-time endpoint detection algorithm
    Zhang, Y
    Elison, J
    Yfantis, EA
    [J]. PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS, 2000, : 58 - 63
  • [4] A robust, real-time ellipse detector
    Zhang, SC
    Liu, ZQ
    [J]. PATTERN RECOGNITION, 2005, 38 (02) : 273 - 287
  • [5] Robust speech detection and segmentation for real-time ASR applications
    Shafran, I
    Rose, R
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 432 - 435
  • [6] Robust real-time face tracker for cluttered environments
    Anderson, K
    McOwan, PW
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2004, 95 (02) : 184 - 200
  • [7] Real-Time ASR from Meetings
    Garner, Philip N.
    Dines, John
    Hain, Thomas
    El Hannani, Asmaa
    Karafiar, Martin
    Korchagin, Danil
    Lincoln, Mike
    Wan, Vincent
    Zhang, Le
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2067 - +
  • [8] A Robust, Real-time Ground Change Detector for a "Smart" Walker
    Weiss, Viviana
    Cloix, Severine
    Bologna, Guido
    Hasler, David
    Pun, Thierry
    [J]. PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 2, 2014, : 305 - 312
  • [9] Robust Real-Time Fire Detector Using CNN And LSTM
    Abdali, Al Maamoon Rasool
    Ghani, Rana Fareed
    [J]. 2019 17TH IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED), 2019, : 204 - 207
  • [10] Robust real-time stereo matching system for indoor environments
    Chang, Jiho
    Jeong, Jae-Chan
    Choi, Seungmin
    [J]. 2015 12TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2015, : 102 - 102