A robust speech detection algorithm for speech activated hands-free applications

Cited by: 2
Authors
Wu, D [1 ]
Tanaka, M [1 ]
Chen, R [1 ]
Olorenshaw, L [1 ]
Amador, M [1 ]
Menendez-Pidal, X [1 ]
Affiliations
[1] Sony US Res Labs, Spoken Language Technol, San Jose, CA 95134 USA
DOI
10.1109/ICASSP.1999.758424
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
This paper describes a novel, noise-robust speech detection algorithm that operates reliably in severely noisy car conditions. High performance is obtained with three techniques: (1) noise suppression based on principal component analysis as pre-processing, (2) robust endpoint detection using dynamic parameters [1], and (3) speech verification using the periodicity of voiced signals with harmonic enhancement. Compared with nonlinear spectral subtraction, the noise suppression improves the SNR by about 20 dB, which lets the endpoint detection operate reliably at SNRs down to -10 dB. In car environments, road-bump noise is problematic for speech detectors because it causes misdetection errors; speech verification helps remove these errors. This technology is being used in Sony car navigation products.
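The periodicity idea behind technique (3) can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no algorithmic details, so the normalized-autocorrelation measure, the function names `periodicity_score` and `looks_voiced`, the pitch range of 60 to 400 Hz, and the 0.5 threshold are all assumptions chosen for illustration, and the harmonic enhancement step is omitted.

```python
import math
import random


def periodicity_score(frame, sample_rate, f0_min=60.0, f0_max=400.0):
    """Peak normalized autocorrelation over lags in an assumed pitch range.

    Values near 1 suggest a strongly periodic (voiced-like) frame; values
    near 0 suggest aperiodic content such as broadband noise.
    """
    n = len(frame)
    mean = sum(frame) / n
    x = [s - mean for s in frame]  # remove DC offset
    if sum(s * s for s in x) == 0.0:
        return 0.0  # silent or constant frame: no periodicity

    lag_min = max(1, int(sample_rate / f0_max))
    lag_max = min(int(sample_rate / f0_min), n - 1)

    best = 0.0
    for lag in range(lag_min, lag_max + 1):
        head = x[: n - lag]
        tail = x[lag:]
        num = sum(a * b for a, b in zip(head, tail))
        denom = math.sqrt(sum(a * a for a in head) * sum(b * b for b in tail))
        if denom > 0.0:
            best = max(best, num / denom)
    return best


def looks_voiced(frame, sample_rate, threshold=0.5):
    """Hypothetical decision rule: accept frames whose score reaches threshold."""
    return periodicity_score(frame, sample_rate) >= threshold


if __name__ == "__main__":
    sr = 8000
    # A 150 Hz tone stands in for a voiced frame, white noise for a bump.
    tone = [math.sin(2 * math.pi * 150 * i / sr) for i in range(400)]
    random.seed(0)
    noise = [random.gauss(0.0, 1.0) for _ in range(400)]
    print("tone voiced:", looks_voiced(tone, sr))
    print("noise voiced:", looks_voiced(noise, sr))
```

In a detector of this kind, a segment proposed by the endpoint detector would be kept only if enough of its frames pass the periodicity check, which is one plausible way to reject impulsive, non-periodic events such as road-bump noise.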
Pages: 2407-2410
Page count: 4
Related papers
50 in total
  • [31] Improving Hands-Free Speech Rehabilitation in Laryngectomized Patients with a Moldable Adhesive
    Leemans, Maartje
    Longobardi, Ylenia
    Dirven, Richard
    Honings, Jimmie
    D'Alatri, Lucia
    Galli, Jacopo
    van den Brekel, Michiel
    Parrilla, Claudio
    van Sluis, Klaske E.
    [J]. LARYNGOSCOPE, 2023, 133 (11) : 2965 - 2970
  • [32] Development of Hands-free Speech Enhancement System for Both EL-users and Esophageal Speech Users
    Matsunaga, Yuta
    Matsui, Kenji
    Nakatoh, Yoshihisa
    Kato, Yumiko O.
    [J]. DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE, 2018, 620 : 334 - 341
  • [33] Noise-robust hands-free speech recognition based on spatial subtraction array and known noise superimposition
    Ohashi, Y
    Nishikawa, T
    Saruwatari, H
    Lee, A
    Shikano, K
    [J]. 2005 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, 2005 : 533 - 537
  • [34] Hands-free speech recognition and communication on PDAs using microphone array technology
    Herbordt, W
    Horiuchi, T
    Fujimoto, M
    Jitsuhiro, T
    Nakamura, S
    [J]. 2005 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2005 : 302 - 307
  • [35] Achieving a hands-free computer interface using voice recognition and speech synthesis
    Evans, JR
    Tjoland, WA
    Allred, LG
    [J]. IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE, 2000, 15 (01) : 14 - 16
  • [36] Compliance, quality of life and quantitative voice quality aspects of hands-free speech
    Op de Coul, BMR
    Ackerstaff, AH
    Van As-Brooks, CJ
    Van den Hoogen, FJA
    Meeuwis, CA
    Manni, JJ
    Hilgers, FJM
    [J]. ACTA OTO-LARYNGOLOGICA, 2005, 125 (06) : 629 - 637
  • [37] Defeating reverberation: Advanced dereverberation and recognition techniques for hands-free speech recognition
    Delcroix, Marc
    Yoshioka, Takuya
    Ogawa, Atsunori
    Kubo, Yotaro
    Fujimoto, Masakiyo
    Ito, Nobutaka
    Kinoshita, Keisuke
    Espi, Miquel
    Araki, Shoko
    Hori, Takaaki
    Nakatani, Tomohiro
    [J]. 2014 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2014 : 522 - 526
  • [38] Clinical use of a neck brace to improve hands-free speech in laryngectomized patients
    Dirven, Richard
    Kooijman, Piet G. C.
    Wouters, Yannick
    Marres, Henri A. M.
    [J]. LARYNGOSCOPE, 2012, 122 (06) : 1267 - 1272
  • [39] Transforming HMMs for speaker-independent hands-free speech recognition in the car
    Gong, Y
    Godfrey, JJ
    [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999 : 297 - 300
  • [40] Hands-free mobile phone speech while driving degrades coordination and control
    Treffner, PJ
    Barrett, R
    [J]. TRANSPORTATION RESEARCH PART F-TRAFFIC PSYCHOLOGY AND BEHAVIOUR, 2004, 7 (4-5) : 229 - 246