Environmental conditions and acoustic transduction in hands-free speech recognition

Cited by: 54
Authors
Omologo, M [1]
Svaizer, P [1]
Matassoni, M [1]
Affiliations
[1] Ist Ric Sci & Tecnol, I-38050 Trento, Italy
Keywords
hands-free speech recognition; robustness; environmental noise; microphone arrays; acoustics; MAP adaptation
DOI
10.1016/S0167-6393(98)00030-2
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Hands-free interaction is key to increasing the flexibility of present applications and to developing new speech recognition applications in which the user cannot be encumbered by either hand-held or head-mounted microphones. When the microphone is far from the speaker, the transduced signal is affected by degradations of different kinds, which are often unpredictable. Special microphones and multi-microphone acquisition systems offer a way of reducing some environmental noise effects. Robust processing and adaptation techniques can further be used to compensate for the different kinds of variability that may be present in the recognizer input. The purpose of this paper is to revisit some of the assumptions about the different sources of this variability and to discuss both special transducer systems and the compensation/adaptation techniques that can be adopted. In particular, the paper refers to the use of multi-microphone systems to overcome some undesired effects caused by room acoustics (e.g. reverberation) and by coherent/incoherent noise (e.g. competing talkers, computer fans). The paper concludes with a description of some experiments that were conducted on both real and simulated speech data. (C) 1998 Elsevier Science B.V. All rights reserved.
Pages: 75-95
Page count: 21