Speech Intelligibility of Microphone Arrays in Reverberant Environments with Interference

被引:0
|
作者
Ideli, Elham [1 ]
Vaughan, Rodney G. [1 ]
Bajic, Ivan, V [1 ]
机构
[1] Simon Fraser Univ, Sch Engn Sci, Burnaby, BC, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Microphone array; Beamforming technique; Speech intelligibility; NOISE;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
It is known that speech intelligibility degrades with additive noise and reverberation, and that quantitative parameters such as fidelity and signal-to-noise ratio can be improved by using microphone arrays with various beamforming algorithms. However, it is not clear how the array configuration impacts the intelligibility of speech. Numerical experiments, using widely used models, provide the most convenient comparison, and the approach allows rapid assessment of parameters such as the array configuration, the number and spacing of the elements, and modeled features such as room reflection coefficients. For a typical reverberant room with a single wanted source and two unwanted sources (interferers), we compare the performance of two ceiling-mounted configurations - the uniform linear array (ULA) and a uniform circular array (UCA). The microphones are taken as omnidirectional and equispaced along the array loci, and we use a standard gain-constrained power minimization beamformer. In this study, a limiting performance is presented by emphasizing the early reflections over the late ones for the prior steering vector. Under this steering vector condition, for the same number of elements, the UCA easily outperforms the ULA on known quality and intelligibility metrics. For both arrays in this room scenario, all the metrics increase with an increasing number of microphones, although for one intelligibility metric, diminishing returns set in at about 12 microphones.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Microphone arrays for improving speech intelligibility in a reverberant and/or noisy field
    NTT Human Interface Lab
    NTT R&D, 1 (65-70):
  • [2] MICROPHONE ARRAYS FOR IMPROVING SPEECH-INTELLIGIBILITY IN A REVERBERANT OR NOISY SPACE
    NOMURA, H
    MIYATA, H
    HOUTGAST, T
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 1993, 41 (10): : 771 - 781
  • [3] Subband parameter optimization of microphone arrays for speech recognition in reverberant environments
    Seltzer, ML
    Stern, RM
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 408 - 411
  • [4] Chinese speech intelligibility of children in noisy and reverberant environments
    Peng, Jianxin
    Wu, Shengju
    INDOOR AND BUILT ENVIRONMENT, 2018, 27 (10) : 1357 - 1363
  • [5] LOUDSPEAKER ARRAYS FOR IMPROVING SPEECH-INTELLIGIBILITY IN A REVERBERANT SPACE
    NOMURA, H
    TOHYAMA, M
    HOUTGAST, T
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 1991, 39 (05): : 338 - 343
  • [6] Evaluation of microphone arrays for enhancing noisy and reverberant speech for coding
    Li, Z
    Hoffman, MW
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (01): : 91 - 95
  • [7] GEOMETRY CALIBRATION OF MULTIPLE MICROPHONE ARRAYS IN HIGHLY REVERBERANT ENVIRONMENTS
    Plinge, Axel
    Fink, Gernot A.
    2014 14TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2014, : 243 - 247
  • [8] Binary Mask Estimation for Improved Speech Intelligibility in Reverberant Environments
    Hazrati, Oldooz
    Lee, Jaewook
    Loizou, Philipos
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 162 - 165
  • [9] Intelligibility Enhancement of Casual Speech for Reverberant Environments inspired by Clear Speech Properties
    Koutsogiannaki, Maria
    Petkov, Petko N.
    Stylianou, Yannis
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 65 - 69
  • [10] Effects of urgent speech and preceding sounds on speech intelligibility in noisy and reverberant environments
    Hodoshima, Nao
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1696 - 1699