Evaluating Source Separation Algorithms With Reverberant Speech

Cited by: 21
Authors
Mandel, Michael I. [1 ]
Bressler, Scott [2 ]
Shinn-Cunningham, Barbara [2 ]
Ellis, Daniel P. W. [3 ]
Affiliations
[1] Univ Montreal, Dept Informat & Rech Operat, Montreal, PQ H3C 3J7, Canada
[2] Boston Univ, Dept Cognit & Neural Syst, Boston, MA 02215 USA
[3] Columbia Univ, Dept Elect Engn, New York, NY 10027 USA
Funding
National Science Foundation (USA);
Keywords
Intelligibility; objective evaluation; reverberation; speech enhancement; time-frequency masking; underdetermined source separation; RECOGNITION; MASKING; PERCEPTION; MIXTURES;
DOI
10.1109/TASL.2010.2052252
Chinese Library Classification
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
This paper examines the performance of several source separation systems on a speech separation task for which human intelligibility has previously been measured. For anechoic mixtures, automatic speech recognition (ASR) performance on the separated signals is quite similar to human performance. In reverberation, however, while signal separation has some benefit for ASR, the results are still far below those of human listeners facing the same task. Performing this same experiment with a number of oracle masks created with a priori knowledge of the separated sources motivates a new objective measure of separation performance, the Direct-path, Early echo, and Reverberation, of the Target and Masker (DERTM), which is closely related to the ASR results. This measure indicates that while the non-oracle algorithms successfully reject the direct-path signal from the masking source, they reject less of its reverberation, explaining the disappointing ASR performance.
Pages: 1872-1883
Page count: 12
Related papers
50 in total
  • [41] GLMSNET: SINGLE CHANNEL SPEECH SEPARATION FRAMEWORK IN NOISY AND REVERBERANT ENVIRONMENTS
    Shi, Huiyu
    Chen, Xi
    Kong, Tianlong
    Yin, Shouyi
    Ouyang, Peng
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 663 - 670
  • [42] Improved Speech Source Localization in Reverberant Environments Based on Correlation Dimension
    Wan, Xinwang
    Wu, Zhenyang
    2009 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP 2009), 2009, : 1540 - 1543
  • [43] Speech Source Tracking Based on Distributed Particle Filter in Reverberant Environments
    Wang, Ruifang
    Lan, Xiaoyu
    ADVANCED HYBRID INFORMATION PROCESSING, ADHIP 2019, PT II, 2019, 302 : 330 - 342
  • [44] Identification of speech source coupling between sensors in reverberant noisy environments
    Cohen, I
    IEEE SIGNAL PROCESSING LETTERS, 2004, 11 (07) : 613 - 616
  • [45] Time difference of arrival estimation of speech source in a noisy and reverberant environment
    Dvorkind, TG
    Gannot, S
    SIGNAL PROCESSING, 2005, 85 (01) : 177 - 204
  • [46] Speech intelligibility improvement using convolutive blind source separation assisted by denoising algorithms
    Kocinski, Jedrzej
    SPEECH COMMUNICATION, 2008, 50 (01) : 29 - 37
  • [47] On the stability of source separation algorithms
    Cardoso, JF
    NEURAL NETWORKS FOR SIGNAL PROCESSING VIII, 1998, : 13 - 22
  • [48] On the Stability of Source Separation Algorithms
    Jean-François Cardoso
    Journal of VLSI signal processing systems for signal, image and video technology, 2000, 26 : 7 - 14
  • [49] On the stability of source separation algorithms
    Cardoso, JF
    JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2000, 26 (1-2): : 7 - 14
  • [50] CORRELATION OF SPEECH-INTELLIGIBILITY TESTS IN REVERBERANT ROOMS WITH 3 PREDICTIVE ALGORITHMS
    JACOB, KD
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 1989, 37 (12): : 1020 - 1030