Evaluating Source Separation Algorithms With Reverberant Speech

被引:21
|
作者
Mandel, Michael I. [1 ]
Bressler, Scott [2 ]
Shinn-Cunningham, Barbara [2 ]
Ellis, Daniel P. W. [3 ]
机构
[1] Univ Montreal, Dept Informat & Rech Operat, Montreal, PQ H3C 3J7, Canada
[2] Boston Univ, Dept Cognit & Neural Syst, Boston, MA 02215 USA
[3] Columbia Univ, Dept Elect Engn, New York, NY 10027 USA
基金
美国国家科学基金会;
关键词
Intelligibility; objective evaluation; reverberation; speech enhancement; time-frequency masking; underdetermined source separation; RECOGNITION; MASKING; PERCEPTION; MIXTURES;
D O I
10.1109/TASL.2010.2052252
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper examines the performance of several source separation systems on a speech separation task for which human intelligibility has previously been measured. For anechoic mixtures, automatic speech recognition (ASR) performance on the separated signals is quite similar to human performance. In reverberation, however, while signal separation has some benefit for ASR, the results are still far below those of human listeners facing the same task. Performing this same experiment with a number of oracle masks created with a priori knowledge of the separated sources motivates a new objective measure of separation performance, the Direct-path, Early echo, and Reverberation, of the Target and Masker (DERTM), which is closely related to the ASR results. This measure indicates that while the non-oracle algorithms successfully reject the direct-path signal from the masking source, they reject less of its reverberation, explaining the disappointing ASR performance.
引用
收藏
页码:1872 / 1883
页数:12
相关论文
共 50 条
  • [1] Fast convergence speech source separation in reverberant acoustic environment
    Zhao, YX
    Hu, R
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 897 - 900
  • [2] Expected performance of a family of blind source separation algorithms in a reverberant room
    Talantzis, F
    Ward, DB
    Naylor, PA
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PROCEEDINGS: AUDIO AND ELECTROACOUSTICS SIGNAL PROCESSING FOR COMMUNICATIONS, 2004, : 61 - 64
  • [3] A Blind Source Separation Based Approach for Speech Enhancement in Noisy and Reverberant Environment
    Pignotti, Alessio
    Marcozzi, Daniele
    Cifani, Simone
    Squartini, Stefano
    Piazza, Francesco
    CROSS-MODAL ANALYSIS OF SPEECH, GESTURES, GAZE AND FACIAL EXPRESSIONS, 2009, 5641 : 356 - 367
  • [4] CONVOLUTIVE PREDICTION FOR REVERBERANT SPEECH SEPARATION
    Wang, Zhong-Qiu
    Wichern, Gordon
    Le Roux, Jonathan
    2021 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2021, : 56 - 60
  • [5] INTEGRATING BINAURAL CUES AND BLIND SOURCE SEPARATION METHOD FOR SEPARATING REVERBERANT SPEECH MIXTURES
    Alinaghi, Atiyeh
    Wang, Wenwu
    Jackson, Philip J. B.
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 209 - 212
  • [6] Structured Sparsity Models for Reverberant Speech Separation
    Asaei, Afsaneh
    Golbabaee, Mohammad
    Bourlard, Herve
    Cevher, Volkan
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (03) : 620 - 633
  • [7] Humanoid separation of speech sources in reverberant environments
    Schulz, Sylvia
    Herfet, Thorsten
    2008 3RD INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING, VOLS 1-3, 2008, : 377 - 382
  • [8] A study on unsupervised monaural reverberant speech separation
    R. Hemavathi
    R. Kumaraswamy
    International Journal of Speech Technology, 2020, 23 : 451 - 457
  • [9] A study on unsupervised monaural reverberant speech separation
    Hemavathi, R.
    Kumaraswamy, R.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (02) : 451 - 457
  • [10] Synchronizing Speech Mixtures in Speech Separation Problems under Reverberant Conditions
    Llerena, Cosme
    Gil-Pita, Roberto
    Alvarez, Lorena
    Rosa-Zurera, Manuel
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, PT I, 2013, 7894 : 568 - 579