Evaluating Source Separation Algorithms With Reverberant Speech

被引:21
|
作者
Mandel, Michael I. [1 ]
Bressler, Scott [2 ]
Shinn-Cunningham, Barbara [2 ]
Ellis, Daniel P. W. [3 ]
机构
[1] Univ Montreal, Dept Informat & Rech Operat, Montreal, PQ H3C 3J7, Canada
[2] Boston Univ, Dept Cognit & Neural Syst, Boston, MA 02215 USA
[3] Columbia Univ, Dept Elect Engn, New York, NY 10027 USA
基金
美国国家科学基金会;
关键词
Intelligibility; objective evaluation; reverberation; speech enhancement; time-frequency masking; underdetermined source separation; RECOGNITION; MASKING; PERCEPTION; MIXTURES;
D O I
10.1109/TASL.2010.2052252
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper examines the performance of several source separation systems on a speech separation task for which human intelligibility has previously been measured. For anechoic mixtures, automatic speech recognition (ASR) performance on the separated signals is quite similar to human performance. In reverberation, however, while signal separation has some benefit for ASR, the results are still far below those of human listeners facing the same task. Performing this same experiment with a number of oracle masks created with a priori knowledge of the separated sources motivates a new objective measure of separation performance, the Direct-path, Early echo, and Reverberation, of the Target and Masker (DERTM), which is closely related to the ASR results. This measure indicates that while the non-oracle algorithms successfully reject the direct-path signal from the masking source, they reject less of its reverberation, explaining the disappointing ASR performance.
引用
收藏
页码:1872 / 1883
页数:12
相关论文
共 50 条
  • [31] Performance analysis of dynamic acoustic source separation in reverberant rooms
    Talantzis, Fotios
    Ward, Darren B.
    Naylor, Patrick A.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (04): : 1378 - 1390
  • [32] Dynamic Precedence Effect Modeling for Source Separation in Reverberant Environments
    Hummersone, Christopher
    Mason, Russell
    Brookes, Tim
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07): : 1867 - 1871
  • [33] Multi-branch Learning for Noisy and Reverberant Monaural Speech Separation
    Ma, Chao
    Li, Dongmei
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1247 - 1251
  • [34] A Performance Evaluation of Several Deep Neural Networks for Reverberant Speech Separation
    Liu, Qingju
    Wang, Wenwu
    Jackson, Philip J. B.
    Safavi, Saeid
    2018 CONFERENCE RECORD OF 52ND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2018, : 689 - 693
  • [35] Improvement of "SAFIA" source separation method under reverberant conditions
    Aoki, M
    Furuya, KI
    Kataoka, A
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2006, 89 (03): : 22 - 37
  • [36] A Comparison of Computational Precedence Models for Source Separation in Reverberant Environments
    Hummersone, Christopher
    Mason, Russell
    Brookes, Tim
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2013, 61 (7-8): : 508 - 520
  • [37] Features for Masking-Based Monaural Speech Separation in Reverberant Conditions
    Delfarah, Masood
    Wang, DeLiang
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (05) : 1085 - 1094
  • [38] Exploring permutation inconsistency in blind separation of speech signals in a reverberant environment
    Ikram, MZ
    Morgan, DR
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1041 - 1044
  • [39] Interference Reduction in Reverberant Speech Separation With Visual Voice Activity Detection
    Liu, Qingju
    Aubrey, Andrew J.
    Wang, Wenwu
    IEEE TRANSACTIONS ON MULTIMEDIA, 2014, 16 (06) : 1610 - 1623
  • [40] Determined Reverberant Blind Source Separation of Audio Mixing Signals
    Yang, Senquan
    Ding, Fan
    Liu, Jianjun
    Li, Pu
    Hu, Songxi
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 36 (03): : 3309 - 3323