A Comparison of Computational Precedence Models for Source Separation in Reverberant Environments

被引:0
|
作者
Hummersone, Christopher [1 ]
Mason, Russell [1 ]
Brookes, Tim [1 ]
机构
[1] Univ Surrey, Inst Sound Recording, Guildford GU2 5XH, Surrey, England
来源
基金
英国工程与自然科学研究理事会;
关键词
CROSS-CORRELATION MODEL; CONTRALATERAL INHIBITION; BINAURAL LOCALIZATION; SPEECH RECOGNITION; SOUND LOCALIZATION; NOISE; EXTENSION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Reverberation continues to be problematic in many areas of audio and speech processing, including source separation. The precedence effect is an important psychoacoustic tool utilized by humans to assist in localization by suppressing reflections arising from room boundaries. Numerous computational precedence models have been developed over the years and all suggest quite different strategies for handling reverberation. However, relatively little work has been done on incorporating precedence into source separation. This paper details a study comparing several computational precedence models and their impact on the performance of a baseline separation algorithm. The models are tested in a range of reverberant rooms and with a range of other mixture parameters. Large differences in the performance of the models are observed. The results show that a model based on interaural coherence and onset-based inhibition produce the greatest performance gain over the baseline algorithm. The results also show that it may be necessary to adapt the precedence model to the acoustic conditions of the room in order to optimize the performance of the separation algorithm.
引用
收藏
页码:508 / 520
页数:13
相关论文
共 50 条
  • [1] Dynamic Precedence Effect Modeling for Source Separation in Reverberant Environments
    Hummersone, Christopher
    Mason, Russell
    Brookes, Tim
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07): : 1867 - 1871
  • [2] MONAURAL SOURCE SEPARATION: FROM ANECHOIC TO REVERBERANT ENVIRONMENTS
    Cord-Landwehr, Tobias
    Boeddeker, Christoph
    Von Neumann, Thilo
    Zorila, Catalin
    Doddipatla, Rama
    Haeb-Umbach, Reinhold
    [J]. 2022 INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC 2022), 2022,
  • [3] Infinite Sparse Factor Analysis for Blind Source Separation in Reverberant Environments
    Nagira, Kohei
    Otsuka, Takuma
    Okuno, Hiroshi G.
    [J]. STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2012, 7626 : 638 - 647
  • [4] A multichannel learning-based approach for sound source separation in reverberant environments
    You-Siang Chen
    Zi-Jie Lin
    Mingsian R. Bai
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2021
  • [5] A multichannel learning-based approach for sound source separation in reverberant environments
    Chen, You-Siang
    Lin, Zi-Jie
    Bai, Mingsian R.
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2021, 2021 (01)
  • [6] Analysis of source localization in reverberant environments
    Peterson, J. Michael
    Kyriakakis, Chris
    [J]. 2006 IEEE SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING WORKSHOP PROCEEDINGS, VOLS 1 AND 2, 2006, : 672 - +
  • [7] Audio Source Separation in Reverberant Environments Using β-Divergence-Based Nonnegative Factorization
    Fakhry, Mahmoud
    Svaizer, Piergiorgio
    Omologo, Maurizio
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (07) : 1462 - 1476
  • [8] Humanoid separation of speech sources in reverberant environments
    Schulz, Sylvia
    Herfet, Thorsten
    [J]. 2008 3RD INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING, VOLS 1-3, 2008, : 377 - 382
  • [9] Evaluating Source Separation Algorithms With Reverberant Speech
    Mandel, Michael I.
    Bressler, Scott
    Shinn-Cunningham, Barbara
    Ellis, Daniel P. W.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07): : 1872 - 1883
  • [10] Underdetermined Blind Source Separation in Reverberant Environment
    Li, Shuai
    Liu, Hongqing
    Lu, Gan
    Zhou, Yi
    [J]. 2020 12TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2020, : 217 - 221