FILTERED NOISE SHAPING FOR TIME DOMAIN ROOM IMPULSE RESPONSE ESTIMATION FROM REVERBERANT SPEECH

被引:16
|
作者
Steinmetz, Christian J. [1 ,2 ]
Ithapu, Vamsi Krishna [2 ]
Calamia, Paul [2 ]
机构
[1] Queen Mary Univ London, Ctr Digital Mus, London, England
[2] Facebook Real Labs Res, Redmond, WA USA
关键词
Room impulse response; acoustic matching; reverberation; synthesis; blind estimation;
D O I
10.1109/WASPAA52581.2021.9632680
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Deep learning approaches have emerged that aim to transform an audio signal so that it sounds as if it was recorded in the same room as a reference recording, with applications both in audio post-production and augmented reality. In this work, we propose FiNS, a Filtered Noise Shaping network that directly estimates the time domain room impulse response (RIR) from reverberant speech. Our domain-inspired architecture features a time domain encoder and a filtered noise shaping decoder that models the RIR as a summation of decaying filtered noise signals, along with direct sound and early reflection components. Previous methods for acoustic matching utilize either large models to transform audio to match the target room or predict parameters for algorithmic reverberators. Instead, blind estimation of the RIR enables efficient and realistic transformation with a single convolution. An evaluation demonstrates our model not only synthesizes RIRs that match parameters of the target room, such as the T-60 and DRR, but also more accurately reproduces perceptual characteristics of the target room, as shown in a listening test when compared to deep learning baselines.
引用
收藏
页码:221 / 225
页数:5
相关论文
共 50 条
  • [41] Estimation of impulse response length to compute room acoustical criteria
    Faiget, L
    Ruiz, R
    Legros, C
    ACUSTICA, 1996, 82 : S148 - S148
  • [42] YET ANOTHER GENERATIVE MODEL FOR ROOM IMPULSE RESPONSE ESTIMATION
    Lee, Sungho
    Choi, Hyeong-Seok
    Lee, Kyogu
    2023 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, WASPAA, 2023,
  • [43] Spectral hybrid M sequences for room impulse response estimation
    Paulo, JP
    Coelho, JB
    Martins, CR
    7TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL X, PROCEEDINGS: SIGNALS PROCESSING AND OPTICAL SYSTEMS, TECHNOLOGIES AND APPLICATIONS, 2003, : 96 - 100
  • [44] Time course of a perceptual enhancement effect for noise-masked speech in reverberant environments
    Brandewie, Eugene
    Zahorik, Pavel
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 134 (02): : EL265 - EL270
  • [45] Bayesian regularization and nonnegative deconvolution for room impulse response estimation
    Lin, YQ
    Lee, DD
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (03) : 839 - 847
  • [46] A non-linear technique for room impulse response estimation
    Collins, T
    DAFX-03: 6TH INTERNATIONAL CONFERENCE ON DIGITAL AUDIO EFFECTS, PROCEEDINGS, 2003, : 192 - 197
  • [47] Room Impulse Response Estimation with Kernel-Based Regularization
    Fujimoto, Yusuke
    Abe, Fumika
    Nagahara, Masaaki
    2019 58TH ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS OF JAPAN (SICE), 2019, : 528 - 531
  • [48] Sound field control in a reverberant room using the Finite Difference Time Domain method
    Antonello, Niccole
    De Sena, Enzo
    Moonen, Marc
    Naylor, Patrick A.
    van Waterschoot, Toon
    60TH AES INTERNATIONAL CONFERENCE ON DREAMS (DEREVERBERATION AND REVERBERATION OF AUDIO, MUSIC, AND SPEECH), 2016,
  • [49] ESTIMATION OF IMPULSE NOISE FROM CUMULATIVE TIME DISTRIBUTIONS WITH A NEW SOUND PRESSURE TIME ANALYZER
    ERLANDSSON, B
    HAKANSON, H
    IVARSSON, A
    KARLSSON, E
    NILSSON, P
    SCANDINAVIAN AUDIOLOGY, 1980, : 33 - 39
  • [50] Comparison of Noise Compensation Methods for Room Acoustic Impulse Response Evaluations
    Guski, M.
    Vorlaender, M.
    ACTA ACUSTICA UNITED WITH ACUSTICA, 2014, 100 (02) : 320 - 327