FILTERED NOISE SHAPING FOR TIME DOMAIN ROOM IMPULSE RESPONSE ESTIMATION FROM REVERBERANT SPEECH

被引:16
|
作者
Steinmetz, Christian J. [1 ,2 ]
Ithapu, Vamsi Krishna [2 ]
Calamia, Paul [2 ]
机构
[1] Queen Mary Univ London, Ctr Digital Mus, London, England
[2] Facebook Real Labs Res, Redmond, WA USA
关键词
Room impulse response; acoustic matching; reverberation; synthesis; blind estimation;
D O I
10.1109/WASPAA52581.2021.9632680
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Deep learning approaches have emerged that aim to transform an audio signal so that it sounds as if it was recorded in the same room as a reference recording, with applications both in audio post-production and augmented reality. In this work, we propose FiNS, a Filtered Noise Shaping network that directly estimates the time domain room impulse response (RIR) from reverberant speech. Our domain-inspired architecture features a time domain encoder and a filtered noise shaping decoder that models the RIR as a summation of decaying filtered noise signals, along with direct sound and early reflection components. Previous methods for acoustic matching utilize either large models to transform audio to match the target room or predict parameters for algorithmic reverberators. Instead, blind estimation of the RIR enables efficient and realistic transformation with a single convolution. An evaluation demonstrates our model not only synthesizes RIRs that match parameters of the target room, such as the T-60 and DRR, but also more accurately reproduces perceptual characteristics of the target room, as shown in a listening test when compared to deep learning baselines.
引用
收藏
页码:221 / 225
页数:5
相关论文
共 50 条
  • [31] GEOMETRICAL ROOM GEOMETRY ESTIMATION FROM ROOM IMPULSE RESPONSES
    Rajapaksha, Tilak
    Qiu, Xiaojun
    Cheng, Eva
    Burnett, Ian
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 331 - 335
  • [32] APPLICATION OF WAVELET TRANSFORM IN REDUCTION OF NOISE IN ROOM IMPULSE RESPONSE
    Andrejevic, Milan
    Ciric, Dejan
    2013 21ST TELECOMMUNICATIONS FORUM (TELFOR), 2013, : 773 - +
  • [33] Transient noise influence in MLS measurement of room impulse response
    Ciric, DG
    Milosevic, MA
    ACTA ACUSTICA UNITED WITH ACUSTICA, 2005, 91 (01) : 110 - 120
  • [34] IMPROVED NOISE CHARACTERIZATION FOR RELATIVE IMPULSE RESPONSE ESTIMATION
    Srikrishnan, Tharun Adithya
    Rao, Bhaskar D.
    Giri, Ritwik
    Zhang, Tao
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 411 - 415
  • [35] Time Delay Estimation of Reverberant Meeting Speech: On the Use of Multichannel Linear Prediction
    Cheng, E.
    Burnett, I. S.
    Ritz, C.
    SITIS 2007: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGIES & INTERNET BASED SYSTEMS, 2008, : 531 - 537
  • [36] A robust method for speech signal time-delay estimation in reverberant rooms
    Brandstein, MS
    Silverman, HF
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 375 - 378
  • [37] Reverberation Time Estimation based on a Model for the Power Spectral Density of Reverberant Speech
    Faraji, Neda
    Ahadi, Seyed Mohammad
    Sheikhzadeh, Hamid
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 1453 - 1457
  • [38] Single Snapshot Detection and Estimation of Reflections From Room Impulse Responses in the Spherical Harmonic Domain
    Tervo, Sakari
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (12) : 2466 - 2480
  • [39] EFFECTS ON SPEECH-PERCEPTION OF MODIFYING IMPULSE RESPONSE OF A SMALL ROOM
    BERKLEY, DA
    CURTIS, TH
    ALLEN, JB
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1973, 53 (01): : 301 - &
  • [40] Removal of impulse noise from audio and speech signals
    Rajagopalan, R
    Subramanian, B
    SCS 2003: INTERNATIONAL SYMPOSIUM ON SIGNALS, CIRCUITS AND SYSTEMS, VOLS 1 AND 2, PROCEEDINGS, 2003, : 161 - 163