DNN-Based Frequency Component Prediction for Frequency-Domain Audio Source Separation

被引:0
|
作者
Watanabe, Rui [1 ]
Kitamura, Daichi [1 ]
Saruwatari, Hiroshi [2 ]
Takahashi, Yu [3 ]
Kondo, Kazunobu [3 ]
机构
[1] Natl Inst Technol, Kagawa Coll, Takamatsu, Kagawa 7618058, Japan
[2] Univ Tokyo, Tokyo 1138656, Japan
[3] Yamaha Corp, Shizuoka 4308650, Japan
关键词
audio source separation; deep neural networks; frequency component prediction; NONNEGATIVE MATRIX FACTORIZATION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Multichannel audio source separation (MASS) plays an important role in various audio applications. Frequency-domain MASS algorithms such as multichannel nonnegative matrix factorization achieve better separation quality. However, they require a considerable computational cost for estimating the frequency-wise separation filter. To solve this problem, we propose a new framework combining the MASS algorithms and a simple deep neural network (DNN). In the proposed framework, frequency-domain MASS is performed only in narrowband frequency bins. Then, DNN predicts the separated source components in other frequency bins, where both the observed mixture of all frequency bins and the separated narrowband source components are used as DNN inputs. Our experimental results show the validity of the proposed MASS framework in terms of computational efficiency.
引用
收藏
页码:805 / 809
页数:5
相关论文
共 50 条
  • [1] DNN-Based Frequency-Domain Permutation Solver for Multichannel Audio Source Separation
    Hasuike, Fumiya
    Kitamura, Daichi
    Watanabe, Rui
    [J]. PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 871 - 876
  • [2] DNN-Based Permutation Solver for Frequency-Domain Independent Component Analysis in Two-Source Mixture Case
    Yamaji, Shuhei
    Kitamura, Daichi
    [J]. 2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 781 - 787
  • [3] AUDIO SOURCE SEPARATION BASED ON CONVOLUTIVE TRANSFER FUNCTION AND FREQUENCY-DOMAIN LASSO OPTIMIZATION
    Li, Xiaofei
    Girin, Laurent
    Horaud, Radu
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 541 - 545
  • [4] Wide-Band Audio Coding Based on Frequency-Domain Linear Prediction
    Motlicek, Petr
    Ganapathy, Sriram
    Hermansky, Hynek
    Garudadri, Harinath
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2010,
  • [5] Wide-Band Audio Coding Based on Frequency-Domain Linear Prediction
    Petr Motlicek
    Sriram Ganapathy
    Hynek Hermansky
    Harinath Garudadri
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2010
  • [6] Equivalence between frequency-domain blind source separation and frequency-domain adaptive beamforming for convolutive mixtures
    Araki, S
    Makino, S
    Hinamoto, Y
    Mukai, R
    Nishikawa, T
    Saruwatari, H
    [J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2003, 2003 (11) : 1157 - 1166
  • [7] Equivalence between Frequency-Domain Blind Source Separation and Frequency-Domain Adaptive Beamforming for Convolutive Mixtures
    Shoko Araki
    Shoji Makino
    Yoichi Hinamoto
    Ryo Mukai
    Tsuyoki Nishikawa
    Hiroshi Saruwatari
    [J]. EURASIP Journal on Advances in Signal Processing, 2003
  • [8] A DNN-based Post Filter for Geometric Source Separation
    Chen, Chenghao
    Zhou, Yi
    Liu, Hongqing
    [J]. 2018 INTERNATIONAL SEMINAR ON COMPUTER SCIENCE AND ENGINEERING TECHNOLOGY (SCSET 2018), 2019, 1176
  • [9] Indeterminacy Free Frequency-Domain Blind Separation of Reverberant Audio Sources
    Di Persia, Leandro
    Milone, Diego
    Yanagida, Masuzo
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (02): : 299 - 311
  • [10] On calculating the inverse of separation matrix in frequency-domain blind source separation
    Sawada, H
    Araki, S
    Mukai, R
    Makino, S
    [J]. INDEPENDENT COMPONENT ANALYSIS AND BLIND SIGNAL SEPARATION, PROCEEDINGS, 2006, 3889 : 691 - 699