TOWARDS REAL-TIME SINGLE-CHANNEL SINGING-VOICE SEPARATION WITH PRUNED MULTI-SCALED DENSENETS

Authors
Huber, Markus [1, 3]
Schindler, Gunther [2]
Roth, Wolfgang [1]
Froning, Holger [2]
Schorkhuber, Christian [3]
Pernkopf, Franz [1]
Affiliations
[1] Graz Univ Technol, Signal Proc & Speech Commun Lab, Graz, Austria
[2] Heidelberg Univ, Inst Comp Engn, Heidelberg, Germany
[3] Sonible GmbH, Graz, Austria
Keywords
Musical Source Separation; Real-time; Parameterized Structured Pruning; Multi-scaled DenseNet
DOI
10.1109/icassp40776.2020.9053542
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
Modern musical source separation systems based on deep neural networks reach unprecedented levels of separation quality. However, harnessing the power of these large-scale models in typical audio production environments, which frequently offer only limited computing resources while demanding real-time processing, remains challenging. We extend the multi-scaled DenseNet in several aspects to facilitate real-time source separation scenarios. Specifically, we reduce the computational requirements by inferring Mel-scaled masks and decrease the model size via effective use of bottleneck layers, while improving performance using a deep clustering objective. In addition, we are able to further increase the model efficiency by applying parameterized structured pruning of convolutional weights without any significant impact on the separation performance. We significantly reduce the model size and increase the computational efficiency by a factor of 1.6 and 4.3, respectively, while maintaining the separation performance.
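The abstract does not state how a Mel-scaled mask is mapped back to the linear-frequency spectrogram, so the following is only a minimal sketch of one plausible approach, not the authors' method. It assumes a triangular Mel filterbank (HTK-style Mel formula) and expands a mask predicted on a few Mel bands to all STFT bins by a normalized weighted average. The names `mel_filterbank` and `expand_mel_mask`, and the sizes used in the example (40 bands, 1024-point FFT, 44.1 kHz), are hypothetical choices for illustration.

```python
import numpy as np


def hz_to_mel(f):
    # HTK-style Mel formula (an assumption; the abstract names no specific scale).
    return 2595.0 * np.log10(1.0 + np.asarray(f, dtype=float) / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (np.asarray(m, dtype=float) / 2595.0) - 1.0)


def mel_filterbank(n_mels, n_fft, sample_rate):
    """Triangular filters of shape (n_mels, n_fft // 2 + 1), each with peak 1."""
    n_bins = n_fft // 2 + 1
    bin_freqs = np.linspace(0.0, sample_rate / 2.0, n_bins)
    # n_mels + 2 band edges, equally spaced on the Mel axis.
    edges = mel_to_hz(np.linspace(0.0, hz_to_mel(sample_rate / 2.0), n_mels + 2))
    fb = np.zeros((n_mels, n_bins))
    for m in range(n_mels):
        lo, mid, hi = edges[m], edges[m + 1], edges[m + 2]
        rising = (bin_freqs - lo) / (mid - lo)
        falling = (hi - bin_freqs) / (hi - mid)
        fb[m] = np.clip(np.minimum(rising, falling), 0.0, None)
    return fb


def expand_mel_mask(mel_mask, fb):
    """Map a mask of shape (n_mels, n_frames) to (n_bins, n_frames).

    Each linear bin receives the weighted average of the Mel bands that
    cover it. Bins covered by no band (here DC and Nyquist) get a value
    of (approximately) zero.
    """
    weights = fb / np.maximum(fb.sum(axis=0, keepdims=True), 1e-12)
    return weights.T @ mel_mask


# Hypothetical sizes: the network would predict 40 values per frame instead of 513.
fb = mel_filterbank(n_mels=40, n_fft=1024, sample_rate=44100)
mel_mask = np.random.default_rng(0).uniform(0.0, 1.0, size=(40, 8))
full_mask = expand_mel_mask(mel_mask, fb)
print(full_mask.shape)
```

Under these assumptions the model's output dimension per frame shrinks from the number of STFT bins to the number of Mel bands, which is the kind of saving the abstract attributes to inferring Mel-scaled masks. The expanded mask would then be multiplied element-wise with the mixture spectrogram.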
Pages: 806-810
Number of pages: 5