Self-Supervised Learning and Multi-Task Pre-Training Based Single-Channel Acoustic Denoising

Cited by: 0

Authors:

Li, Yi [1]

Sun, Yang [2]

Naqvi, Syed Mohsen [1]

Affiliations:

[1] Newcastle Univ, Sch Engn, Intelligent Sensing & Commun Grp, Newcastle Upon Tyne NE1 7RU, England

[2] Univ Oxford, Big Data Inst, Oxford OX3 7LF, England

Source:

2022 IEEE INTERNATIONAL CONFERENCE ON MULTISENSOR FUSION AND INTEGRATION FOR INTELLIGENT SYSTEMS (MFI) | 2022

Keywords:

MONAURAL SOURCE SEPARATION; SPEECH; ENVIRONMENTS

DOI:

10.1109/MFI55806.2022.9913855

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

In self-supervised learning-based single-channel speech denoising, it is challenging to reduce the gap between the denoising performance on the estimated and target speech signals with existing pre-tasks. In this paper, we propose a multi-task pre-training method to improve speech denoising performance within self-supervised learning. In the proposed pre-training autoencoder (PAE), only a very limited set of unpaired and unseen clean speech signals is required to learn speech latent representations. Meanwhile, to overcome the limitation of relying on a single existing pre-task, the proposed masking module exploits the dereverberated mask and the estimated ratio mask to denoise the mixture as the new pre-task. The downstream task autoencoder (DAE) utilizes unlabeled and unseen reverberant mixtures to generate the estimated mixtures. The DAE is trained to share a latent representation with the clean examples from the representation learned in the PAE. Experimental results on a benchmark dataset demonstrate that the proposed method outperforms state-of-the-art approaches.

Pages: 5
