Meta-SE: A Meta-Learning Framework for Few-Shot Speech Enhancement

被引:1
|
作者
Zhou, Weili [1 ]
Lu, Mingliang [1 ]
Ji, Ruijie [1 ]
机构
[1] Foshan Univ, Sch Elect & Informat Engn, Foshan 528225, Peoples R China
关键词
Task analysis; Speech enhancement; Training; Robots; Noise measurement; Adaptation models; Data models; single-channel; meta-learning; few-shot learning; NOISE-ESTIMATION;
D O I
10.1109/ACCESS.2021.3066609
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Separating target speech from noisy signal is important for many realistic applications. Recently, deep neural network (DNN) has been widely used in speech enhancement (SE) and obtained prominent performance improvements. However, the current deep models require a large amount of training data to obtain a good performance. It is still challenging to construct an effective deep speech enhancement model with actual few training samples. At present, meta-learning has become the research focus of few-shot learning due to its capability of quickly process new tasks with few samples by the prior meta-knowledge, but there are very few works applying meta-learning on few-shot speech enhancement. In this paper, we propose a generic meta-learning framework Meta-SE which applies the U-Net as the meta-learner, to tackle the few-shot speech enhancement problem. Meta-SE is trained and optimized with the changed speech enhancement tasks to obtain meta-knowledge, and towards better capability of fast and good generalizing to the new unseen noises with few training samples. The experiment results show that the proposed method not only outperforms the state-of-the-arts DNN-SE models under the few-shot conditions, but also learns a more general and flexible model for task adaption.
引用
收藏
页码:46068 / 46078
页数:11
相关论文
共 50 条
  • [1] Unsupervised meta-learning for few-shot learning
    Xu, Hui
    Wang, Jiaxing
    Li, Hao
    Ouyang, Deqiang
    Shao, Jie
    [J]. PATTERN RECOGNITION, 2021, 116
  • [2] Few-shot time series forecasting in a meta-learning framework
    [J]. Ma, Ping (1533321767@qq.com), 1600, IOS Press BV (46):
  • [3] Decentralized federated meta-learning framework for few-shot multitask learning
    Li, Xiaoli
    Li, Yuzheng
    Wang, Jining
    Chen, Chuan
    Yang, Liu
    Zheng, Zibin
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (11) : 8490 - 8522
  • [4] Meta-Learning for Few-Shot NMT Adaptation
    Sharaf, Amr
    Hassan, Hany
    Daume, Hal, III
    [J]. NEURAL GENERATION AND TRANSLATION, 2020, : 43 - 53
  • [5] Fair Meta-Learning For Few-Shot Classification
    Zhao, Chen
    Li, Changbin
    Li, Jincheng
    Chen, Feng
    [J]. 11TH IEEE INTERNATIONAL CONFERENCE ON KNOWLEDGE GRAPH (ICKG 2020), 2020, : 275 - 282
  • [6] Task Agnostic Meta-Learning for Few-Shot Learning
    Jamal, Muhammad Abdullah
    Qi, Guo-Jun
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11711 - 11719
  • [7] A META-LEARNING FRAMEWORK FOR FEW-SHOT CLASSIFICATION OF REMOTE SENSING SCENE
    Zhang, Pei
    Bai, Yunpeng
    Wang, Dong
    Bai, Bendu
    Li, Ying
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4590 - 4594
  • [8] META-LEARNING WITH ATTENTION FOR IMPROVED FEW-SHOT LEARNING
    Hou, Zejiang
    Walid, Anwar
    Kung, Sun-Yuan
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2725 - 2729
  • [9] MedOptNet: Meta-Learning Framework for Few-Shot Medical Image Classification
    Lu, Liangfu
    Cui, Xudong
    Tan, Zhiyuan
    Wu, Yulei
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2024, 21 (04) : 725 - 736
  • [10] Meta-TTS: Meta-Learning for Few-Shot Speaker Adaptive Text-to-Speech
    Huang, Sung-Feng
    Lin, Chyi-Jiunn
    Liu, Da-Rong
    Chen, Yi-Chen
    Lee, Hung-yi
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1558 - 1571