A Task-Specific Meta-Learning Framework for Few-Shot Sound Event Detection

被引:0
|
作者
Zhang, Tianyang [1 ]
Yang, Liping [1 ]
Gu, Xiaohua [2 ]
Wang, Yuyang [1 ]
机构
[1] Chongqing Univ, Key Lab Optoelect Technol & Syst, MOE, Chongqing, Peoples R China
[2] Chongqing Univ Sci & Technol, Sch Elect Engn, Chongqing, Peoples R China
基金
中国国家自然科学基金;
关键词
Prototypical network; Sound event detection; Few-shot; Task-specific; Inter-class and intra-class;
D O I
10.1109/MMSP55362.2022.9949191
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Meta-learning is extensively used for few-shot learning. Prototypical Network (ProtoNet) has been proved to perform well for few-shot sound event detection. ProtoNet as a metalearning method consists of two stages: meta-train and metatest. During meta-train stage, an embedded network is trained on base-classes using episode training strategy. During metatest stage, the embedded network is transferred directly to unseen classes during meta-train. This style of transference is task-agnostic: the embedded network may not learn the optimal discrimination embedding features for specific tasks including unseen classes. In this paper, we propose a task-specific meta-learning framework (TSMLF) for few-shot sound event detection, which makes embedded network learn discrimination embedding features for specific tasks. TSMLF inherits the metatrain process of ProtoNet. During meta-test stage, the framework enables embedded network to learn discriminative embedding features by inter-class and intra-class differences. Concretely, we calculate the inter-class and intra-class distance that support set sound samples. Maximizing inter-class distance and minimizing intra-class distance (MIMI) are used as a criteria to fine-tune embedded network for specific tasks. In addition, due to the small-scaled support set of meta-test, similar sound samples are easily excessively clustered during fine-tuning. We set a distance constraint on intra-class distance to avoid overfitting of embedded network. The proposed framework is evaluated using few-shot dataset of Detection and Classification of Acoustic Scenes and Events challenges 2022 (DCASE2022) task 5. Extensive ablation experimental results validate that all components of TSMLF can provide positive effects on few-shot sound event detection.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Task Agnostic Meta-Learning for Few-Shot Learning
    Jamal, Muhammad Abdullah
    Qi, Guo-Jun
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 11711 - 11719
  • [2] A MUTUAL LEARNING FRAMEWORK FOR FEW-SHOT SOUND EVENT DETECTION
    Yang, Dongchao
    Wang, Helin
    Zou, Yuexian
    Ye, Zhongjie
    Wang, Wenwu
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 811 - 815
  • [3] Graph Few-shot Learning with Task-specific Structures
    Wang, Song
    Chen, Chen
    Li, Jundong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [4] Few-Shot Learning of Compact Models via Task-Specific Meta Distillation
    Wu, Yong
    Chanda, Shekhor
    Hosseinzadeh, Mehrdad
    Liu, Zhi
    Wang, Yang
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 6254 - 6263
  • [5] Meta-Learning for Few-Shot Plant Disease Detection
    Chen, Liangzhe
    Cui, Xiaohui
    Li, Wei
    FOODS, 2021, 10 (10)
  • [6] Active Few-Shot Learning for Sound Event Detection
    Wang, Yu
    Cartwright, Mark
    Bello, Juan Pablo
    INTERSPEECH 2022, 2022, : 1551 - 1555
  • [7] A Method of Few-Shot Network Intrusion Detection Based on Meta-Learning Framework
    Xu, Congyuan
    Shen, Jizhong
    Du, Xin
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2020, 15 : 3540 - 3552
  • [8] Few-shot time series forecasting in a meta-learning framework
    Ma P.
    Ni Z.
    Ma, Ping (1533321767@qq.com), 1600, IOS Press BV (46): : 8903 - 8916
  • [9] Decentralized federated meta-learning framework for few-shot multitask learning
    Li, Xiaoli
    Li, Yuzheng
    Wang, Jining
    Chen, Chuan
    Yang, Liu
    Zheng, Zibin
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (11) : 8490 - 8522
  • [10] Unsupervised meta-learning for few-shot learning
    Xu, Hui
    Wang, Jiaxing
    Li, Hao
    Ouyang, Deqiang
    Shao, Jie
    PATTERN RECOGNITION, 2021, 116