A Task-Specific Meta-Learning Framework for Few-Shot Sound Event Detection

被引:0
|
作者
Zhang, Tianyang [1 ]
Yang, Liping [1 ]
Gu, Xiaohua [2 ]
Wang, Yuyang [1 ]
机构
[1] Chongqing Univ, Key Lab Optoelect Technol & Syst, MOE, Chongqing, Peoples R China
[2] Chongqing Univ Sci & Technol, Sch Elect Engn, Chongqing, Peoples R China
基金
中国国家自然科学基金;
关键词
Prototypical network; Sound event detection; Few-shot; Task-specific; Inter-class and intra-class;
D O I
10.1109/MMSP55362.2022.9949191
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Meta-learning is extensively used for few-shot learning. Prototypical Network (ProtoNet) has been proved to perform well for few-shot sound event detection. ProtoNet as a metalearning method consists of two stages: meta-train and metatest. During meta-train stage, an embedded network is trained on base-classes using episode training strategy. During metatest stage, the embedded network is transferred directly to unseen classes during meta-train. This style of transference is task-agnostic: the embedded network may not learn the optimal discrimination embedding features for specific tasks including unseen classes. In this paper, we propose a task-specific meta-learning framework (TSMLF) for few-shot sound event detection, which makes embedded network learn discrimination embedding features for specific tasks. TSMLF inherits the metatrain process of ProtoNet. During meta-test stage, the framework enables embedded network to learn discriminative embedding features by inter-class and intra-class differences. Concretely, we calculate the inter-class and intra-class distance that support set sound samples. Maximizing inter-class distance and minimizing intra-class distance (MIMI) are used as a criteria to fine-tune embedded network for specific tasks. In addition, due to the small-scaled support set of meta-test, similar sound samples are easily excessively clustered during fine-tuning. We set a distance constraint on intra-class distance to avoid overfitting of embedded network. The proposed framework is evaluated using few-shot dataset of Detection and Classification of Acoustic Scenes and Events challenges 2022 (DCASE2022) task 5. Extensive ablation experimental results validate that all components of TSMLF can provide positive effects on few-shot sound event detection.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Decomposed Meta-Learning for Few-Shot Sequence Labeling
    Ma, Tingting
    Wu, Qianhui
    Jiang, Huiqiang
    Lin, Jieru
    Karlsson, Borje F.
    Zhao, Tiejun
    Lin, Chin-Yew
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1980 - 1993
  • [42] Meta-Learning for Few-Shot Named Entity Recognition
    de Lichy, Cyprien
    Glaude, Hadrien
    Campbell, William
    1ST WORKSHOP ON META LEARNING AND ITS APPLICATIONS TO NATURAL LANGUAGE PROCESSING (METANLP 2021), 2021, : 44 - 58
  • [43] Meta-Learning for Few-Shot Time Series Classification
    Narwariya, Jyoti
    Malhotra, Pankaj
    Vig, Lovekesh
    Shroff, Gautam
    Vishnu, T. V.
    PROCEEDINGS OF THE 7TH ACM IKDD CODS AND 25TH COMAD (CODS-COMAD 2020), 2020, : 28 - 36
  • [44] Meta-Learning for Few-Shot Land Cover Classification
    Russwurm, Marc
    Wang, Sherrie
    Koerner, Marco
    Lobell, David
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 788 - 796
  • [45] META-LEARNING FOR FEW-SHOT TIME SERIES CLASSIFICATION
    Wang, Sherrie
    Russwurm, Marc
    Koerner, Marco
    Lobell, David B.
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 7041 - 7044
  • [46] Adversarially Robust Few-Shot Learning: A Meta-Learning Approach
    Goldblum, Micah
    Fowl, Liam
    Goldstein, Tom
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS (NEURIPS 2020), 2020, 33
  • [47] Stress Testing of Meta-learning Approaches for Few-shot Learning
    Aimen, Aroof
    Sidheekh, Sahil
    Madan, Vineet
    Krishnan, Narayanan C.
    AAAI WORKSHOP ON META-LEARNING AND METADL CHALLENGE, VOL 140, 2021, 140 : 38 - 44
  • [48] MetaDiff: Meta-Learning with Conditional Diffusion for Few-Shot Learning
    Zhang, Baoquan
    Luo, Chuyao
    Yu, Demin
    Li, Xutao
    Lin, Huiwei
    Ye, Yunming
    Zhang, Bowen
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 15, 2024, : 16687 - 16695
  • [49] Few-Shot Personality-Specific Image Captioning via Meta-Learning
    Hosseinzadeh, Mehrdad
    Wang, Yang
    2023 20TH CONFERENCE ON ROBOTS AND VISION, CRV, 2023, : 320 - 327
  • [50] Fast Few-Shot Classification by Few-Iteration Meta-Learning
    Tripathi, Ardhendu Shekhar
    Danelljan, Martin
    Van Gool, Luc
    Timofte, Radu
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 9522 - 9528