A Task-Specific Meta-Learning Framework for Few-Shot Sound Event Detection

被引:0
|
作者
Zhang, Tianyang [1 ]
Yang, Liping [1 ]
Gu, Xiaohua [2 ]
Wang, Yuyang [1 ]
机构
[1] Chongqing Univ, Key Lab Optoelect Technol & Syst, MOE, Chongqing, Peoples R China
[2] Chongqing Univ Sci & Technol, Sch Elect Engn, Chongqing, Peoples R China
基金
中国国家自然科学基金;
关键词
Prototypical network; Sound event detection; Few-shot; Task-specific; Inter-class and intra-class;
D O I
10.1109/MMSP55362.2022.9949191
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Meta-learning is extensively used for few-shot learning. Prototypical Network (ProtoNet) has been proved to perform well for few-shot sound event detection. ProtoNet as a metalearning method consists of two stages: meta-train and metatest. During meta-train stage, an embedded network is trained on base-classes using episode training strategy. During metatest stage, the embedded network is transferred directly to unseen classes during meta-train. This style of transference is task-agnostic: the embedded network may not learn the optimal discrimination embedding features for specific tasks including unseen classes. In this paper, we propose a task-specific meta-learning framework (TSMLF) for few-shot sound event detection, which makes embedded network learn discrimination embedding features for specific tasks. TSMLF inherits the metatrain process of ProtoNet. During meta-test stage, the framework enables embedded network to learn discriminative embedding features by inter-class and intra-class differences. Concretely, we calculate the inter-class and intra-class distance that support set sound samples. Maximizing inter-class distance and minimizing intra-class distance (MIMI) are used as a criteria to fine-tune embedded network for specific tasks. In addition, due to the small-scaled support set of meta-test, similar sound samples are easily excessively clustered during fine-tuning. We set a distance constraint on intra-class distance to avoid overfitting of embedded network. The proposed framework is evaluated using few-shot dataset of Detection and Classification of Acoustic Scenes and Events challenges 2022 (DCASE2022) task 5. Extensive ablation experimental results validate that all components of TSMLF can provide positive effects on few-shot sound event detection.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Few-Shot High-Resolution Range Profile Ship Target Recognition Based on Task-Specific Meta-Learning with Mixed Training and Meta Embedding
    Kong, Yingying
    Zhang, Yuxuan
    Peng, Xiangyang
    Leung, Henry
    REMOTE SENSING, 2023, 15 (22)
  • [32] Adaptive Few-Shot Learning Algorithm for Rare Sound Event Detection
    Zhao, Chendong
    Wang, Jianzong
    Li, Leilai
    Qu, Xiaoyang
    Xiao, Jing
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [33] Federated Meta-Learning Framework for Few-shot Fault Diagnosis in Industrial IoT
    Chen, Jiao
    Tang, Jianhua
    Chen, Jie
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 2993 - 2998
  • [34] Few-shot driver identification via meta-learning
    Lu, Lin
    Xiong, Shengwu
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 203
  • [35] Few-Shot Directed Meta-Learning for Image Classification
    Ouyang, Jihong
    Duan, Ganghai
    Liu, Siguang
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (01)
  • [36] Meta-learning for few-shot time series forecasting
    Xiao, Feng
    Liu, Lu
    Han, Jiayu
    Guo, Degui
    Wang, Shang
    Cui, Hai
    Peng, Tao
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (01) : 325 - 341
  • [37] Few-shot driver identification via meta-learning
    Lu, Lin
    Xiong, Shengwu
    Expert Systems with Applications, 2022, 203
  • [38] Unsupervised Meta-Learning for Few-Shot Image Classification
    Khodadadeh, Siavash
    Boloni, Ladislau
    Shah, Mubarak
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [39] Contrastive Meta-Learning for Few-shot Node Classification
    Wang, Song
    Tan, Zhen
    Liu, Huan
    Li, Jundong
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 2386 - 2397
  • [40] Few-shot Edge Classification in Graph Meta-learning
    Yang, Xiaoxiao
    Xu, Jungang
    2022 IEEE 9TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2022, : 166 - 172