Few-shot class-incremental audio classification via discriminative prototype learning

被引:3
|
作者
Xie, Wei [1 ]
Li, Yanxiong [1 ]
He, Qianhua [1 ]
Cao, Wenchang [1 ]
机构
[1] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou 510641, Peoples R China
关键词
Audio classification; Few-shot learning; Class-incremental learning; Selective-attention; Prototype adjustment; NEURAL-NETWORK; RECOGNITION; DATASET;
D O I
10.1016/j.eswa.2023.120044
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In real-world scenarios, new audio classes with insufficient samples usually emerge continually, which motivates the study of few-shot class-incremental audio classification (FCAC) in this paper. FCAC aims to enable the model to recognize new audio classes while remembering the base ones continually. To solve the FCAC problem, the discriminability of the prototypes is vital to the model's classification performance. Thus, we proposed a method to learn the discriminative prototypes from two aspects. First, since the generalization ability of the embedding module (EM) significantly affects the discriminability of the prototypes, the proposed method employs a scheme of pseudo-episodic incremental training to train the EM by simulating the test scenario. Second, to enable the model to achieve a balanced classification performance on both base and new audio classes, the proposed method employs a selective-attention module to adjust different prototypes to enhance their discriminability. Extensive experimental results demonstrate that the proposed method achieves state-of-the-art performance in solving the FCAC problem. Notably, the proposed method achieves a comprehensive performance score (CPS) of 87.82% and 59.25% on the Neural Synthesis musical notes of 100 classes (NSynth-100) and Free sound clips of 89 classes (FSC-89) datasets, respectively, which outperforms the comparison methods. Our code is available at https://github.com/chester-w-xie/DPL_FCAC.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Flexible few-shot class-incremental learning with prototype container
    Xu, Xinlei
    Wang, Zhe
    Fu, Zhiling
    Guo, Wei
    Chi, Ziqiu
    Li, Dongdong
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (15): : 10875 - 10889
  • [2] Geometer: Graph Few-Shot Class-Incremental Learning via Prototype Representation
    Lu, Bin
    Gan, Xiaoying
    Yang, Lina
    Zhang, Weinan
    Fu, Luoyi
    Wang, Xinbing
    [J]. PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 1152 - 1161
  • [3] Few-Shot Class-Incremental SAR Target Recognition via Cosine Prototype Learning
    Zhao, Yan
    Zhao, Lingjun
    Ding, Ding
    Hu, Dewen
    Kuang, Gangyao
    Liu, Li
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [4] Few-Shot Class-Incremental Learning via Training-Free Prototype Calibration
    Wang, Qi-Wei
    Zhou, Da-Wei
    Zhang, Yi-Kai
    Zhan, De-Chuan
    Ye, Han-Jia
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [5] A survey on few-shot class-incremental learning
    Tian, Songsong
    Li, Lusi
    Li, Weijun
    Ran, Hang
    Ning, Xin
    Tiwari, Prayag
    [J]. NEURAL NETWORKS, 2024, 169 : 307 - 324
  • [6] A survey on few-shot class-incremental learning
    Tian, Songsong
    Li, Lusi
    Li, Weijun
    Ran, Hang
    Ning, Xin
    Tiwari, Prayag
    [J]. Neural Networks, 2024, 169 : 307 - 324
  • [7] Graph Few-shot Class-incremental Learning
    Tan, Zhen
    Ding, Kaize
    Guo, Ruocheng
    Liu, Huan
    [J]. WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 987 - 996
  • [8] Constrained Few-shot Class-incremental Learning
    Hersche, Michael
    Karunaratne, Geethan
    Cherubini, Giovanni
    Benini, Luca
    Sebastian, Abu
    Rahimi, Abbas
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9047 - 9057
  • [9] Few-Shot Class-Incremental Audio Classification With Adaptive Mitigation of Forgetting and Overfitting
    Li, Yanxiong
    Li, Jialong
    Si, Yongjie
    Tan, Jiaxin
    He, Qianhua
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 2297 - 2311
  • [10] Few-Shot Class-Incremental Learning for Medical Time Series Classification
    Sun, Le
    Zhang, Mingyang
    Wang, Benyou
    Tiwari, Prayag
    [J]. IEEE Journal of Biomedical and Health Informatics, 2024, 28 (04) : 1872 - 1882