Exploring On-Device Learning Using Few Shots for Audio Classification

Citations: 0
Authors
Chauhan, Jagmohan [1, 2]
Kwon, Young D. [2 ]
Mascolo, Cecilia [2 ]
Affiliations
[1] Univ Southampton, Southampton, Hants, England
[2] Univ Cambridge, Cambridge, England
Source
2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022) | 2022
Funding
European Research Council
Keywords
Few Shot Learning; Acoustic Event Classification; Keyword Spotting; On-Device Learning; Performance;
DOI
Not available
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Few-shot learning (FSL) improves the generalization of neural network classifiers to unseen classes and tasks using only a small number of annotated samples. Recent work has applied few-shot learning to various applications in the audio domain, but the focus has been mainly on accuracy. Here, we take a holistic view: in addition to improving accuracy with very deep learning models for audio classification tasks, we investigate system aspects of few-shot learning methods such as latency, storage, and memory requirements. To this end, we not only compare the performance of different few-shot learning methods but also, for the first time, design an end-to-end framework for smartphones and wearables that can run such methods completely on-device. Our results indicate the need to collect large datasets with more classes, as we show that much higher gains can be obtained with very deep learning models on big datasets. Surprisingly, metric-based methods such as Prototypical Networks can be realized practically on-device, and quantization further reduces the resource requirements (by 50%) while having no impact on accuracy for the audio classification tasks.
Pages: 424-428
Page count: 5
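
The abstract singles out metric-based methods such as Prototypical Networks as practical on-device. The following is a minimal, generic sketch of the classification step such methods use (average the few labelled embeddings of each class into a prototype, then assign a query to the nearest prototype). It is not the authors' implementation: the function names, the 2-D embeddings, and the class labels are hypothetical, and a real system would obtain embeddings from a trained audio encoder.

```python
import math
from collections import defaultdict


def class_prototypes(support_embeddings, support_labels):
    """Average the support embeddings of each class into one prototype."""
    grouped = defaultdict(list)
    for vec, label in zip(support_embeddings, support_labels):
        grouped[label].append(vec)
    return {
        label: [sum(dim) / len(vecs) for dim in zip(*vecs)]
        for label, vecs in grouped.items()
    }


def classify(query_embedding, prototypes):
    """Return the label whose prototype is closest in Euclidean distance."""
    return min(
        prototypes,
        key=lambda label: math.dist(query_embedding, prototypes[label]),
    )


# Hypothetical 2-D embeddings standing in for encoder outputs
# (two "shots" per class).
support = [[0.0, 0.0], [0.2, 0.0], [1.0, 1.0], [1.0, 0.8]]
labels = ["dog_bark", "dog_bark", "siren", "siren"]

prototypes = class_prototypes(support, labels)
prediction = classify([0.9, 0.9], prototypes)
print(prediction)
```

Because adding a new class only requires averaging a handful of embeddings, no gradient updates are needed at enrollment time, which is one plausible reason such methods suit constrained devices.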