Exploring On-Device Learning Using Few Shots for Audio Classification

Citations: 0

Authors:

Chauhan, Jagmohan [1,2]

Kwon, Young D. [2]

Mascolo, Cecilia [2]

Affiliations:

[1] University of Southampton, Southampton, Hampshire, England

[2] University of Cambridge, Cambridge, England

Source:

2022 30th European Signal Processing Conference (EUSIPCO 2022) | 2022

Funding:

European Research Council

Keywords:

Few Shot Learning; Acoustic Event Classification; Keyword Spotting; On-Device Learning; Performance;

DOI:

Not available

Chinese Library Classification number:

O42 [Acoustics]

Discipline classification codes:

070206 ; 082403 ;

Abstract:

Few shot learning (FSL) improves the generalization of neural network classifiers to unseen classes and tasks using a small number of annotated data samples. Recently, there have been attempts to apply few shot learning in the audio domain for various applications. However, the focus has been mainly on accuracy. Here, we take a holistic view and investigate system aspects of few shot learning methods, such as latency, storage and memory requirements, in addition to improving accuracy with very deep learning models for audio classification tasks. To this end, we not only compare the performance of different few shot learning methods but also, for the first time, design an end-to-end framework for smartphones and wearables that can run such methods completely on-device. Our results indicate the need to collect large datasets with more classes, as we show that much higher gains can be obtained with very deep learning models on big datasets. Surprisingly, metric-based methods such as Prototypical Networks can be realized practically on-device, and quantization further reduces the resource requirements (by 50%) while having no impact on accuracy for the audio classification tasks.
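To make the metric-based approach named in the abstract more concrete, the following is a minimal sketch of the classification step of a Prototypical Network: each class prototype is the mean of its support embeddings, and a query is assigned to the nearest prototype. It is not the authors' implementation. The embedding network is omitted entirely, and the 2-dimensional embeddings, the class labels, and all function names are hypothetical values chosen only for illustration.

```python
# Sketch of nearest-prototype classification (the inference step of a
# Prototypical Network). Assumption: embeddings have already been produced
# by some audio encoder, which is not shown here.

def mean_vector(vectors):
    """Element-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]

def squared_euclidean(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def build_prototypes(support):
    """Map each class label to the mean of its support embeddings."""
    return {label: mean_vector(embs) for label, embs in support.items()}

def classify(query, prototypes):
    """Return the label whose prototype is closest to the query embedding."""
    return min(prototypes,
               key=lambda label: squared_euclidean(query, prototypes[label]))

# Hypothetical 2-way, 2-shot episode with made-up 2-D embeddings.
support = {
    "dog_bark": [[0.9, 0.1], [1.1, -0.1]],
    "door_knock": [[-1.0, 0.8], [-0.8, 1.2]],
}
prototypes = build_prototypes(support)
prediction = classify([0.7, 0.2], prototypes)
print(prediction)
```

Because adding a new class only requires averaging a few embeddings, with no gradient updates, this style of method is a plausible fit for the on-device setting the abstract describes.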

Pages: 424-428

Page count: 5
