Exploring On-Device Learning Using Few Shots for Audio Classification

被引：0

作者：

Chauhan, Jagmohan ^{[1
,2
]}

Kwon, Young D. ^{[2
]}

Mascolo, Cecilia ^{[2
]}

机构：

[1] Univ Southampton, Southampton, Hants, England

[2] Univ Cambridge, Cambridge, England

来源：

2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022) | 2022年

基金：

欧洲研究理事会;

关键词：

Few Shot Learning; Acoustic Event Classification; Keyword Spotting; On-Device Learning; Performance;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Few shot learning (FSL) improves the generalization of neural network classifiers to unseen classes and tasks using small annotated samples of data. Recently, there have been attempts to apply few shot learning in the audio domain for various applications. However, the focus has been mainly on accuracy. Here, we take a holistic view and investigate various system aspects such as latency, storage and memory requirements of few shot learning methods in addition to improving the accuracy with very deep learning models for the tasks of audio classification. To this end, we not only compare the performance of different few shot learning methods but also, for the first time, design an end-to-end framework for smartphones and wearables which can run such methods completely on-device. Our results indicate the need to collect large datasets with more classes as we show much higher gains can be obtained with very deep learning models on big datasets. Surprisingly, metric-based methods such as ProtoTypical Networks can be realized practically on-device and quantization helps further (50%) in reducing the resource requirements, while having no impact on accuracy for the audio classification tasks.

引用

页码：424 / 428

页数：5

共 50 条

[1] TEMPORAL KNOWLEDGE DISTILLATION FOR ON-DEVICE AUDIO CLASSIFICATION
Choi, Kwanghee
Kersner, Martin
Morton, Jacob
Chang, Buru
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 486 - 490
[2] On-Device Intelligence for Real-Time Audio Classification and Enhancement
Hwang, Inwoo
Kim, Kibeom
Kim, Sunmin
JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2023, 71 (10): : 719 - 728
[3] WHO CALLS THE SHOTS? RETHINKING FEW-SHOT LEARNING FOR AUDIO
Wang, Yu
Bryan, Nicholas J.
Salamon, Justin
Cartwright, Mark
Bello, Juan Pablo
2021 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2021, : 36 - 40
[4] A few shots at few shot learning
Waagen, Donald
Hulsey, Don
Gray, David
AUTOMATIC TARGET RECOGNITION XXXIII, 2023, 12521
[5] On-Device Document Classification using multimodal features
Garg, Sugam
Harichandana, S. S.
Kumar, Sumit
CODS-COMAD 2021: PROCEEDINGS OF THE 3RD ACM INDIA JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA (8TH ACM IKDD CODS & 26TH COMAD), 2021, : 203 - 207
[6] Enhanced On-Device Video Summarization Using Audio and Visual Features
Nagaraju, Lokesh Kumar Thandaga
Ranjitha, B.
Shaik, Jani Basha
COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT I, 2024, 2009 : 86 - 98
[7] FEW-SHOT CONTINUAL LEARNING FOR AUDIO CLASSIFICATION
Wang, Yu
Bryan, Nicholas J.
Cartwright, Mark
Bello, Juan Pablo
Salamon, Justin
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 321 - 325
[8] A novel image model for vehicle classification in restricted areas using on-device machine learning
Lamba A.
Kumar V.
International Journal of Information Technology, 2023, 15 (6) : 3037 - 3043
[9] Walking Speed Estimation and Gait Classification Using Plantar Pressure and On-Device Deep Learning
Cho, Hyuntae
IEEE SENSORS JOURNAL, 2023, 23 (19) : 23336 - 23347
[10] Enabling on-device classification of ECG with compressed learning for health IoT
Li, Wenzhuo
Chu, Haoming
Huang, Boming
Huan, Yuxiang
Zheng, Lirong
Zou, Zhuo
MICROELECTRONICS JOURNAL, 2021, 115 (115):

← 1 2 3 4 5 →