Exploring On-Device Learning Using Few Shots for Audio Classification

被引:0
|
作者
Chauhan, Jagmohan [1 ,2 ]
Kwon, Young D. [2 ]
Mascolo, Cecilia [2 ]
机构
[1] Univ Southampton, Southampton, Hants, England
[2] Univ Cambridge, Cambridge, England
基金
欧洲研究理事会;
关键词
Few Shot Learning; Acoustic Event Classification; Keyword Spotting; On-Device Learning; Performance;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Few shot learning (FSL) improves the generalization of neural network classifiers to unseen classes and tasks using small annotated samples of data. Recently, there have been attempts to apply few shot learning in the audio domain for various applications. However, the focus has been mainly on accuracy. Here, we take a holistic view and investigate various system aspects such as latency, storage and memory requirements of few shot learning methods in addition to improving the accuracy with very deep learning models for the tasks of audio classification. To this end, we not only compare the performance of different few shot learning methods but also, for the first time, design an end-to-end framework for smartphones and wearables which can run such methods completely on-device. Our results indicate the need to collect large datasets with more classes as we show much higher gains can be obtained with very deep learning models on big datasets. Surprisingly, metric-based methods such as ProtoTypical Networks can be realized practically on-device and quantization helps further (50%) in reducing the resource requirements, while having no impact on accuracy for the audio classification tasks.
引用
收藏
页码:424 / 428
页数:5
相关论文
共 50 条
  • [1] TEMPORAL KNOWLEDGE DISTILLATION FOR ON-DEVICE AUDIO CLASSIFICATION
    Choi, Kwanghee
    Kersner, Martin
    Morton, Jacob
    Chang, Buru
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 486 - 490
  • [2] On-Device Intelligence for Real-Time Audio Classification and Enhancement
    Hwang, Inwoo
    Kim, Kibeom
    Kim, Sunmin
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2023, 71 (10): : 719 - 728
  • [3] WHO CALLS THE SHOTS? RETHINKING FEW-SHOT LEARNING FOR AUDIO
    Wang, Yu
    Bryan, Nicholas J.
    Salamon, Justin
    Cartwright, Mark
    Bello, Juan Pablo
    2021 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2021, : 36 - 40
  • [4] A few shots at few shot learning
    Waagen, Donald
    Hulsey, Don
    Gray, David
    AUTOMATIC TARGET RECOGNITION XXXIII, 2023, 12521
  • [5] On-Device Document Classification using multimodal features
    Garg, Sugam
    Harichandana, S. S.
    Kumar, Sumit
    CODS-COMAD 2021: PROCEEDINGS OF THE 3RD ACM INDIA JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA (8TH ACM IKDD CODS & 26TH COMAD), 2021, : 203 - 207
  • [6] Enhanced On-Device Video Summarization Using Audio and Visual Features
    Nagaraju, Lokesh Kumar Thandaga
    Ranjitha, B.
    Shaik, Jani Basha
    COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT I, 2024, 2009 : 86 - 98
  • [7] FEW-SHOT CONTINUAL LEARNING FOR AUDIO CLASSIFICATION
    Wang, Yu
    Bryan, Nicholas J.
    Cartwright, Mark
    Bello, Juan Pablo
    Salamon, Justin
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 321 - 325
  • [8] A novel image model for vehicle classification in restricted areas using on-device machine learning
    Lamba A.
    Kumar V.
    International Journal of Information Technology, 2023, 15 (6) : 3037 - 3043
  • [9] Walking Speed Estimation and Gait Classification Using Plantar Pressure and On-Device Deep Learning
    Cho, Hyuntae
    IEEE SENSORS JOURNAL, 2023, 23 (19) : 23336 - 23347
  • [10] Enabling on-device classification of ECG with compressed learning for health IoT
    Li, Wenzhuo
    Chu, Haoming
    Huang, Boming
    Huan, Yuxiang
    Zheng, Lirong
    Zou, Zhuo
    MICROELECTRONICS JOURNAL, 2021, 115 (115):