Self-Supervised Transfer Learning from Natural Images for Sound Classification

被引:7
|
作者
Shin, Sungho [1 ]
Kim, Jongwon [1 ]
Yu, Yeonguk [1 ]
Lee, Seongju [1 ]
Lee, Kyoobin [1 ]
机构
[1] Gwangju Inst Sci & Technol, Sch Integrated Technol, Gwangju 61005, South Korea
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 07期
关键词
deep learning; sound event detection; self-supervised learning; transfer learning; natural image; CONVOLUTIONAL NEURAL-NETWORKS;
D O I
10.3390/app11073043
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
We propose the implementation of transfer learning from natural images to audio-based images using self-supervised learning schemes. Through self-supervised learning, convolutional neural networks (CNNs) can learn the general representation of natural images without labels. In this study, a convolutional neural network was pre-trained with natural images (ImageNet) via self-supervised learning; subsequently, it was fine-tuned on the target audio samples. Pre-training with the self-supervised learning scheme significantly improved the sound classification performance when validated on the following benchmarks: ESC-50, UrbanSound8k, and GTZAN. The network pre-trained via self-supervised learning achieved a similar level of accuracy as those pre-trained using a supervised method that require labels. Therefore, we demonstrated that transfer learning from natural images contributes to improvements in audio-related tasks, and self-supervised learning with natural images is adequate for pre-training scheme in terms of simplicity and effectiveness.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Self-supervised learning for Environmental Sound Classification
    Tripathi, Achyut Mani
    Mishra, Aakansha
    [J]. APPLIED ACOUSTICS, 2021, 182
  • [2] Respiratory sound classification using supervised and self-supervised learning
    Lee, Sunju
    Ha, Taeyoung
    Hyon, YunKyong
    Chung, Chaeuk
    Kim, Yoonjoo
    Woo, Seong-Dae
    Lee, Song-I
    [J]. RESPIROLOGY, 2023, 28 : 160 - 161
  • [3] Self-supervised Transfer Learning for Hand Mesh Recovery from Binocular Images
    Chen, Zheng
    Wang, Sihan
    Sun, Yi
    Ma, Xiaohong
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11606 - 11614
  • [4] Self-Supervised Transfer Learning of Pulmonary Nodule Classification Based on Partially Annotated CT Images
    Huang, Hong
    Peng, Chao
    Wu, Ruoyu
    Tao, Junli
    Zhang, Jiuquan
    [J]. Guangxue Xuebao/Acta Optica Sinica, 2020, 40 (18):
  • [5] Self-Supervised Transfer Learning of Pulmonary Nodule Classification Based on Partially Annotated CT Images
    Huang Hong
    Peng Chao
    Wu Ruoyu
    Tao Junli
    Zhang Jiuquan
    [J]. ACTA OPTICA SINICA, 2020, 40 (18)
  • [6] Self-supervised Motion Learning from Static Images
    Huang, Ziyuan
    Zhang, Shiwei
    Jiang, Jianwen
    Tang, Mingqian
    Jin, Rong
    Ang, Marcelo H., Jr.
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1276 - 1285
  • [7] Classification of Polyps in Endoscopic Images Using Self-Supervised Structured Learning
    Huang, Qi-Xian
    Lin, Guo-Shiang
    Sun, Hung-Min
    [J]. IEEE ACCESS, 2023, 11 : 50025 - 50037
  • [8] Kidney Tumor Classification on CT images using Self-supervised Learning
    Özbay, Erdal
    Özbay, Feyza Altunbey
    Gharehchopogh, Farhad Soleimanian
    [J]. Computers in Biology and Medicine, 2024, 176
  • [9] SELF-SUPERVISED LEARNING FROM AUTOMATICALLY SEPARATED SOUND SCENES
    Fonseca, Eduardo
    Jansen, Aren
    Ellis, Daniel P. W.
    Wisdom, Scott
    Tagliasacchi, Marco
    Hershey, John R.
    Plakal, Manoj
    Hershey, Shawn
    Moore, R. Channing
    Serra, Xavier
    [J]. 2021 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2021, : 251 - 255
  • [10] Self-Supervised Learning for Alignment of Objects and Sound
    Liu, Xinzhu
    Liu, Xiaoyu
    Guo, Di
    Liu, Huaping
    Sun, Fuchun
    Min, Haibo
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 1588 - 1594