Patch-level contrastive embedding learning for respiratory sound classification

被引:2
|
作者
Song, Wenjie [1 ]
Han, Jiqing [1 ]
机构
[1] Harbin Inst Technol, Comp Fac, Harbin, Peoples R China
基金
中国国家自然科学基金;
关键词
Respiratory sound classification; Adventitious sound; Contrastive learning; Patch-level method; AUSCULTATION; MODEL;
D O I
10.1016/j.bspc.2022.104338
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Nowadays, due to the difficulty of data acquisition and expensive manual annotation, respiratory sound clas-sification suffers from limited training samples, which restrains the performance improvement of existing methods. To learn more information from the limited samples, we previously proposed a method of contrastive embedding learning to incorporate additional out-of-class information into the model. However, since the method mapped each entire sample to a deep embedding vector and modelled the distribution of the embed -dings, it hardly learned the detailed information within the samples. In fact, a sample is a finite combination of various components, and the classification task essentially is to detect the presence of components that contain adventitious sounds, where detailed component-wise information is crucial. To this end, a method of patch-level contrastive embedding learning based on finer-grained patches is further proposed in this paper. It divides each sample into multiple patches and maps the patches to the embedding space. The patches are split into different subclasses, according to the type of adventitious sounds contained in each patch. Considering that there might be no patch-level labels provided in most cases, a Multi-Instance Learning (MIL) based approach is designed to estimate the labels. Then by modelling intra-and inter-subclass distance between the patch-level embeddings, the method learns the detailed information about the difference between patches, which benefits the identifi-cation task. The results following random and official splitting on the ICBHI dataset show that our method achieves the performance of 79.99% and 52.95%, exceeding the previous one by 1.81% and 1.58%, respectively.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification
    Bae, Sangmin
    Kim, June-Woo
    Cho, Won-Yang
    Baek, Hyerim
    Son, Soyoun
    Lee, Byungjo
    Ha, Changwan
    Tae, Kyongpil
    Kim, Sungnyun
    Yun, Se-Young
    INTERSPEECH 2023, 2023, : 5436 - 5440
  • [2] CONTRASTIVE EMBEDDIND LEARNING METHOD FOR RESPIRATORY SOUND CLASSIFICATION
    Song, Wenjie
    Han, Jiqing
    Song, Hongwei
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 1275 - 1279
  • [3] Patch-level Tumor Classification in Digital Histopathology Images with Domain Adapted Deep Learning
    Xia, Tian
    Kumar, Ashnil
    Feng, Dagan
    Kim, Jinman
    2018 40TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2018, : 644 - 647
  • [4] Patch-Level Spatial Layout for Classification and Weakly Supervised Localization
    Zadrija, Valentina
    Krapac, Josip
    Verbeek, Jakob
    Segvic, Sinisa
    PATTERN RECOGNITION, GCPR 2015, 2015, 9358 : 492 - 503
  • [5] Improved Patch-Mix Transformer and Contrastive Learning Method for Sound Classification in Noisy Environments
    Chen, Xu
    Wang, Mei
    Kan, Ruixiang
    Qiu, Hongbing
    APPLIED SCIENCES-BASEL, 2024, 14 (21):
  • [6] Enhanced Random Forest with Image/Patch-Level Learning for Image Understanding
    Hoo, Wai Lam
    Kim, Tae-Kyun
    Pei, Yuru
    Chan, Chee Seng
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 3434 - 3439
  • [7] Patch-level Representation Learning for Self-supervised Vision Transformers
    Yun, Sukmin
    Lee, Hankook
    Kim, Jaehyung
    Shin, Jinwoo
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8344 - 8353
  • [8] Classification of Lung Cancer Histology Images using Patch-Level Summary Statistics
    Graham, Simon
    Shaban, Muhammad
    Qaiser, Talha
    Koohbanani, Navid Alemi
    Khurram, Syed Ali
    Rajpoot, Nasir
    MEDICAL IMAGING 2018: DIGITAL PATHOLOGY, 2018, 10581
  • [9] Vision Transformer based Audio Classification using Patch-level Feature Fusion
    Luo, Juan
    Yang, Jielong
    Chng, Eng Siong
    Zhong, Xionghu
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 22 - 26
  • [10] From Patch-level to ROI-level Deep Feature Representations for Breast Histopathology Classification
    Mercan, Caner
    Aksoy, Selim
    Mercan, Ezgi
    Shapiro, Linda G.
    Weaver, Donald L.
    Elmore, Joann G.
    MEDICAL IMAGING 2019: DIGITAL PATHOLOGY, 2019, 10956