Patch-level contrastive embedding learning for respiratory sound classification

被引:2
|
作者
Song, Wenjie [1 ]
Han, Jiqing [1 ]
机构
[1] Harbin Inst Technol, Comp Fac, Harbin, Peoples R China
基金
中国国家自然科学基金;
关键词
Respiratory sound classification; Adventitious sound; Contrastive learning; Patch-level method; AUSCULTATION; MODEL;
D O I
10.1016/j.bspc.2022.104338
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Nowadays, due to the difficulty of data acquisition and expensive manual annotation, respiratory sound clas-sification suffers from limited training samples, which restrains the performance improvement of existing methods. To learn more information from the limited samples, we previously proposed a method of contrastive embedding learning to incorporate additional out-of-class information into the model. However, since the method mapped each entire sample to a deep embedding vector and modelled the distribution of the embed -dings, it hardly learned the detailed information within the samples. In fact, a sample is a finite combination of various components, and the classification task essentially is to detect the presence of components that contain adventitious sounds, where detailed component-wise information is crucial. To this end, a method of patch-level contrastive embedding learning based on finer-grained patches is further proposed in this paper. It divides each sample into multiple patches and maps the patches to the embedding space. The patches are split into different subclasses, according to the type of adventitious sounds contained in each patch. Considering that there might be no patch-level labels provided in most cases, a Multi-Instance Learning (MIL) based approach is designed to estimate the labels. Then by modelling intra-and inter-subclass distance between the patch-level embeddings, the method learns the detailed information about the difference between patches, which benefits the identifi-cation task. The results following random and official splitting on the ICBHI dataset show that our method achieves the performance of 79.99% and 52.95%, exceeding the previous one by 1.81% and 1.58%, respectively.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Weakly supervised Medulloblastoma tumor classification using domain specific patch-level feature extraction
    Maack, Lennart
    Bhattacharya, Debayan
    Behrendt, Finn
    Bockmayr, Michael
    Schlaefer, Alexander
    DIGITAL AND COMPUTATIONAL PATHOLOGY, MEDICAL IMAGING 2024, 2024, 12933
  • [22] Visual Estimation of Building Condition with Patch-level ConvNets
    Koch, David
    Despotovic, Miroslav
    Sakeena, Muntaha
    Doeller, Mario
    Zeppelzauer, Matthias
    RETECH'18: PROCEEDINGS OF THE 2018 ACM WORKSHOP ON MULTIMEDIA FOR REAL ESTATE TECH, 2018, : 12 - 17
  • [23] Improving ViT interpretability with patch-level mask prediction
    Kang, Junyong
    Heo, Byeongho
    Choe, Junsuk
    PATTERN RECOGNITION LETTERS, 2025, 187 : 73 - 79
  • [24] Supervised Contrastive Learning Framework and Hardware Implementation of Learned ResNet for Real-Time Respiratory Sound Classification
    Hu, Jinhai
    Leow, Cong Sheng
    Tao, Shuailin
    Goh, Wang Ling
    Gao, Yuan
    IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS, 2025, 19 (01) : 185 - 195
  • [25] Active Contour Integrating Patch-Level and Pixel-Level Features
    Mao, Xinyue
    Chen, Yufei
    Liu, Xianhui
    Zhao, Weidong
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2017, PT I, 2017, 10361 : 353 - 365
  • [26] Triplet Contrastive Learning for Aspect Level Sentiment Classification
    Xiong, Haoliang
    Yan, Zehao
    Zhao, Hongya
    Huang, Zhenhua
    Xue, Yun
    MATHEMATICS, 2022, 10 (21)
  • [27] A deep residual neural network framework with transfer learning for concrete dams patch-level crack classification and weakly-supervised localization
    Li, Yangtao
    Bao, Tengfei
    Xu, Bo
    Shu, Xiaosong
    Zhou, Yuhang
    Du, Ye
    Wang, Ruijie
    Zhang, Kang
    MEASUREMENT, 2022, 188
  • [28] Patch-level Gaze Distribution Prediction for Gaze Following
    Miao, Qiaomu
    Minh Hoai
    Samaras, Dimitris
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 880 - 889
  • [29] Efficient patch-level descriptor for image categorization via patch pivots selection
    Beijing Key Laboratory of Traffic Data Analysis and Mining , Beijing
    100044, China
    不详
    071002, China
    不详
    071000, China
    Ruan Jian Xue Bao, 11 (2930-2938):
  • [30] UNSUPERVISED PERSON RE-IDENTIFICATION VIA GLOBAL-LEVEL AND PATCH-LEVEL DISCRIMINATIVE FEATURE LEARNING
    Sun, Zongzhe
    Zhao, Feng
    Wu, Feng
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2363 - 2367