Patch-level contrastive embedding learning for respiratory sound classification

被引：2

作者：

Song, Wenjie ^{[1
]}

Han, Jiqing ^{[1
]}

机构：

[1] Harbin Inst Technol, Comp Fac, Harbin, Peoples R China

来源：

BIOMEDICAL SIGNAL PROCESSING AND CONTROL | 2023年 / 80卷

基金：

中国国家自然科学基金;

关键词：

Respiratory sound classification; Adventitious sound; Contrastive learning; Patch-level method; AUSCULTATION; MODEL;

D O I：

10.1016/j.bspc.2022.104338

中图分类号：

R318 [生物医学工程];

学科分类号：

0831 ;

摘要：

Nowadays, due to the difficulty of data acquisition and expensive manual annotation, respiratory sound clas-sification suffers from limited training samples, which restrains the performance improvement of existing methods. To learn more information from the limited samples, we previously proposed a method of contrastive embedding learning to incorporate additional out-of-class information into the model. However, since the method mapped each entire sample to a deep embedding vector and modelled the distribution of the embed -dings, it hardly learned the detailed information within the samples. In fact, a sample is a finite combination of various components, and the classification task essentially is to detect the presence of components that contain adventitious sounds, where detailed component-wise information is crucial. To this end, a method of patch-level contrastive embedding learning based on finer-grained patches is further proposed in this paper. It divides each sample into multiple patches and maps the patches to the embedding space. The patches are split into different subclasses, according to the type of adventitious sounds contained in each patch. Considering that there might be no patch-level labels provided in most cases, a Multi-Instance Learning (MIL) based approach is designed to estimate the labels. Then by modelling intra-and inter-subclass distance between the patch-level embeddings, the method learns the detailed information about the difference between patches, which benefits the identifi-cation task. The results following random and official splitting on the ICBHI dataset show that our method achieves the performance of 79.99% and 52.95%, exceeding the previous one by 1.81% and 1.58%, respectively.

引用

页数：10

共 50 条

[21] Weakly supervised Medulloblastoma tumor classification using domain specific patch-level feature extraction
Maack, Lennart
Bhattacharya, Debayan
Behrendt, Finn
Bockmayr, Michael
Schlaefer, Alexander
DIGITAL AND COMPUTATIONAL PATHOLOGY, MEDICAL IMAGING 2024, 2024, 12933
[22] Visual Estimation of Building Condition with Patch-level ConvNets
Koch, David
Despotovic, Miroslav
Sakeena, Muntaha
Doeller, Mario
Zeppelzauer, Matthias
RETECH'18: PROCEEDINGS OF THE 2018 ACM WORKSHOP ON MULTIMEDIA FOR REAL ESTATE TECH, 2018, : 12 - 17
[23] Improving ViT interpretability with patch-level mask prediction
Kang, Junyong
Heo, Byeongho
Choe, Junsuk
PATTERN RECOGNITION LETTERS, 2025, 187 : 73 - 79
[24] Supervised Contrastive Learning Framework and Hardware Implementation of Learned ResNet for Real-Time Respiratory Sound Classification
Hu, Jinhai
Leow, Cong Sheng
Tao, Shuailin
Goh, Wang Ling
Gao, Yuan
IEEE TRANSACTIONS ON BIOMEDICAL CIRCUITS AND SYSTEMS, 2025, 19 (01) : 185 - 195
[25] Active Contour Integrating Patch-Level and Pixel-Level Features
Mao, Xinyue
Chen, Yufei
Liu, Xianhui
Zhao, Weidong
INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2017, PT I, 2017, 10361 : 353 - 365
[26] Triplet Contrastive Learning for Aspect Level Sentiment Classification
Xiong, Haoliang
Yan, Zehao
Zhao, Hongya
Huang, Zhenhua
Xue, Yun
MATHEMATICS, 2022, 10 (21)
[27] A deep residual neural network framework with transfer learning for concrete dams patch-level crack classification and weakly-supervised localization
Li, Yangtao
Bao, Tengfei
Xu, Bo
Shu, Xiaosong
Zhou, Yuhang
Du, Ye
Wang, Ruijie
Zhang, Kang
MEASUREMENT, 2022, 188
[28] Patch-level Gaze Distribution Prediction for Gaze Following
Miao, Qiaomu
Minh Hoai
Samaras, Dimitris
2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 880 - 889
[29] Efficient patch-level descriptor for image categorization via patch pivots selection
Beijing Key Laboratory of Traffic Data Analysis and Mining , Beijing
100044, China
不详
071002, China
不详
071000, China
Ruan Jian Xue Bao, 11 (2930-2938):
[30] UNSUPERVISED PERSON RE-IDENTIFICATION VIA GLOBAL-LEVEL AND PATCH-LEVEL DISCRIMINATIVE FEATURE LEARNING
Sun, Zongzhe
Zhao, Feng
Wu, Feng
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2363 - 2367

← 1 2 3 4 5 →