Segment-Based Spotting of Bowel Sounds Using Pretrained Models in Continuous Data Streams

被引:3
|
作者
Baronetto, Annalisa [1 ,2 ]
Graf, Luisa S. [3 ]
Fischer, Sarah [3 ,4 ]
Neurath, Markus F. [4 ]
Amft, Oliver [1 ,2 ]
机构
[1] Univ Freiburg, D-79110 Freiburg, Germany
[2] Hahn Schickard, D-79110 Freiburg, Germany
[3] Friedrich Alexander Univ Erlangen Nurnberg, D-91054 Erlangen, Germany
[4] Univ Hosp Erlangen, I Med Clin Deutsch Zentrum Immuntherapie DZI, D-91052 Erlangen, Germany
关键词
Bowel sound detection; deep learning; transfer learning; audio tagging; LONG-TERM; BIOACOUSTICS APPLICATION; WAVELETS;
D O I
10.1109/JBHI.2023.3269910
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We analyse pretrained and non-pretrained deep neural models to detect 10-seconds Bowel Sounds (BS) audio segments in continuous audio data streams. The models include MobileNet, EfficientNet, and Distilled Transformer architectures. Models were initially trained on AudioSet and then transferred and evaluated on 84 hours of labelled audio data of eighteen healthy participants. Evaluation data was recorded in a semi-naturalistic daytime setting including movement and background noise using a smart shirt with embedded microphones. The collected dataset was annotated for individual BS events by two independent raters with substantial agreement (Cohen's Kappa kappa = 0.74). Leave-One-Participant-Out cross-validation for detecting 10-second BS audio segments, i.e. segment-based BS spotting, yielded a best F1 score of 73% and 67%, with and without transfer learning respectively. The best model for segment-based BS spotting was EfficientNet-B2 with an attention module. Our results show that pretrained models could improve F1 score up to 26%, in particular, increasing robustness against background noise. Our segment-based BS spotting approach reduces the amount of audio data to be reviewed by experts from 84 h to 11 h, thus by similar to 87%.
引用
收藏
页码:3164 / 3174
页数:11
相关论文
共 50 条
  • [1] A Segment-Based Drift Adaptation Method for Data Streams
    Song, Yiliao
    Lu, Jie
    Liu, Anjin
    Lu, Haiyan
    Zhang, Guangquan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (09) : 4876 - 4889
  • [2] A segment-based framework for modeling and mining data streams
    Charu C. Aggarwal
    Knowledge and Information Systems, 2012, 30 : 1 - 29
  • [3] A segment-based framework for modeling and mining data streams
    Aggarwal, Charu C.
    KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 30 (01) : 1 - 29
  • [4] A segment-based wordspotter using phonetic filler models
    Manos, AS
    Zue, VW
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 899 - 902
  • [5] Segment-Based Continuous Stereo Matching
    Zhu, Wenqiao
    Diao, Changyu
    Xu, Duanqing
    Lu, Dongming
    2011 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), VOLS 1-4, 2012, : 1664 - 1668
  • [6] Segment-based adaptive proxy caching of multimedia streams
    Zheng, Dongdong
    Yu, Hongliang
    Zheng, Weimin
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2007, 47 (07): : 1204 - 1207
  • [7] Hybrid Segment-based Transcoding Proxy Caching of Multimedia Streams
    Park, Yoohyun
    Lee, Yongju
    Kim, Hagyoung
    Kim, Kyongsok
    8TH IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY WORKSHOPS: CIT WORKSHOPS 2008, PROCEEDINGS, 2008, : 319 - 324
  • [8] FAST ALGORITHMS FOR PHONE CLASSIFICATION AND RECOGNITION USING SEGMENT-BASED MODELS
    DIGALAKIS, VV
    OSTENDORF, M
    ROHLICEK, JR
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1992, 40 (12) : 2885 - 2896
  • [9] Segment-based Models for Event Detection and Recounting
    Kovvuri, Rama
    Nevatia, Ram
    Snoek, Cees G. M.
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3868 - 3873
  • [10] Segment-based Hidden Markov Models for Information Extraction
    Gu, Zhenmei
    Cercone, Nick
    COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 481 - 488