Detection of breath cycles in pediatric lung sounds via an object detection-based transfer learning method

被引:0
|
作者
Park, Sa-Yoon [1 ,2 ]
Park, Ji Soo [3 ]
Lee, Jisoo [4 ]
Lee, Hyesu [1 ]
Kim, Yelin [5 ]
Suh, Dong In [3 ]
Kim, Kwangsoo [6 ,7 ]
机构
[1] Seoul Natl Univ Hosp, Inst Convergence Med Innovat Technol, Seoul 03080, South Korea
[2] Wonkwang Univ, Coll Korean Med, Dept Physiol, Iksan 54538, South Korea
[3] Seoul Natl Univ, Seoul Natl Univ Hosp, Dept Pediat, Seoul Natl Univ Hosp,Coll Med, 101 Daehak Ro, Seoul 03080, South Korea
[4] Seoul Natl Univ, Interdisciplinary Program Bioengn, Seoul 08826, South Korea
[5] Hongik Univ, Dept Comp Engn, Seoul, South Korea
[6] Seoul Natl Univ Hosp, Inst Convergence Med Innovat Technol, Dept Transdisciplinary Med, 101 Daehak Ro, Seoul 03080, South Korea
[7] Seoul Natl Univ, Coll Med, Dept Med, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Pediatric lung sounds; Breath cycle detection; Object detection; Transfer learning; Auscultation; SYSTEM;
D O I
10.1016/j.bspc.2025.107693
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Auscultation is critical for assessing the respiratory system in children; however, the lack of pediatric lung sound databases impedes the development of automated analysis tools. This study introduces an object detection-based transfer learning method to accurately predict breath cycles in pediatric lung sounds. We utilized a model based on the YOLOv1 architecture, initially pre-trained on an adult lung sound dataset (HF_Lung_v1) and subsequently fine-tuned on a pediatric dataset (SNUCH_Lung). The input feature was the log Mel spectrogram, which effectively captured the relevant frequency and temporal information. The pre-trained model achieved an F1 score of 0.900 +/- 0.003 on the HF_Lung_v1 dataset. After fine-tuning, it reached an F1 score of 0.824 +/- 0.009 on the SNUCH_Lung dataset, confirming the efficacy of transfer learning. This model surpassed the performance of a baseline model trained solely on the SNUCH_Lung dataset without transfer learning. We also explored the impact of segment length, width, and various audio feature extraction techniques; the optimal results were obtained with 15 s segments, a 2-second width, and the log Mel spectrogram. The model is promising for clinical applications, such as generating large-scale annotated datasets, visualizing and labeling individual breath cycles, and performing correlation analysis with physiological indicators. Future research will focus on expanding the pediatric lung sound database through auto-labeling techniques and integrating the model into stethoscopes for real-time analysis. This study highlights the potential of object detection-based transfer learning in enhancing the accuracy of breath cycle prediction in pediatric lung sounds and advancing pediatric respiratory sound analysis tools.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] A detection-based multiple object tracking method
    Han, M
    Sethi, A
    Hua, W
    Gong, YH
    ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 3065 - 3068
  • [2] A Skeleton Object Detection-Based Dynamic Gesture Recognition Method
    Bai, Yunchao
    Zhang, Libo
    Wang, Tianxing
    Zhou, Xianzhong
    PROCEEDINGS OF THE 2019 IEEE 16TH INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL (ICNSC 2019), 2019, : 212 - 217
  • [3] Object Detection-Based Video Compression
    Kim, Myung-Jun
    Lee, Yung-Lyul
    APPLIED SCIENCES-BASEL, 2022, 12 (09):
  • [4] Breath Sounds Detection System based on SOPC
    Qin, Gong
    Zhou, Jun
    CURRENT TRENDS IN COMPUTER SCIENCE AND MECHANICAL AUTOMATION (CSMA), VOL 2, 2017, : 93 - 104
  • [5] Detection of breath sounds in speech: A deep learning approach
    Arafath, K. Mohamed Ismail Yasar
    Routray, Aurobinda
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 141
  • [6] A visual tracking method via object detection based on deep learning
    Tang C.
    Ling Y.
    Yang H.
    Yang X.
    Zheng C.
    Hongwai yu Jiguang Gongcheng/Infrared and Laser Engineering, 2018, 47 (05):
  • [7] Cephalometric Landmarks Identification Through an Object Detection-based Deep Learning Model
    Tafala, Idriss
    Ben-Bouazza, Fatima-Ezzahraa
    Edder, Aymane
    Manchadi, Oumaima
    Et-Taoussi, Mehdi
    Jioudi, Bassma
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (02) : 859 - 867
  • [8] Object Detection-based Visual SLAM for Dynamic Scenes
    Zhao, Xinhua
    Ye, Lei
    PROCEEDINGS OF 2022 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (IEEE ICMA 2022), 2022, : 1153 - 1158
  • [9] An efficient semantic segmentation method based on transfer learning from object detection
    Yang, Wei
    Zhang, Jianlin
    Chen, Zhongbi
    Xu, Zhiyong
    IET IMAGE PROCESSING, 2021, 15 (01) : 57 - 64
  • [10] An object detection-based few-shot learning approach for multimedia quality assessment
    Chatterjee, Rajdeep
    Chatterjee, Ankita
    Islam, S. K. Hafizul
    Khan, Muhammad Khurram
    MULTIMEDIA SYSTEMS, 2023, 29 (05) : 2899 - 2912