Heart Sound Segmentation Using Bidirectional LSTMs With Attention

被引:41
|
作者
Fernando, Tharindu [1 ]
Ghaemmaghami, Houman [2 ]
Denman, Simon [1 ]
Sridharan, Sridha [1 ]
Hussain, Nayyar [2 ]
Fookes, Clinton [1 ]
机构
[1] Queensland Univ Technol, Speech & Audio Res Lab, SAIVT, Brisbane, Qld 4000, Australia
[2] M3DICINE Pty Ltd, Brisbane, Qld 4113, Australia
关键词
Heart; Phonocardiography; Animals; Feature extraction; Noise measurement; Task analysis; Recurrent neural networks; Heart sound segmentation; deep recurrent neural networks; attention models; long short term memory networks; biomedical signal processing; phonocardiogram; ALGORITHM;
D O I
10.1109/JBHI.2019.2949516
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: This paper proposes a novel framework for the segmentation of phonocardiogram (PCG) signals into heart states, exploiting the temporal evolution of the PCG as well as considering the salient information that it provides for the detection of the heart state. Methods: We propose the use of recurrent neural networks and exploit recent advancements in attention based learning to segment the PCG signal. This allows the network to identify the most salient aspects of the signal and disregard uninformative information. Results: The proposed method attains state-of-the-art performance on multiple benchmarks including both human and animal heart recordings. Furthermore, we empirically analyse different feature combinations including envelop features, wavelet and Mel Frequency Cepstral Coefficients (MFCC), and provide quantitative measurements that explore the importance of different features in the proposed approach. Conclusion: We demonstrate that a recurrent neural network coupled with attention mechanisms can effectively learn from irregular and noisy PCG recordings. Our analysis of different feature combinations shows that MFCC features and their derivatives offer the best performance compared to classical wavelet and envelop features. Significance: Heart sound segmentation is a crucial pre-processing step for many diagnostic applications. The proposed method provides a cost effective alternative to labour extensive manual segmentation, and provides a more accurate segmentation than existing methods. As such, it can improve the performance of further analysis including the detection of murmurs and ejection clicks. The proposed method is also applicable for detection and segmentation of other one dimensional biomedical signals.
引用
收藏
页码:1601 / 1609
页数:9
相关论文
共 50 条
  • [1] Segmentation of Radar-Recorded Heart Sound Signals Using Bidirectional LSTM Networks
    Shi, Kilin
    Schellenberger, Sven
    Weber, Leon
    Wiedemann, Jan Philipp
    Michler, Fabian
    Steigleder, Tobias
    Malessa, Anke
    Lurz, Fabian
    Ostgathe, Christoph
    Weigel, Robert
    Koelpin, Alexander
    [J]. 2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 6677 - 6680
  • [2] Context-Sensitive and Role-Dependent Spoken Language Understanding using Bidirectional and Attention LSTMs
    Hori, Chiori
    Hori, Takaaki
    Watanabe, Shinji
    Hershey, John R.
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3236 - 3240
  • [3] Advanced Dance Choreography System Using Bidirectional LSTMs
    Yoo, Hanha
    Sung, Yunsick
    [J]. SYSTEMS, 2023, 11 (04):
  • [4] Heart Sound Segmentation Using Fractal Decomposition
    Thomas, Rijil
    Hsi, Ling Lieng
    Boon, Soh Cheong
    Gunawan, Erry
    [J]. 2016 38TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2016, : 6234 - 6237
  • [5] Whispered speech to neutral speech conversion using bidirectional LSTMs
    Meenakshi, G. Nisha
    Ghosh, Prasanta Kumar
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 491 - 495
  • [6] Segmentation of Heart Sound Using Double-threshold
    Chen Jie
    Hou Hailiang
    [J]. 2013 FIFTH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA 2013), 2013, : 985 - 988
  • [7] Heart sound segmentation algorithm based on heart sound envelogram
    Liang, H
    Lukkarinen, S
    Hartimo, I
    [J]. COMPUTERS IN CARDIOLOGY 1997, VOL 24, 1997, 24 : 105 - 108
  • [8] Video object segmentation by multi-scale attention using bidirectional strategy
    Wang, Jingxin
    Zhang, Yunfeng
    Bao, Fangxun
    Liu, Yuetong
    Zhang, Qiuyue
    Zhang, Caiming
    [J]. IMAGE AND VISION COMPUTING, 2024, 148
  • [9] Learning Fashion Compatibility with Bidirectional LSTMs
    Han, Xintong
    Wu, Zuxuan
    Jiang, Yu-Gang
    Davis, Larry S.
    [J]. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1078 - 1086
  • [10] Better Anomaly Detection for Access Attacks Using Deep Bidirectional LSTMs
    Clausen, Henry
    Grov, Gudmund
    Sabate, Marc
    Aspinall, David
    [J]. MACHINE LEARNING FOR NETWORKING, MLN 2020, 2021, 12629 : 1 - 18