Segmentation of expiratory and inspiratory sounds in baby cry audio recordings using hidden Markov models

被引:15
|
作者
Aucouturier, Jean-Julien [1 ]
Nonaka, Yulri [2 ]
Katahira, Kentaro [2 ]
Okanoya, Kazuo [2 ]
机构
[1] Temple Univ, Dept Comp & Informat Sci, Minato Ku, Tokyo 1060047, Japan
[2] RIKEN Brain Sci Inst, JST ERATO Okanoya Emot Informat Project, Wako, Saitama 3510198, Japan
来源
关键词
INFANT CRY; RECOGNITION;
D O I
10.1121/1.3641377
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The paper describes an application of machine learning techniques to identify expiratory and inspiration phases from the audio recording of human baby cries. Crying episodes were recorded from 14 infants, spanning four vocalization contexts in their first 12 months of age; recordings from three individuals were annotated manually to identify expiratory and inspiratory sounds and used as training examples to segment automatically the recordings of the other 11 individuals. The proposed algorithm uses a hidden Markov model architecture, in which state likelihoods are estimated either with Gaussian mixture models or by converting the classification decisions of a support vector machine. The algorithm yields up to 95% classification precision (86% average), and its ability generalizes over different babies, different ages, and vocalization contexts. The technique offers an opportunity to quantify expiration duration, count the crying rate, and other time-related characteristics of baby crying for screening, diagnosis, and research purposes over large populations of infants. (C) 2011 Acoustical Society of America. [DOI: 10.1121/1.3641377]
引用
收藏
页码:2969 / 2977
页数:9
相关论文
共 50 条
  • [1] Automatic segmentation of infant cry signals using hidden Markov models
    Naithani, Gaurav
    Kivinummi, Jaana
    Virtanen, Tuomas
    Tammela, Outi
    Peltola, Mikko J.
    Leppanen, Jukka M.
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2018,
  • [2] Automatic segmentation of infant cry signals using hidden Markov models
    Gaurav Naithani
    Jaana Kivinummi
    Tuomas Virtanen
    Outi Tammela
    Mikko J. Peltola
    Jukka M. Leppänen
    EURASIP Journal on Audio, Speech, and Music Processing, 2018
  • [3] Detection of cough signals in continuous audio recordings using hidden Markov models
    Matos, Sergio
    Birring, Surinder S.
    Pavord, Ian D.
    Evans, David H.
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2006, 53 (06) : 1078 - 1083
  • [4] A fully automated approach for baby cry signal segmentation and boundary detection of expiratory and inspiratory episodes
    Abou-Abbas, Lina
    Tadj, Chakib
    Fersaie, Hesam Alaie
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 142 (03): : 1318 - 1331
  • [5] HEALTHCARE AUDIO EVENT CLASSIFICATION USING HIDDEN MARKOV MODELS AND HIERARCHICAL HIDDEN MARKOV MODELS
    Peng, Ya-Ti
    Lin, Ching-Yung
    Sun, Ming-Ting
    Tsai, Kun-Cheng
    ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 1218 - +
  • [6] Analysis of swallowing sounds using hidden Markov models
    Mohammad Aboofazeli
    Zahra Moussavi
    Medical & Biological Engineering & Computing, 2008, 46 : 307 - 314
  • [7] Analysis of swallowing sounds using hidden Markov models
    Aboofazeli, Mohammad
    Moussavi, Zahra
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2008, 46 (04) : 307 - 314
  • [8] Phonocardiogram segmentation by using hidden markov models
    Lima, Carlos S.
    Cardoso, Manuel. J.
    PROCEEDINGS OF THE FIFTH IASTED INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING, 2007, : 415 - 418
  • [9] A Study on Classification of Heart Sounds Using Hidden Markov Models
    Hee-Keun, Kim
    Young-Joo, Chung
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2006, 25 (03): : 144 - 150
  • [10] Segmentation of yeast DNA using hidden Markov models
    Peshkin, L
    Gelfand, MS
    BIOINFORMATICS, 1999, 15 (12) : 980 - 986