Segmentation of expiratory and inspiratory sounds in baby cry audio recordings using hidden Markov models

被引:15
|
作者
Aucouturier, Jean-Julien [1 ]
Nonaka, Yulri [2 ]
Katahira, Kentaro [2 ]
Okanoya, Kazuo [2 ]
机构
[1] Temple Univ, Dept Comp & Informat Sci, Minato Ku, Tokyo 1060047, Japan
[2] RIKEN Brain Sci Inst, JST ERATO Okanoya Emot Informat Project, Wako, Saitama 3510198, Japan
来源
关键词
INFANT CRY; RECOGNITION;
D O I
10.1121/1.3641377
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The paper describes an application of machine learning techniques to identify expiratory and inspiration phases from the audio recording of human baby cries. Crying episodes were recorded from 14 infants, spanning four vocalization contexts in their first 12 months of age; recordings from three individuals were annotated manually to identify expiratory and inspiratory sounds and used as training examples to segment automatically the recordings of the other 11 individuals. The proposed algorithm uses a hidden Markov model architecture, in which state likelihoods are estimated either with Gaussian mixture models or by converting the classification decisions of a support vector machine. The algorithm yields up to 95% classification precision (86% average), and its ability generalizes over different babies, different ages, and vocalization contexts. The technique offers an opportunity to quantify expiration duration, count the crying rate, and other time-related characteristics of baby crying for screening, diagnosis, and research purposes over large populations of infants. (C) 2011 Acoustical Society of America. [DOI: 10.1121/1.3641377]
引用
收藏
页码:2969 / 2977
页数:9
相关论文
共 50 条
  • [41] Name segmentation using hidden Markov models and its application in record linkage
    Braga Goncalves, Rita de Cassia
    Freire, Sergio Miranda
    CADERNOS DE SAUDE PUBLICA, 2014, 30 (10): : 2039 - 2048
  • [42] Multiscale texture segmentation using wavelet-domain hidden Markov models
    Choi, HK
    Baraniuk, R
    CONFERENCE RECORD OF THE THIRTY-SECOND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 1998, : 1692 - 1697
  • [43] Multiscale image segmentation using wavelet-domain hidden Markov models
    Choi, H
    Baraniuk, RG
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2001, 10 (09) : 1309 - 1321
  • [44] Automatic Segmentation of the Second Cardiac Sound by Using Wavelets and Hidden Markov Models
    Lima, C. S.
    Barbosa, D.
    2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Vols 1-8, 2008, : 334 - 337
  • [45] Contourlet based multiresolution texture segmentation using contextual hidden Markov models
    Raghavendra, BS
    Bhat, PS
    INTELLIGENT INFORMATION TECHNOLOGY, PROCEEDINGS, 2004, 3356 : 336 - 343
  • [46] Multiscale document segmentation using wavelet-domain hidden Markov models
    Choi, H
    Baraniuk, R
    DOCUMENT RECOGNITION AND RETRIEVAL VII, 2000, 3967 : 234 - 247
  • [47] UNSUPERVISED TEXTURE SEGMENTATION USING MULTICHANNEL DECOMPOSITION AND HIDDEN MARKOV-MODELS
    CHEN, JL
    KUNDU, A
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 1995, 4 (05) : 603 - 619
  • [48] A Phonetic Segmentation Procedure Based on Hidden Markov Models
    Pakoci, Edvin
    Popovic, Branislav
    Jakovljevic, Niksa
    Pekar, Darko
    Yassa, Fathy
    SPEECH AND COMPUTER, 2016, 9811 : 67 - 74
  • [49] Hidden Markov measure field models for image segmentation
    Marroquin, JL
    Santana, EA
    Botello, S
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (11) : 1380 - 1387
  • [50] Segmentation of heart sound recordings by a duration-dependent hidden Markov model
    Schmidt, S. E.
    Holst-Hansen, C.
    Graff, C.
    Toft, E.
    Struijk, J. J.
    PHYSIOLOGICAL MEASUREMENT, 2010, 31 (04) : 513 - 529