Segmentation of expiratory and inspiratory sounds in baby cry audio recordings using hidden Markov models

被引：15

作者：

Aucouturier, Jean-Julien ^{[1
]}

Nonaka, Yulri ^{[2
]}

Katahira, Kentaro ^{[2
]}

Okanoya, Kazuo ^{[2
]}

机构：

[1] Temple Univ, Dept Comp & Informat Sci, Minato Ku, Tokyo 1060047, Japan

[2] RIKEN Brain Sci Inst, JST ERATO Okanoya Emot Informat Project, Wako, Saitama 3510198, Japan

来源：

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2011年 / 130卷 / 05期

关键词：

INFANT CRY; RECOGNITION;

D O I：

10.1121/1.3641377

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The paper describes an application of machine learning techniques to identify expiratory and inspiration phases from the audio recording of human baby cries. Crying episodes were recorded from 14 infants, spanning four vocalization contexts in their first 12 months of age; recordings from three individuals were annotated manually to identify expiratory and inspiratory sounds and used as training examples to segment automatically the recordings of the other 11 individuals. The proposed algorithm uses a hidden Markov model architecture, in which state likelihoods are estimated either with Gaussian mixture models or by converting the classification decisions of a support vector machine. The algorithm yields up to 95% classification precision (86% average), and its ability generalizes over different babies, different ages, and vocalization contexts. The technique offers an opportunity to quantify expiration duration, count the crying rate, and other time-related characteristics of baby crying for screening, diagnosis, and research purposes over large populations of infants. (C) 2011 Acoustical Society of America. [DOI: 10.1121/1.3641377]

引用

页码：2969 / 2977

页数：9

共 50 条

[41] Name segmentation using hidden Markov models and its application in record linkage
Braga Goncalves, Rita de Cassia
Freire, Sergio Miranda
CADERNOS DE SAUDE PUBLICA, 2014, 30 (10): : 2039 - 2048
[42] Multiscale texture segmentation using wavelet-domain hidden Markov models
Choi, HK
Baraniuk, R
CONFERENCE RECORD OF THE THIRTY-SECOND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 1998, : 1692 - 1697
[43] Multiscale image segmentation using wavelet-domain hidden Markov models
Choi, H
Baraniuk, RG
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2001, 10 (09) : 1309 - 1321
[44] Automatic Segmentation of the Second Cardiac Sound by Using Wavelets and Hidden Markov Models
Lima, C. S.
Barbosa, D.
2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Vols 1-8, 2008, : 334 - 337
[45] Contourlet based multiresolution texture segmentation using contextual hidden Markov models
Raghavendra, BS
Bhat, PS
INTELLIGENT INFORMATION TECHNOLOGY, PROCEEDINGS, 2004, 3356 : 336 - 343
[46] Multiscale document segmentation using wavelet-domain hidden Markov models
Choi, H
Baraniuk, R
DOCUMENT RECOGNITION AND RETRIEVAL VII, 2000, 3967 : 234 - 247
[47] UNSUPERVISED TEXTURE SEGMENTATION USING MULTICHANNEL DECOMPOSITION AND HIDDEN MARKOV-MODELS
CHEN, JL
KUNDU, A
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1995, 4 (05) : 603 - 619
[48] A Phonetic Segmentation Procedure Based on Hidden Markov Models
Pakoci, Edvin
Popovic, Branislav
Jakovljevic, Niksa
Pekar, Darko
Yassa, Fathy
SPEECH AND COMPUTER, 2016, 9811 : 67 - 74
[49] Hidden Markov measure field models for image segmentation
Marroquin, JL
Santana, EA
Botello, S
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (11) : 1380 - 1387
[50] Segmentation of heart sound recordings by a duration-dependent hidden Markov model
Schmidt, S. E.
Holst-Hansen, C.
Graff, C.
Toft, E.
Struijk, J. J.
PHYSIOLOGICAL MEASUREMENT, 2010, 31 (04) : 513 - 529

← 1 2 3 4 5 →