Infant's Cry Sound Classification using Mel-Frequency Cepstrum Coefficients Feature Extraction and Backpropagation Neural Network

被引:0
|
作者
Rosita, Yesy Diah [1 ]
Junaedi, Hartarto [2 ]
机构
[1] Univ Islam Majapahit, Informat Engn Study Program, Mojokerto, Indonesia
[2] Sekolah Tinggi Tekn Surabaya, Informat Engn Dept, Surabaya, Indonesia
来源
2016 2ND INTERNATIONAL CONFERENCE ON SCIENCE AND TECHNOLOGY-COMPUTER (ICST) | 2016年
关键词
infant's cry sound; pitch; energy; harmonic ratio; mel-frequency cepstrum coefficients; backpropagation neural network;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Crying is a communication method used by infants given the limitations of language. Parents or nannies who have never had the experience to take care of the baby will experience anxiety when the infant is crying. Therefore, we need a way to understand about infant's cry and apply the formula. This research develops a system to classify the infant's cry sound using MACF (Mel-Frequency Cepstrum Coefficients) feature extraction and BNN (Backpropagation Neural Network) based on voice type. It is classified into 3 classes: hungry, discomfort, and tired. A voice input must be ascertained as infant's cry sound which using 3 features extraction (pitch with 2 approaches: Modified Autocorrelation Function and Cepstrum Pitch Determination, Energy, and Harmonic Ratio). The features coefficients of MFCC are furthermore classified by Backpropagation Neural Network. The experiment shows that the system can classify the infant's cry sound quite well, with 30 coefficients and 10 neurons in the hidden layer.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Mel-frequency cepstrum coefficients extraction from infant cry for classification of normal and pathological cry with feed-forward neural networks
    García, JO
    García, CAR
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 3140 - 3145
  • [2] Iris feature extraction through wavelet mel-frequency cepstrum coefficients
    Barpanda, Soubhagya Sankar
    Majhi, Banshidhar
    Sa, Panjak Kumar
    Sangaiah, Arun Kumar
    Bakshi, Sambit
    OPTICS AND LASER TECHNOLOGY, 2019, 110 : 13 - 23
  • [3] Mel-Frequency Cepstrum Coefficient Analysis of Infant Cry with Hypothyroidism
    Zabidi, Azlee
    Mansor, Wahidah
    Khuan, Lee Yoot
    Sahak, Rohilah
    Rahman, Farah Yasmin Abd
    CSPA: 2009 5TH INTERNATIONAL COLLOQUIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, PROCEEDINGS, 2009, : 204 - 208
  • [4] Modelling and Characterization of an Artificial Neural Network for Infant Cry Recognition Using Mel-Frequency Cepstral Coefficients
    Bandala, Argel A.
    Lim, Allimzon M.
    Cai, Mark Anthony D.
    Bacar, Allan Jeffrey C.
    Manosca, Aynna Claudine G.
    TENCON 2014 - 2014 IEEE REGION 10 CONFERENCE, 2014,
  • [5] Convolution neural network based automatic speech emotion recognition using Mel-frequency Cepstrum coefficients
    Pawar, Manju D.
    Kokate, Rajendra D.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (10) : 15563 - 15587
  • [6] Convolution neural network based automatic speech emotion recognition using Mel-frequency Cepstrum coefficients
    Manju D. Pawar
    Rajendra D. Kokate
    Multimedia Tools and Applications, 2021, 80 : 15563 - 15587
  • [7] Filled Pause Classification Using Energy-Boosted Mel-Frequency Cepstrum Coefficients
    Hamzah, Raseeda
    Jamil, Nursuriati
    Seman, Noraini
    8TH INTERNATIONAL CONFERENCE ON ROBOTIC, VISION, SIGNAL PROCESSING & POWER APPLICATIONS: INNOVATION EXCELLENCE TOWARDS HUMANISTIC TECHNOLOGY, 2014, 291 : 311 - 319
  • [8] Particle Swarm Optimisation of Mel-frequency Cepstral Coefficients Computation for the Classification of Asphyxiated Infant Cry
    Zabidi, A.
    Mansor, W.
    Lee, Y. K.
    Yassin, A. I. Mohd
    Sahak, R.
    2010 3RD INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS (BMEI 2010), VOLS 1-7, 2010, : 991 - 995
  • [9] Algorithm for Gunshot Detection Using Mel-Frequency Cepstrum Coefficients (MFCC)
    Suman, Preetam
    Karan, Subhdeep
    Singh, Vrijendra
    Maringanti, R.
    PROCEEDINGS OF NINTH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATION AND SENSOR NETWORKS (WCSN 2013), 2014, 299 : 155 - 166
  • [10] Boosting speech/non-speech classification using averaged Mel-frequency Cepstrum Coefficients features
    Xiong, ZY
    Huang, TS
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2002, PROCEEDING, 2002, 2532 : 573 - 580