Infant cry classification by MFCC feature extraction with MLP and CNN structures

被引:15
|
作者
Abbaskhah, Ahmad [1 ,4 ]
Sedighi, Hamed [2 ,3 ,5 ]
Marvi, Hossein [4 ]
机构
[1] Sharif Univ Technol, Dept Elect Engn, Sharif, Iran
[2] Beijing Inst Technol, Sch Aerosp & Engn, Beijing, Peoples R China
[3] Shahrood Univ Technol, Fac Mech Engn, Shahrood, Iran
[4] Shahrood Univ Technol, Fac Elect Engn, Shahrood, Iran
[5] Shahrood Univ Technol, Fac Mech Engn, Shahrood 3619995161, Iran
关键词
Infant cry; Mel-frequency Cepstral Coefficient; Multilayer perceptron; Support vector machine; Convolutional neural network; SMOTE; Classification; IDENTIFICATION;
D O I
10.1016/j.bspc.2023.105261
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
In this study, Dunstan's infant cry data set is pre-processed with the feature vector approach, including MFCC (19 features) and energy (one feature). By using extracted features and Support Vector Machine (SVM), Multilayer Perceptron (MLP), and Convolutional Neural Network (CNN) classifiers, five classes of infant cry ("Neh" = hungry; "Eh" = need to burp; "Owh" = tired; "Eairh" = stomach cramp; "Heh" = physical discomfort) are distinguished. The proposed MLP and CNN structures are analyzed according to the loss and the accuracy based on the epoch; moreover, to evaluate the performance of classifiers AUC-ROC, Confusion matrix, accuracy, f1_score, recall, and precision have been used. All three classifiers are analyzed, and their results show that the CNN-designed model has the best performance. Results show that the performance will improve by increasing the complexity of the model. With this approach, classifiers are run 10 times, and the average accuracy for SVM for SMOTE and non-SMOTE data are obtained with tolerance 0.823 +/- 0.02, 0.861 +/- 0.02, respectively. These accuracies for MLP are 0.876 +/- 0.01, 0.892 +/- 0.01, and finally, for CNN, are 0.921 +/- 0.005, 0.911 +/- 0.005. At the best condition, an accuracy of 92.1 % is obtained for five classes of infant cries by the proposed CNN structure.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] A review of infant cry analysis and classification
    Chunyan Ji
    Thosini Bamunu Mudiyanselage
    Yutong Gao
    Yi Pan
    EURASIP Journal on Audio, Speech, and Music Processing, 2021
  • [32] Feature extraction and classification techniques for health monitoring of structures
    Amezquita-Sanchez, J. P.
    Adeli, H.
    SCIENTIA IRANICA, 2015, 22 (06) : 1931 - 1940
  • [33] The Research of Feature Extraction Based on MFCC for Speaker Recognition
    Zhang Wanli
    Li Guoxin
    2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 1074 - 1077
  • [34] Acoustic monitoring and classification of bee swarm activity using MFCC feature extraction and HMM acoustic modeling
    Zgank, Andrej
    12TH INTERNATIONAL CONFERENCE ELEKTRO 2018, 2018,
  • [35] Inception MLP: A vision MLP backbone for multi-scale feature extraction
    Li, Jia
    Yang, Rongchao
    Cao, Xinyan
    Zeng, Bo
    Shi, Zhandong
    Ren, Wei
    Cao, Xixin
    INFORMATION SCIENCES, 2025, 701
  • [36] FDCNet: Presentation of the Fuzzy CNN and Fractal Feature Extraction for Detection and Classification of Tumors
    Molaei, Sepideh
    Ghorbani, Niloofar
    Dashtiahangar, Fatemeh
    Peivandi, Mohammad
    Pourasad, Yaghoub
    Esmaeili, Mona
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [37] Gait feature extraction and gait classification using two-branch CNN
    Xiuhui Wang
    Jiajia Zhang
    Multimedia Tools and Applications, 2020, 79 : 2917 - 2930
  • [38] Gait feature extraction and gait classification using two-branch CNN
    Wang, Xiuhui
    Zhang, Jiajia
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (3-4) : 2917 - 2930
  • [39] SPECTRAL-SPATIAL FEATURE EXTRACTION BASED CNN FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    Quan, Yinghui
    Dong, Shuxian
    Feng, Wei
    Dauphin, Gabriel
    Zhao, Guoping
    Wang, Yong
    Xing, Mengdao
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 485 - 488
  • [40] Retraction Note: Spectrogram analysis of ECG signal and classification efficiency using MFCC feature extraction technique
    Yalamanchili Arpitha
    G. L. Madhumathi
    N. Balaji
    Journal of Ambient Intelligence and Humanized Computing, 2024, 15 (Suppl 1) : 235 - 235