Linear Frequency Residual Features for Infant Cry Classification

Authors
Uthiraa, S. [1]
Kachhi, Aastha [1]
Patil, Hemant A. [1]
Affiliations
[1] DA IICT, Speech Res Lab, Gandhinagar, Gujarat, India
Keywords
Infant cry classification; Excitation source information; LP residual; Linear frequency residual cepstral coefficients
DOI
10.1007/978-3-031-48309-7_44
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Classification of normal vs. pathological infant cries is a socially relevant task, as crying is the only known mode of infant communication. Because the high pitch-source harmonics sample the vocal tract system's spectrum quasi-periodically, the resulting spectrum gives extremely poor spectral resolution for commonly used features. This paper investigates the effect of excitation source-based features, captured using the Linear Prediction (LP) residual, on the classification of normal vs. pathological infant cries. The performance of Linear Frequency Residual Cepstral Coefficients (LFRCC) was compared under matched conditions (of train and test data) against state-of-the-art feature sets, namely Mel Frequency Cepstral Coefficients (MFCC) and Linear Frequency Cepstral Coefficients (LFCC), using a Gaussian Mixture Model (GMM) and a Convolutional Neural Network (CNN) as classifiers. The study also investigated the effect of LFRCC in cross-database (i.e., mismatched conditions) and combined-database evaluation scenarios. LFRCC outperformed MFCC and LFCC by 24.9% and 17.43%, respectively, for mismatched conditions, and by 0.27%-1.11% for the combined database. The relatively better performance of the LFRCC feature set may be due to its capability to represent excitation source information, which is very prevalent in infant cries because formant structures are not well developed in the initial period of life.
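As a rough illustration of the feature pipeline named in the abstract, the sketch below computes LFRCC-style features: an LP residual per frame, followed by a linearly spaced triangular filterbank, a log, and a DCT. It is a minimal sketch, not the authors' implementation. The abstract gives no configuration, so every function name and parameter value here (LP order 12, 400-sample frames with a 160-sample hop, i.e. 25 ms and 10 ms if the sampling rate is 16 kHz, a 512-point FFT, 40 filters, 13 coefficients, Hamming window) is an assumption chosen for illustration.

```python
import numpy as np


def lp_coefficients(frame, order):
    """Autocorrelation-method LP via Levinson-Durbin; returns a with a[0] = 1."""
    n = len(frame)
    r = np.correlate(frame, frame, mode="full")[n - 1 : n + order]
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0] + 1e-12  # small guard against an all-zero frame
    for i in range(1, order + 1):
        acc = r[i] + np.dot(a[1:i], r[i - 1 : 0 : -1])
        k = -acc / err  # reflection coefficient
        prev = a.copy()
        a[1:i] = prev[1:i] + k * prev[i - 1 : 0 : -1]
        a[i] = k
        err = max(err * (1.0 - k * k), 1e-12)
    return a


def lp_residual(frame, order):
    """Inverse-filter the frame with A(z): e[n] = sum_k a[k] * s[n - k]."""
    a = lp_coefficients(frame, order)
    return np.convolve(frame, a)[: len(frame)]


def linear_filterbank(n_filters, n_fft):
    """Triangular filters with centres equally spaced on a linear frequency axis."""
    n_bins = n_fft // 2 + 1
    edges = np.linspace(0, n_bins - 1, n_filters + 2)
    bins = np.arange(n_bins)
    fb = np.zeros((n_filters, n_bins))
    for m in range(n_filters):
        lo, mid, hi = edges[m], edges[m + 1], edges[m + 2]
        rising = (bins - lo) / (mid - lo)
        falling = (hi - bins) / (hi - mid)
        fb[m] = np.clip(np.minimum(rising, falling), 0.0, None)
    return fb


def lfrcc(signal, order=12, frame_len=400, hop=160, n_fft=512,
          n_filters=40, n_ceps=13):
    """LFRCC-style features; all default values are illustrative assumptions."""
    window = np.hamming(frame_len)
    fb = linear_filterbank(n_filters, n_fft)
    # Unnormalised DCT-II basis, shape (n_ceps, n_filters)
    k = np.arange(n_ceps)[:, None]
    n = np.arange(n_filters)[None, :]
    dct = np.cos(np.pi * k * (n + 0.5) / n_filters)
    feats = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start : start + frame_len] * window
        res = lp_residual(frame, order)
        power = np.abs(np.fft.rfft(res, n_fft)) ** 2
        log_energy = np.log(fb @ power + 1e-10)
        feats.append(dct @ log_energy)
    return np.array(feats)
```

Calling `lfrcc(x)` on a one-dimensional NumPy array `x` returns one row of cepstral coefficients per frame. Swapping `linear_filterbank` for a mel-spaced filterbank and skipping `lp_residual` would give an MFCC-style baseline for comparison, again only as an approximation of the feature sets the abstract mentions.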
Pages: 550-561
Number of pages: 12