Linear Frequency Residual Features for Infant Cry Classification

被引:0
|
作者
Uthiraa, S. [1 ]
Kachhi, Aastha [1 ]
Patil, Hemant A. [1 ]
机构
[1] DA IICT, Speech Res Lab, Gandhinagar, Gujarat, India
来源
关键词
Infant cry classification; Excitation source information; LP residual; Linear frequency residual cepstral coefficients;
D O I
10.1007/978-3-031-48309-7_44
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Classification of normal vs. pathological infant cries is a socially relevant task as crying is the only known mode of infant communication. Due to quasi-periodic sampling of the vocal tract system, the spectrum formed by high pitch-source harmonics results in extremely poor spectral resolution for commonly used features. This paper investigates the effect of excitation source-based features captured using Linear Prediction Residual for classification of normal vs. pathological infant cries. The performance of Linear Frequency Residual Cepstral Coefficients (LFRCC) was compared for matched conditions (of train and test data) against state-of-the-art feature sets, namely, Mel Frequency Cepstral Coefficients (MFCC) and Linear Frequency Cepstral Coefficients (LFCC) using Gaussian Mixture Model (GMM) and Convolutional Neural Network (CNN) as classifiers. This study also investigated the effect of LFRCC on cross-database (i.e., mismatched conditions) and combined database evaluation scenarios. It was observed that LFRCC outperformed MFCC and LFCC by 24.9% and 17.43%, respectively, for mismatched conditions and over 0.27%-1.11% for the combined database. The relatively better performance of LFRCC feature set maybe due to its capability in representing excitation source information, which is very prevalent in infant cry as formant structures are not well developed in the initial period of life.
引用
收藏
页码:550 / 561
页数:12
相关论文
共 50 条
  • [21] Infant Cry Recognition using Excitation Source Features
    Singh, Avinash Kumar
    Mukhopadhyay, Jayanta
    Kumar, Sunil S. B.
    Rao, K. Sreenivasa
    [J]. 2013 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2013,
  • [22] A review: survey on automatic infant cry analysis and classification
    Jeyaraman S.
    Muthusamy H.
    Khairunizam W.
    Jeyaraman S.
    Nadarajaw T.
    Yaacob S.
    Nisha S.
    [J]. Health and Technology, 2018, 8 (5) : 391 - 404
  • [23] Newborn cry nonlinear features extraction and classification
    Peralta-Malvaez, Lizbeth
    Lopez-Rincon, Omar
    Rojas-Velazquez, David
    Oswaldo Valencia-Rosado, Luis
    Rosas-Romero, Roberto
    Etcheverry, Gibran
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 34 (05) : 3281 - 3289
  • [24] Child Cry Classification - An Analysis of Features and Models
    Kulkarni, Prathamesh
    Umarani, Sarthak
    Diwan, Vaishnavi
    Korde, Vishakha
    Rege, Priti P.
    [J]. 2021 6TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2021,
  • [25] Particle Swarm Optimisation of Mel-frequency Cepstral Coefficients Computation for the Classification of Asphyxiated Infant Cry
    Zabidi, A.
    Mansor, W.
    Lee, Y. K.
    Yassin, A. I. Mohd
    Sahak, R.
    [J]. 2010 3RD INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS (BMEI 2010), VOLS 1-7, 2010, : 991 - 995
  • [26] Automatic Diagnosis of Asphyxia Infant Cry Signals Using Wavelet Based Mel Frequency Cepstrum Features
    Badreldine, Omnia Magdy
    Elbeheiry, Nourhan A.
    Haroon, Ahmad Nashaat M.
    ElShehaby, Saleh
    Marzook, Eman Mohamed
    [J]. 2018 14TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO), 2018, : 96 - 100
  • [27] Frequency Amplitude Frequency Amplitude Classification of Pathological Infant Cries using Modulation Spectrogram Features
    Chittora, Anshu
    Patil, Hemant A.
    [J]. 2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 541 - 545
  • [28] Mel-frequency cepstrum coefficients extraction from infant cry for classification of normal and pathological cry with feed-forward neural networks
    García, JO
    García, CAR
    [J]. PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 3140 - 3145
  • [29] Constant-Q Based Harmonic and Pitch Features for Normal vs. Pathological Infant Cry Classification
    Pusuluri, Aditya
    Kachhi, Aastha
    Patil, Hemant A.
    [J]. SPEECH AND COMPUTER, SPECOM 2023, PT II, 2023, 14339 : 407 - 420
  • [30] Auditory Filterbank Learning Using ConvRBM for Infant Cry Classification
    Sailor, Hardik B.
    Patil, Hemant A.
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 706 - 710