Discrimination Effectiveness of Speech Cepstral Features

被引:0
|
作者
Malegaonkar, A. [1 ]
Ariyaeeinia, A. [1 ]
Sivakumaran, P. [1 ]
Pillay, S. [1 ]
机构
[1] Univ Hertfordshire, Hatfield AL10 9AB, Herts, England
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, the discrimination capabilities of speech cepstra for text and speaker related information are investigated. For this purpose, Bhattacharya distance metric is used as the measure of discrimination. The scope of the study covers static and dynamic cepstra derived using the linear prediction analysis (LPCC) as well as mel-frequency analysis (MFCC). The investigations also include the assessment of the linear prediction-based mel-frequency cepstral coefficients (LP-MFCC) as an alternative speech feature type. It is shown experimentally that whilst contaminations in speech unfavourably affect the performance of all types of cepstra, the effects are more severe in the case of MFCC. Furthermore, it is shown that with a combination of static and dynamic features, LP-based mel-frequency cepstra (LP-MFCC) exhibit the best discrimination capabilities in almost all experimental cases.
引用
收藏
页码:91 / 99
页数:9
相关论文
共 50 条
  • [41] BLIND SPEECH SEGMENTATION USING SPECTROGRAM IMAGE-BASED FEATURES AND MEL CEPSTRAL COEFFICIENTS
    Stan, Adriana
    Valentini-Botinhao, Cassia
    Orza, Bogdan
    Giurgiu, Mircea
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 597 - 602
  • [42] Hierarchical subband linear predictive cepstral (HSLPC) features for HMM-based speech recognition
    Chengalvarayan, R
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 409 - 412
  • [43] Multi-resolution cepstral features for phoneme recognition across speech sub-bands
    McCourt, P
    Vaseghi, S
    Harte, N
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 557 - 560
  • [44] Hierarchical subband linear predictive cepstral (HSLPC) features for HMM-based speech recognition
    Chengalvarayan, Rathinavelu
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 409 - 412
  • [45] Estimation of BMI Status via Speech Signals using Short-term Cepstral Features
    Berkai, Chawki
    Hariharan, M.
    Yaacob, Sazali
    Omar, Mohd Iqbal
    PROCEEDINGS 5TH IEEE INTERNATIONAL CONFERENCE ON CONTROL SYSTEM, COMPUTING AND ENGINEERING (ICCSCE 2015), 2015, : 195 - 199
  • [46] Speech music discrimination using class-specific features
    Beierholm, T
    Baggenstoss, PM
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, 2004, : 379 - 382
  • [47] Discrimination Capability of Prosodic and Spectral Features for Emotional Speech Recognition
    Delic, V.
    Bojanic, M.
    Gnjatovic, M.
    Secujski, M.
    Jovicic, S. T.
    ELEKTRONIKA IR ELEKTROTECHNIKA, 2012, 18 (09) : 51 - 54
  • [48] Cepstral coefficients effectiveness for gunshot classifying
    Svatos, Jakub
    Holub, Jan
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (07)
  • [49] Learning Speech Emotion Features by Joint Disentangling-Discrimination
    Xue, Wentao
    Huang, Zhengwei
    Luo, Xin
    Mao, Qirong
    2015 INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2015, : 374 - 379
  • [50] Regularized minimum variance distortionless response-based cepstral features for robust continuous speech recognition
    Alam, Md Jahangir
    Kenny, Patrick
    O'Shaughnessy, Douglas
    SPEECH COMMUNICATION, 2015, 73 : 28 - 46