Discrimination Effectiveness of Speech Cepstral Features

被引:0
|
作者
Malegaonkar, A. [1 ]
Ariyaeeinia, A. [1 ]
Sivakumaran, P. [1 ]
Pillay, S. [1 ]
机构
[1] Univ Hertfordshire, Hatfield AL10 9AB, Herts, England
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, the discrimination capabilities of speech cepstra for text and speaker related information are investigated. For this purpose, Bhattacharya distance metric is used as the measure of discrimination. The scope of the study covers static and dynamic cepstra derived using the linear prediction analysis (LPCC) as well as mel-frequency analysis (MFCC). The investigations also include the assessment of the linear prediction-based mel-frequency cepstral coefficients (LP-MFCC) as an alternative speech feature type. It is shown experimentally that whilst contaminations in speech unfavourably affect the performance of all types of cepstra, the effects are more severe in the case of MFCC. Furthermore, it is shown that with a combination of static and dynamic features, LP-based mel-frequency cepstra (LP-MFCC) exhibit the best discrimination capabilities in almost all experimental cases.
引用
收藏
页码:91 / 99
页数:9
相关论文
共 50 条
  • [21] ADAPTIVE CEPSTRAL ANALYSIS OF SPEECH
    TOKUDA, K
    KOBAYASHI, T
    IMAI, S
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (06): : 481 - 489
  • [22] Adaptive cepstral analysis of speech
    Tokyo Inst of Technology, Tokyo, Japan
    IEEE Trans Speech Audio Process, 6 (481-489):
  • [23] Time-Varying LP Cepstral Features for Improved Isolated Word Speech Recognition
    Ang, Federico
    Tsutsui, Hiroshi
    Miyanaga, Yoshikazu
    2015 IEEE INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2015, : 302 - 306
  • [24] Twin identification from speech: linear and non-linear cepstral features and models
    A. Revathi
    R. Nagakrishnan
    N. Sasikaladevi
    International Journal of Speech Technology, 2020, 23 : 183 - 189
  • [25] Twin identification from speech: linear and non-linear cepstral features and models
    Revathi, A.
    Nagakrishnan, R.
    Sasikaladevi, N.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (01) : 183 - 189
  • [26] Neural Network Based Classification of Stressed Speech Using Nonlinear Spectral and Cepstral Features
    Sanaullah, Muhammad
    Chowdhury, Masud H.
    2014 IEEE 12TH INTERNATIONAL NEW CIRCUITS AND SYSTEMS CONFERENCE (NEWCAS), 2014, : 33 - 36
  • [27] Use of Machine Learning for Deception Detection From Spectral and Cepstral Features of Speech Signals
    Fernandes, Sinead V.
    Ullah, Muhammad S.
    IEEE ACCESS, 2021, 9 : 78925 - 78935
  • [28] Composite Wavelet Model for Stability-Oriented Speech Synthesis from Cepstral Features
    Koguchi, Junya
    Sagayama, Shigeki
    2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1697 - 1701
  • [29] Gammatone Cepstral Coefficients: Biologically Inspired Features for Non-Speech Audio Classification
    Valero, Xavier
    Alias, Francesc
    IEEE TRANSACTIONS ON MULTIMEDIA, 2012, 14 (06) : 1684 - 1689
  • [30] Classification of speech under stress based on cepstral features and One-class SVM
    Besbes, Salsabil
    Lachiri, Zied
    2017 INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND DIAGNOSIS (ICCAD), 2017, : 213 - 218