Discrimination Effectiveness of Speech Cepstral Features

被引:0
|
作者
Malegaonkar, A. [1 ]
Ariyaeeinia, A. [1 ]
Sivakumaran, P. [1 ]
Pillay, S. [1 ]
机构
[1] Univ Hertfordshire, Hatfield AL10 9AB, Herts, England
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, the discrimination capabilities of speech cepstra for text and speaker related information are investigated. For this purpose, Bhattacharya distance metric is used as the measure of discrimination. The scope of the study covers static and dynamic cepstra derived using the linear prediction analysis (LPCC) as well as mel-frequency analysis (MFCC). The investigations also include the assessment of the linear prediction-based mel-frequency cepstral coefficients (LP-MFCC) as an alternative speech feature type. It is shown experimentally that whilst contaminations in speech unfavourably affect the performance of all types of cepstra, the effects are more severe in the case of MFCC. Furthermore, it is shown that with a combination of static and dynamic features, LP-based mel-frequency cepstra (LP-MFCC) exhibit the best discrimination capabilities in almost all experimental cases.
引用
收藏
页码:91 / 99
页数:9
相关论文
共 50 条
  • [1] CEPSTRAL MEAN BASED SPEECH SOURCE DISCRIMINATION
    Greenhall, Adam
    Atlas, Les
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4490 - 4493
  • [2] Speech recognition using cepstral articulatory features
    Najnin, Shamima
    Banerjee, Bonny
    SPEECH COMMUNICATION, 2019, 107 : 26 - 37
  • [3] Features of speech in neurodegenerative diseases revealed by cepstral analysis
    Goursky, I.
    Likhachev, S.
    Charnukha, T.
    Rushkevich, Y.
    EUROPEAN JOURNAL OF NEUROLOGY, 2016, 23 : 396 - 396
  • [4] Robust Speech Recognition Combining Cepstral and Articulatory Features
    Zha, Zhuan-ling
    Hu, Jin
    Zhan, Qing-ran
    Shan, Ya-hui
    Xie, Xiang
    Wang, Jing
    Cheng, Hao-bo
    PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 1401 - 1405
  • [5] Comparison of the effectiveness of cepstral coefficients for Russian speech synthesis detection
    Efanov, Dmitry
    Aleksandrov, Pavel
    Mironov, Ilia
    JOURNAL OF COMPUTER VIROLOGY AND HACKING TECHNIQUES, 2024, 20 (03) : 375 - 382
  • [6] NMF-based Cepstral Features for Speech Emotion Recognition
    Lashkari, Milad
    Seyedin, Sanaz
    2018 4TH IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS), 2018, : 189 - 193
  • [7] Speech Emotion Recognition Using Auditory Spectrogram and Cepstral Features
    Zhao, Shujie
    Yang, Yan
    Cohen, Israel
    Zhang, Lijun
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 136 - 140
  • [8] Combining Evidences from Mel Cepstral and Cochlear Cepstral Features for Speaker Recognition Using Whispered Speech
    Raikar, Aditya
    Gandhi, Ami
    Patil, Hemant A.
    TEXT, SPEECH, AND DIALOGUE (TSD 2015), 2015, 9302 : 405 - 413
  • [9] Healthy and Parkinson voices discrimination based on compensation/normalization cepstral features
    Meghraoui, Djamila
    Boudraa, Bachir
    Djeddou, Mustapha
    Meksen, Thouraya Merazi
    Boudraa, Malika
    PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON APPLIED SMART SYSTEMS (ICASS), 2018,
  • [10] Robustifying cepstral features by mitigating the outlier effect for noisy speech recognition
    Fan, Hao-teng
    Hsieh, Kuan-wei
    Huang, Chien-hao
    Hung, Jeih-weih
    2013 10TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2013, : 935 - 939