Robustness of Auditory Teager Energy Cepstrum Coefficients for Classification of Pathological and Normal Voices in Noisy Environments

Cited by: 4
Authors
Salhi, Lotfi [1 ]
Cherif, Adnane [1 ]
Affiliations
[1] Univ Tunis ElManar, Fac Sci Tunis, Dept Phys, Signal Proc Lab, Tunis 1060, Tunisia
Source
Keywords
VOCAL DYSPERIODICITIES; SPEECH; PHASE; RECOGNITION; PERFORMANCE; FREQUENCY; FILTER;
DOI
10.1155/2013/435729
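Chinese Library Classification (CLC) code
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract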
This paper focuses on a robust feature extraction algorithm for the automatic classification of pathological and normal voices in noisy environments. The proposed algorithm is based on human auditory processing and the nonlinear Teager-Kaiser energy operator. The robust features, labeled Teager Energy Cepstrum Coefficients (TECCs), are computed in three steps. First, each speech signal frame is passed through a Gammatone or Mel-scale triangular filter bank. Then, the absolute value of the Teager energy operator of the short-time spectrum is calculated. Finally, the discrete cosine transform is applied to the log-filtered Teager energy spectrum. These features are used to identify pathological voices with a multilayer perceptron (MLP) neural network developed for this purpose. We evaluate the method on a mixed voice database composed of recorded voice samples from normophonic and dysphonic speakers. To show the robustness of the proposed features in detecting pathological voices at different white Gaussian noise levels, we compare their performance with results for clean environments. The experimental results show that TECCs computed from the Gammatone filter bank are more robust in noisy environments than the other extracted features, while their performance remains practically similar to that in clean environments.
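The three steps named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation. It assumes the Mel-scale triangular variant rather than the Gammatone one, applies the Teager-Kaiser operator along the frequency axis of the magnitude spectrum before the filter bank, and uses hypothetical names and parameter values (`tecc`, `mel_filter_bank`, `add_white_noise`, 24 filters, 12 coefficients, a 512-point FFT, a Hamming window) that do not come from the record. The abstract leaves the exact ordering of the filter bank and the operator open, so the ordering here is one possible reading.

```python
import numpy as np
from scipy.fft import dct


def teager_energy(x):
    """Discrete Teager-Kaiser operator: psi[n] = x[n]^2 - x[n-1] * x[n+1]."""
    x = np.asarray(x, dtype=float)
    psi = np.empty_like(x)
    psi[1:-1] = x[1:-1] ** 2 - x[:-2] * x[2:]
    psi[0], psi[-1] = psi[1], psi[-2]  # edge replication (assumed boundary handling)
    return psi


def mel_filter_bank(n_filters, n_fft, sample_rate):
    """Triangular filters evenly spaced on the Mel scale.

    Returns an array of shape (n_filters, n_fft // 2 + 1).
    """
    nyquist = sample_rate / 2.0
    mel_max = 2595.0 * np.log10(1.0 + nyquist / 700.0)
    mel_points = np.linspace(0.0, mel_max, n_filters + 2)
    hz_points = 700.0 * (10.0 ** (mel_points / 2595.0) - 1.0)
    freqs = np.linspace(0.0, nyquist, n_fft // 2 + 1)
    bank = np.zeros((n_filters, freqs.size))
    for i in range(n_filters):
        low, center, high = hz_points[i], hz_points[i + 1], hz_points[i + 2]
        rising = (freqs - low) / (center - low)
        falling = (high - freqs) / (high - center)
        bank[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return bank


def tecc(frame, sample_rate, n_filters=24, n_coeffs=12, n_fft=512):
    """Sketch of Teager energy cepstrum coefficients for one speech frame."""
    windowed = np.asarray(frame, dtype=float) * np.hamming(len(frame))
    spectrum = np.abs(np.fft.rfft(windowed, n=n_fft))        # short-time spectrum
    teager_spectrum = np.abs(teager_energy(spectrum))        # |TEO| of the spectrum
    band_energy = mel_filter_bank(n_filters, n_fft, sample_rate) @ teager_spectrum
    log_energy = np.log(band_energy + 1e-10)                 # small offset avoids log(0)
    return dct(log_energy, type=2, norm="ortho")[:n_coeffs]  # cepstral coefficients


def add_white_noise(signal, snr_db, rng):
    """Add white Gaussian noise at a target signal-to-noise ratio in dB."""
    signal = np.asarray(signal, dtype=float)
    noise_power = np.mean(signal ** 2) / (10.0 ** (snr_db / 10.0))
    return signal + rng.normal(0.0, np.sqrt(noise_power), size=signal.shape)


# Usage on a synthetic 25 ms frame (a 200 Hz tone, not real voice data).
rng = np.random.default_rng(0)
sample_rate = 16000
frame = np.sin(2 * np.pi * 200 * np.arange(400) / sample_rate)
clean_features = tecc(frame, sample_rate)
noisy_features = tecc(add_white_noise(frame, 10.0, rng), sample_rate)
```

Comparing `clean_features` with `noisy_features` over a range of `snr_db` values gives a rough sense of how much the coefficients move under additive noise. The classification stage (the MLP) and the voice database are outside the scope of this sketch.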
Pages: 8
Related papers
22 in total
  • [1] Combining Evidences from Variable Teager Energy Source and Mel Cepstral Features for Classification of Normal vs. Pathological Voices
    Patil, Hemant A.
    [J]. 2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [2] Teager Energy Cepstral Coefficients for Classification of Normal vs. Whisper Speech
    Khoria, Kuldeep
    Kamble, Madhu R.
    Patil, Hemant A.
    [J]. 28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 371 - 375
  • [4] Robust speech recognition in noisy backgrounds based on teager energy operator and auditory process
    Zhao, JH
    Kuang, JM
    Dai, QH
    [J]. CONFERENCE RECORD OF THE THIRTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 2003, : 550 - 554
  • [5] Maximum Approximate Entropy for Normal and Pathological Voices Classification
    Restrepo, Juan F.
    Schlotthauer, Gaston
    Torres, Maria E.
    [J]. VI LATIN AMERICAN CONGRESS ON BIOMEDICAL ENGINEERING (CLAIB 2014), 2014, 49 : 548 - 551
  • [6] Rayleigh modeling of teager energy operated perceptual wavelet packet coefficients for enhancing noisy speech
    Islam, Md Tauhidul
    Shahnaz, Celia
    Zhu, Wei-Ping
    Ahmad, M. Omair
    [J]. SPEECH COMMUNICATION, 2017, 86 : 64 - 74
  • [7] Teager Energy Cepstral Coefficients For Classification of Dysarthric Speech Severity-Level
    Kachhi, Aastha
    Therattil, Anand
    Patil, Ankur T.
    Sailor, Hardik B.
    Patil, Hemant A.
    [J]. PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1462 - 1468
  • [8] A semisoft thresholding method based on Teager energy operation on wavelet packet coefficients for enhancing noisy speech
    Sanam, Tahsina Farah
    Shahnaz, Celia
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2013,
  • [9] A semisoft thresholding method based on Teager energy operation on wavelet packet coefficients for enhancing noisy speech
    Tahsina Farah Sanam
    Celia Shahnaz
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2013
  • [10] Mel-frequency cepstrum coefficients extraction from infant cry for classification of normal and pathological cry with feed-forward neural networks
    García, JO
    García, CAR
    [J]. PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 3140 - 3145