Wavelet sub-band features for voice disorder detection and classification

被引:0
|
作者
Girish Gidaye
Jagannath Nirmal
Kadria Ezzine
Mondher Frikha
机构
[1] Research scholar at K. J. Somaiya College of Engineering,Vidyalankar Institute of Technology
[2] Vidyalankar Institute of Technology,Department of Electronics Engineering
[3] K. J. Somaiya College of Engineering,ATISP, ENET’COM
[4] Sfax University,undefined
来源
关键词
Voice disorder detection; Stationary wavelet transform; Voice pathology; Statistical features; Feature selection; Information gain;
D O I
暂无
中图分类号
学科分类号
摘要
Acoustic analysis of the speech signal enables non-intrusive, affordable, unbiased and fast assessment of voice pathologies. This assessment provides complimentary information to otolaryngologist for preliminary diagnosis of pathological larynx. Several voice impairment assessment systems focused on acoustic analysis have been introduced in recent years. Nevertheless, these systems are tested using only one or two datasets and are not independent of database and human bias. In this paper, a unified wavelet based framework is suggested for evaluating voice disorders, which is independent of database and human bias. Stationary wavelet transform (SWT) is used to decompose the speech signal, since it offers good time and frequency localization. Energy and statistical features are extracted from each sub-band after multilevel decomposition. Higher the decomposition level, higher is the order of feature vector. To decrease the dimension of the feature vector, information gain (IG) based feature selection technique is harnessed for selecting most relevant and discarding redundant features. The enriched feature vector is assessed using support vector machine (SVM), stochastic gradient descent (SGD) and artificial neural network (ANN) classifiers. Records of vowel /a/, vocalized at natural pitch for both healthy and pathological subjects, are mined from German, English, Arabic and Spanish speech databases. During the first phase of experiments, input speech signal is detected as healthy or pathological. Second phase classifies input speech samples into healthy, cyst, paralysis or polyp. Experimental results demonstrate that, the extracted energy and statistical features can be used as possible clues for voice disorder evaluation. The most important aspect of the proposed method is that the features are independent of the fundamental frequency. The detection and classification rates attained are comparable to other state-of-the-art approaches.
引用
收藏
页码:28499 / 28523
页数:24
相关论文
共 50 条
  • [1] Wavelet sub-band features for voice disorder detection and classification
    Gidaye, Girish
    Nirmal, Jagannath
    Ezzine, Kadria
    Frikha, Mondher
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (39-40) : 28499 - 28523
  • [2] Nonlinear Features of Bark Wavelet Sub-band Filtering for Pathological Voice Recognition
    Zhang, Xiao-Jun
    Zhu, Xin-Cheng
    Wu, Di
    Xiao, Zhong-Zhe
    Tao, Zhi
    Zhao, He-Ming
    [J]. ENGINEERING LETTERS, 2021, 29 (01) : 49 - 60
  • [3] A Discrete Wavelet Transform-Based Voice Activity Detection and Noise Classification With Sub-Band Selection
    Abdullah, Salinna
    Zamani, Majid
    Demosthenous, Andreas
    [J]. 2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,
  • [4] Acoustic classification of battlefield transient events using wavelet sub-band features
    Azimi-Sadjadi, M. R.
    Jiang, Y.
    Srinivasan, S.
    [J]. UNATTENDED GROUND, SEA, AND AIR SENSOR TECHNOLOGIES AND APPLICATIONS IX, 2007, 6562
  • [5] Wavelet based robust sub-band features for phoneme recognition
    Farooq, O
    Datta, S
    [J]. IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2004, 151 (03): : 187 - 193
  • [6] Epileptic Electroencephalogram Classification using Relative Wavelet Sub-band Energy and Wavelet Entropy
    Hadiyoso, S.
    Irawati, I. D.
    Rizal, A.
    [J]. INTERNATIONAL JOURNAL OF ENGINEERING, 2021, 34 (01): : 75 - 81
  • [7] Epileptic electroencephalogram classification using relative wavelet sub-band energy and wavelet entropy
    Hadiyoso, S.
    Irawati, I.D.
    Rizal, A.
    [J]. International Journal of Engineering, Transactions A: Basics, 2021, 34 (01): : 75 - 81
  • [8] ROBUST VOICE ACTIVITY DETECTION BASED ON PITCH AND SUB-BAND ENERGY
    Zhang, Zhihao
    Lin, Jinlong
    [J]. SIGMAP 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS, 2009, : 44 - 48
  • [9] WAVELET SUB-BAND BASED TEMPORAL FEATURES FOR ROBUST HINDI PHONEME RECOGNITION
    Farooq, O.
    Datta, S.
    Shrotriya, M. C.
    [J]. INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2010, 8 (06) : 847 - 859
  • [10] Wavelet-based texture features:: A new method for sub-band characterization
    Mourougaya, F
    Carré, P
    Fernandez-Maloigne, C
    [J]. 2005 International Conference on Image Processing (ICIP), Vols 1-5, 2005, : 69 - 72