Pathological Voice Classification Based on Wavelet Packet Multiscale Analysis

被引:0
|
作者
Zhang, Xuehui [1 ]
Hu, Weiping [1 ]
机构
[1] Guangxi Normal Univ, Fac Elect Engn, Guilin 541004, Peoples R China
基金
中国国家自然科学基金;
关键词
vocal fold paralysis; wavelet packet transform; nonlinear feature; Support Vector Machine (SVM);
D O I
10.1145/3302425.3302450
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vocal fold paralysis is a common type of laryngeal diseases, and it is fundamental different from vocal fold non-paralysis (including vocal fold nodules, polyps, etc.). Based on the differences between the various of laryngeal diseases, a classification algorithm for pathological voice is proposed. In which the wavelet packet transformation and multi-scale analysis are used. Firstly, the sub-bands of different frequency signals are obtained using wavelet packet decomposition, and then the nonlinear features i.e. the Hurst parameter, 2-Renyi entropy, Box-counting dimension and attractor are extracted from different frequency bands. These features are used to evaluate the contribution of each frequency band in detecting and classifying pathological voices. The experimental data are derived from the Massachusetts Eye and Ear Infirmary (MEEI) database and the Saarbrucken Voice Database (SVD). The experimental results show that when using the support vector machine to classify the Hurst parameter and the 2-Renyi entropy combined features, the method used in this paper can achieve good classification results when classifying normal, paralysis and non-paralysis these three kinds of voices on those two databases. The average classification accuracies on these two databases are 98.37% and 92.83% respectively.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Classification of Pathological and Healthy Voice Using Perceptual Wavelet Packet Decomposition and Support Vector Machine
    Arslan, Ozkan
    [J]. 2020 MEDICAL TECHNOLOGIES CONGRESS (TIPTEKNO), 2020,
  • [2] Voice Disorder Classification Using MLP and Wavelet Packet Transform
    Barizao, Andrieli H.
    Fermino, Murillo A.
    Dajer, Maria E.
    Liboni, Luisa H. B.
    Spatti, Danilo H.
    [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [3] Statistical damage classification method based on wavelet packet analysis
    Law, S. S.
    Zhu, X. Q.
    Tian, Y. J.
    Li, X. Y.
    Wu, S. Q.
    [J]. STRUCTURAL ENGINEERING AND MECHANICS, 2013, 46 (04) : 459 - 486
  • [4] Pathological Voice Analysis and Classification Based on Empirical Mode Decomposition
    Schlotthauer, Gaston
    Torres, Maria E.
    Rufiner, Hugo L.
    [J]. DEVELOPMENT OF MULTIMODAL INTERFACES: ACTIVE LISTING AND SYNCHRONY, 2010, 5967 : 364 - +
  • [5] Classification of pathological and normal voice based on linear discriminant analysis
    Lee, Ji-Yeoun
    Jeong, SangBae
    Hahn, Minsoo
    [J]. ADAPTIVE AND NATURAL COMPUTING ALGORITHMS, PT 2, 2007, 4432 : 382 - +
  • [6] Voice activity detection based on using wavelet packet
    Eshaghi, Mohadese
    Mollaei, M. R. Karami
    [J]. DIGITAL SIGNAL PROCESSING, 2010, 20 (04) : 1102 - 1115
  • [7] Wavelet packet based respiratory sound classification
    Pesu, L
    Ademovic, E
    Pesquet, JC
    Helisto, P
    [J]. PROCEEDINGS OF THE IEEE-SP INTERNATIONAL SYMPOSIUM ON TIME-FREQUENCY AND TIME-SCALE ANALYSIS, 1996, : 377 - 380
  • [8] Fabric texture classification based on wavelet packet
    Liu Shuguang
    Qu Pingge
    [J]. ICEMI 2007: PROCEEDINGS OF 2007 8TH INTERNATIONAL CONFERENCE ON ELECTRONIC MEASUREMENT & INSTRUMENTS, VOL II, 2007, : 38 - 41
  • [9] Dysphonic voice classification using wavelet packet transform and artificial neural network
    Schuck, A
    Guimaraes, LV
    Wisbeck, JO
    [J]. PROCEEDINGS OF THE 25TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-4: A NEW BEGINNING FOR HUMAN HEALTH, 2003, 25 : 2958 - 2961
  • [10] An efficient voice pathology classification scheme based on applying multi-layer linear discriminant analysis to wavelet packet-based features
    Akbari, Ali
    Arjmandi, Meisam Khalil
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2014, 10 : 209 - 223