Analysis and classification of speech signals by generalized fractal dimension features

被引:33
|
作者
Pitsikalis, Vassilis [1 ]
Maragos, Petros [1 ]
机构
[1] Natl Tech Univ Athens, Sch Elect & Comp Engn, GR-15773 Athens, Greece
关键词
Feature extraction; Generalized fractal dimensions; Broad class phoneme classification; MULTIFRACTAL NATURE; ATTRACTORS; TURBULENCE; DYNAMICS; MODELS;
D O I
10.1016/j.specom.2009.06.005
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We explore nonlinear signal processing methods inspired by dynamical systems and fractal theory in order to analyze and characterize speech sounds. A speech signal is at first embedded in a multidimensional phase-space and further employed for the estimation of measurements related to the fractal dimensions. Our goals are to compute these raw measurements in the practical cases of speech signals, to further utilize them for the extraction of simple descriptive features and to address issues on the efficacy of the proposed features to characterize speech sounds. We observe that distinct feature vector elements obtain values or show statistical trends that on average depend on general characteristics such as the voicing, the manner and the place of articulation of broad phoneme classes. Moreover the way that the statistical parameters of the features are altered as an effect of the variation of phonetic characteristics seem to follow some roughly formed patterns. We also discuss some qualitative aspects concerning the linear phoneme-wise correlation between the fractal features and the commonly employed mel-frequency cepstral coefficients (MFCCs) demonstrating phonetic cases of maximal and minimal correlation. In the same context we also investigate the fractal features' spectral content, in terms of the most and least correlated components with the MFCC. Further the proposed methods are examined under the light of indicative phoneme classification experiments. These quantify the efficacy of the features to characterize broad classes of speech sounds. The results are shown to be comparable for some classification scenarios with the corresponding ones of the MFCC features. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:1206 / 1223
页数:18
相关论文
共 50 条
  • [1] Selection of Fractal Dimension Features for Speech Emotion Classification
    Tamulevicius, Gintautas
    Karbauskaite, Rasa
    Dzemvda, Gintautas
    [J]. 2017 OPEN CONFERENCE OF ELECTRICAL, ELECTRONIC AND INFORMATION SCIENCES (ESTREAM), 2017,
  • [2] Speech emotion classification using fractal dimension-based features
    Tamulevicius, Gintautas
    Karbauskaite, Rasa
    Dzemyda, Gintautas
    [J]. NONLINEAR ANALYSIS-MODELLING AND CONTROL, 2019, 24 (05): : 679 - 695
  • [3] Classification of EEG Signals using Fractal Dimension Features and Artificial Neural Networks
    Vazquez, Roberto A.
    Salazar-Varas, R.
    [J]. 2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 1747 - 1752
  • [4] Fractal Dimension Analysis of Uroflowmetry Signals
    Kara, Sarper
    Ertas, Metin
    Uzunoglu, Cengiz Polat
    Akan, Aydin
    [J]. 2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
  • [5] Analysis and classification of tissue section images using directional fractal dimension features
    Shang, CJ
    Daly, C
    McGrath, J
    Barker, J
    [J]. 2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL I, PROCEEDINGS, 2000, : 164 - 167
  • [6] On Radar Signals Analysis Using the Fractal Dimension
    Pashchenko, Ruslan
    Ivankov, Viktor
    Tsyupak, Dmitry
    [J]. 2017 4TH INTERNATIONAL SCIENTIFIC-PRACTICAL CONFERENCE PROBLEMS OF INFOCOMMUNICATIONS-SCIENCE AND TECHNOLOGY (PIC S&T), 2017, : 351 - 354
  • [7] Detection of Voice Pathology using Fractal Dimension in a Multiresolution Analysis of Normal and Disordered Speech Signals
    Zulfiqar Ali
    Irraivan Elamvazuthi
    Mansour Alsulaiman
    Ghulam Muhammad
    [J]. Journal of Medical Systems, 2016, 40
  • [8] Detection of Voice Pathology using Fractal Dimension in a Multiresolution Analysis of Normal and Disordered Speech Signals
    Ali, Zulfiqar
    Elamvazuthi, Irraivan
    Alsulaiman, Mansour
    Muhammad, Ghulam
    [J]. JOURNAL OF MEDICAL SYSTEMS, 2016, 40 (01) : 1 - 10
  • [9] Analysis and classification of commercial ham slice images using directional fractal dimension features
    Mendoza, Fernando
    Valous, Nektarios A.
    Allen, Paul
    Kenny, Tony A.
    Ward, Paddy
    Sun, Da-Wen
    [J]. MEAT SCIENCE, 2009, 81 (02) : 313 - 320
  • [10] Classification of electromyography signals using relevance vector machines and fractal dimension
    Lima, Clodoaldo A. M.
    Coelho, Andre L. V.
    Madeo, Renata C. B.
    Peres, Sarajane M.
    [J]. NEURAL COMPUTING & APPLICATIONS, 2016, 27 (03): : 791 - 804