Analysis and classification of speech signals by generalized fractal dimension features

被引:33
|
作者
Pitsikalis, Vassilis [1 ]
Maragos, Petros [1 ]
机构
[1] Natl Tech Univ Athens, Sch Elect & Comp Engn, GR-15773 Athens, Greece
关键词
Feature extraction; Generalized fractal dimensions; Broad class phoneme classification; MULTIFRACTAL NATURE; ATTRACTORS; TURBULENCE; DYNAMICS; MODELS;
D O I
10.1016/j.specom.2009.06.005
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We explore nonlinear signal processing methods inspired by dynamical systems and fractal theory in order to analyze and characterize speech sounds. A speech signal is at first embedded in a multidimensional phase-space and further employed for the estimation of measurements related to the fractal dimensions. Our goals are to compute these raw measurements in the practical cases of speech signals, to further utilize them for the extraction of simple descriptive features and to address issues on the efficacy of the proposed features to characterize speech sounds. We observe that distinct feature vector elements obtain values or show statistical trends that on average depend on general characteristics such as the voicing, the manner and the place of articulation of broad phoneme classes. Moreover the way that the statistical parameters of the features are altered as an effect of the variation of phonetic characteristics seem to follow some roughly formed patterns. We also discuss some qualitative aspects concerning the linear phoneme-wise correlation between the fractal features and the commonly employed mel-frequency cepstral coefficients (MFCCs) demonstrating phonetic cases of maximal and minimal correlation. In the same context we also investigate the fractal features' spectral content, in terms of the most and least correlated components with the MFCC. Further the proposed methods are examined under the light of indicative phoneme classification experiments. These quantify the efficacy of the features to characterize broad classes of speech sounds. The results are shown to be comparable for some classification scenarios with the corresponding ones of the MFCC features. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:1206 / 1223
页数:18
相关论文
共 50 条
  • [21] Fractal Analysis of Cardiorespiratory Signals for Sleep Stage Classification
    Castiglioni, Paolo
    Faini, Andrea
    Parati, Gianfranco
    Lombardi, Carolina
    [J]. 2014 8TH CONFERENCE OF THE EUROPEAN STUDY GROUP ON CARDIOVASCULAR OSCILLATIONS (ESGCO), 2014, : 83 - +
  • [22] RESEARCH FROM SINGLE FRACTAL DIMENSION TO GEN-ERALIZED FRACTAL DIMENSION OF SPEECH
    董远
    胡光锐
    陈玮
    [J]. Journal of Shanghai Jiaotong University(Science), 1998, (02) : 54 - 57+62
  • [23] Fractal dimension analysis of audio signals for Indian musical instrument recognition
    Gunasekaran, S.
    Revathy, K.
    [J]. 2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 257 - 261
  • [24] Analysis of Fractal Dimension of EEG Signals Under Mobile Phone Radiation
    Smitha, C. K.
    Narayanan, N. K.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, INFORMATICS, COMMUNICATION AND ENERGY SYSTEMS (SPICES), 2015,
  • [25] Fractal dimension analysis for spike detection in low SNR extracellular signals
    Salmasi, Mehrdad
    Buettner, Ulrich
    Glasauer, Stefan
    [J]. JOURNAL OF NEURAL ENGINEERING, 2016, 13 (03)
  • [26] Features importance analysis for emotional speech classification
    Tao, JH
    Kang, YG
    [J]. AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2005, 3784 : 449 - 457
  • [27] On texture classification using fractal dimension
    Chen, YQ
    Bi, G
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 1999, 13 (06) : 929 - 943
  • [28] A comparative study of fractal dimension estimation for speech
    Fekkai, S
    Al-Akaidi, M
    Blackledge, J
    [J]. SIMULATION AND MODELLING: ENABLERS FOR A BETTER QUALITY OF LIFE, 2000, : 676 - 680
  • [29] A speech recognition approach with MFCC and fractal dimension
    Yao, Minghai
    Hu, Jing
    [J]. DCABES 2006 Proceedings, Vols 1 and 2, 2006, : 349 - 351
  • [30] MEC-Based Evacuation Planning Using Variance Fractal Dimension Trajectory for Speech Classification
    DeSantis, Christopher
    Refaey, Ahmed
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2021,