Dimensionality Reduction of Modulation Frequency Features for Speech Discrimination

被引:0
|
作者
Markaki, Maria [1 ]
Stylianou, Yannis [1 ]
机构
[1] Univ Crete, Dept Comp Sci, Khania, Greece
关键词
modulation spectrum; multilinear algebra; feature selection; mutual information; speech discrimination;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe a dimensionality reduction method for modulation spectral features, which keeps the time-varying information of interest to the classification task. Due to the varying degrees of redundancy and discriminative power of the acoustic and modulation frequency subspaces, we first employ a generalization of SVD to tensors (Higher Order SVD) to reduce dimensions. Projection of modulation spectral features on the principal axes with the higher energy in each subspace results in a compact feature set. We further estimate the relevance of these projections to speech discrimination based on mutual information to the target class. Reconstruction of modulation spectrograms from the "best" 22 features back to the initial dimensions, shows that modulation spectral features close to syllable and phoneme rates as well as pitch values of speakers are preserved.
引用
收藏
页码:646 / 649
页数:4
相关论文
共 50 条
  • [41] Curvature analysis of frequency modulated manifolds in dimensionality reduction
    Guillemard, Mijail
    Iske, Armin
    CALCOLO, 2011, 48 (01) : 107 - 125
  • [42] Curvature analysis of frequency modulated manifolds in dimensionality reduction
    Mijail Guillemard
    Armin Iske
    Calcolo, 2011, 48 : 107 - 125
  • [43] IDENTIFICATION VERSUS DISCRIMINATION OF DISTINCTIVE FEATURES IN SPEECH PERCEPTION
    BLUMSTEIN, S
    COOPER, W
    QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1972, 24 (MAY): : 207 - +
  • [44] CHOICE RESPONSE TIME AND DISTINCTIVE FEATURES IN SPEECH DISCRIMINATION
    CHANANIE, JD
    TIKOFSKY, RS
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1969, 81 (01): : 161 - &
  • [45] Modulation and chaotic acoustic features for speech recognition
    Dimitriadis, D.
    Maragos, P.
    Pitsikalis, V.
    Potamianos, A.
    Control and Intelligent Systems, 2002, 30 (01) : 19 - 26
  • [46] EMOTION CLASSIFICATION OF SPEECH USING MODULATION FEATURES
    Chaspari, Theodora
    Dimitriadis, Dimitrios
    Maragos, Petros
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 1552 - 1556
  • [47] FREQUENCY DIVISION IN SPEECH BANDWIDTH REDUCTION
    BOGNER, RE
    IEEE TRANSACTIONS ON COMMUNICATION TECHNOLOGY, 1965, CO13 (04): : 438 - &
  • [48] EMG-based speech recognition using dimensionality reduction methods
    Ratnovsky, Anat
    Malayev, Sarit
    Ratnovsky, Shahar
    Naftali, Sara
    Rabin, Neta
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 14 (1) : 597 - 607
  • [49] EMG-based speech recognition using dimensionality reduction methods
    Anat Ratnovsky
    Sarit Malayev
    Shahar Ratnovsky
    Sara Naftali
    Neta Rabin
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 597 - 607
  • [50] A Comparison of Linear and Nonlinear Dimensionality Reduction Methods Applied to Synthetic Speech
    Errity, Andrew
    McKenna, John
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1079 - 1082