Dimensionality Reduction of Modulation Frequency Features for Speech Discrimination

被引:0
|
作者
Markaki, Maria [1 ]
Stylianou, Yannis [1 ]
机构
[1] Univ Crete, Dept Comp Sci, Khania, Greece
关键词
modulation spectrum; multilinear algebra; feature selection; mutual information; speech discrimination;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe a dimensionality reduction method for modulation spectral features, which keeps the time-varying information of interest to the classification task. Due to the varying degrees of redundancy and discriminative power of the acoustic and modulation frequency subspaces, we first employ a generalization of SVD to tensors (Higher Order SVD) to reduce dimensions. Projection of modulation spectral features on the principal axes with the higher energy in each subspace results in a compact feature set. We further estimate the relevance of these projections to speech discrimination based on mutual information to the target class. Reconstruction of modulation spectrograms from the "best" 22 features back to the initial dimensions, shows that modulation spectral features close to syllable and phoneme rates as well as pitch values of speakers are preserved.
引用
收藏
页码:646 / 649
页数:4
相关论文
共 50 条
  • [11] Modulation features for speech recognition
    Dimitriadis, D
    Maragos, P
    Potamianos, L
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 377 - 380
  • [12] Discrimination Effectiveness of Speech Cepstral Features
    Malegaonkar, A.
    Ariyaeeinia, A.
    Sivakumaran, P.
    Pillay, S.
    BIOMETRICS AND IDENTITY MANAGEMENT, 2008, 5372 : 91 - 99
  • [13] Dimensionality reduction for discrimination: removal of common structures with iSTAC
    Graham I Cummins
    Alexander G Dimitrov
    BMC Neuroscience, 11 (Suppl 1)
  • [14] Novel Energy Separation Based Frequency Modulation Features For Spoofed Speech Classification
    Kamble, Madhu R.
    Patil, Hemant A.
    2017 NINTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION (ICAPR), 2017, : 326 - 331
  • [15] Amplitude and Frequency Modulation-based features for detection of replay Spoof Speech
    Kamble, Madhu R.
    Tak, Hemlata
    Patil, Hemant A.
    SPEECH COMMUNICATION, 2020, 125 : 114 - 127
  • [16] Constructing modulation frequency domain-based features for robust speech recognition
    Hung, Jeih-Weih
    Tsai, Wei-Yi
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (03): : 563 - 577
  • [17] Sequential dimensionality reduction for extracting localized features
    Casalino, Gabriella
    Gillis, Nicolas
    PATTERN RECOGNITION, 2017, 63 : 15 - 29
  • [18] Guided autoencoder for dimensionality reduction of pedestrian features
    Xuan Li
    Tao Zhang
    Xin Zhao
    Zhengming Yi
    Applied Intelligence, 2020, 50 : 4557 - 4567
  • [19] Guided autoencoder for dimensionality reduction of pedestrian features
    Li, Xuan
    Zhang, Tao
    Zhao, Xin
    Yi, Zhengming
    APPLIED INTELLIGENCE, 2020, 50 (12) : 4557 - 4567
  • [20] Learning in discrimination of frequency or modulation rate: generalization to fundamental frequency discrimination
    Grimault, N
    Micheyl, C
    Carlyon, RP
    Bacon, SP
    Collet, L
    HEARING RESEARCH, 2003, 184 (1-2) : 41 - 50