Dimensionality Reduction of Modulation Frequency Features for Speech Discrimination

Cited by: 0
Authors
Markaki, Maria [1]
Stylianou, Yannis [1]
Affiliations
[1] Univ Crete, Dept Comp Sci, Khania, Greece
Keywords
modulation spectrum; multilinear algebra; feature selection; mutual information; speech discrimination
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We describe a dimensionality reduction method for modulation spectral features that preserves the time-varying information relevant to the classification task. Because the acoustic and modulation frequency subspaces differ in redundancy and discriminative power, we first employ a generalization of SVD to tensors (Higher Order SVD) to reduce dimensions. Projecting the modulation spectral features onto the principal axes with the highest energy in each subspace yields a compact feature set. We then estimate the relevance of these projections to speech discrimination from their mutual information with the target class. Reconstructing modulation spectrograms from the "best" 22 features back to the initial dimensions shows that modulation spectral features close to syllable and phoneme rates, as well as the pitch values of speakers, are preserved.
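The pipeline outlined in the abstract (per-subspace truncation via Higher Order SVD, projection, mutual-information ranking, reconstruction) can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation. The synthetic data, the tensor layout (samples x acoustic frequency x modulation frequency), the energy threshold of 0.5, the histogram-based mutual information estimate, and all function names are assumptions introduced here; only the count of 22 retained features comes from the abstract.

```python
import numpy as np

def unfold(tensor, mode):
    """Mode-n unfolding: rows index the chosen mode, columns all other modes."""
    return np.moveaxis(tensor, mode, 0).reshape(tensor.shape[mode], -1)

def hosvd_axes(tensor, modes, energy=0.9):
    """For each mode, keep the leading left singular vectors of its unfolding
    that together hold at least `energy` of the squared singular values."""
    axes = {}
    for mode in modes:
        u, s, _ = np.linalg.svd(unfold(tensor, mode), full_matrices=False)
        cumulative = np.cumsum(s**2) / np.sum(s**2)
        k = int(np.searchsorted(cumulative, energy) + 1)
        axes[mode] = u[:, :k]
    return axes

def project(tensor, axes):
    """Project each selected mode onto its retained principal axes."""
    out = tensor
    for mode, u in axes.items():
        out = np.moveaxis(np.tensordot(u.T, out, axes=(1, mode)), 0, mode)
    return out

def reconstruct(core, axes):
    """Map projected features back to the original dimensions."""
    out = core
    for mode, u in axes.items():
        out = np.moveaxis(np.tensordot(u, out, axes=(1, mode)), 0, mode)
    return out

def mutual_information(feature, labels, bins=8):
    """Histogram (plug-in) estimate of I(feature; label) in bits.
    `labels` must be non-negative integers."""
    edges = np.histogram_bin_edges(feature, bins=bins)
    binned = np.digitize(feature, edges[1:-1])  # bin indices 0 .. bins-1
    joint = np.zeros((bins, labels.max() + 1))
    np.add.at(joint, (binned, labels), 1)
    joint /= joint.sum()
    px = joint.sum(axis=1, keepdims=True)
    py = joint.sum(axis=0, keepdims=True)
    nz = joint > 0
    return float(np.sum(joint[nz] * np.log2(joint[nz] / (px @ py)[nz])))

# Synthetic stand-in for modulation spectra (assumed layout:
# samples x acoustic frequency x modulation frequency).
rng = np.random.default_rng(0)
labels = rng.integers(0, 2, size=200)
pattern = np.outer(np.hanning(12), np.hanning(10))
X = rng.normal(size=(200, 12, 10)) + labels[:, None, None] * pattern

# Reduce the two frequency subspaces, leaving the sample mode untouched.
axes = hosvd_axes(X, modes=(1, 2), energy=0.5)
Z = project(X, axes)
features = Z.reshape(len(X), -1)

# Rank projected features by estimated relevance to the class label.
scores = np.array([mutual_information(f, labels) for f in features.T])
top = np.argsort(scores)[::-1][:22]

# Zero out everything except the selected features, then reconstruct.
kept = np.zeros_like(features)
kept[:, top] = features[:, top]
X_hat = reconstruct(kept.reshape(Z.shape), axes)
print(Z.shape, X_hat.shape)
```

Treating the acoustic and modulation frequency modes separately lets each subspace be truncated to a different rank, which matches the abstract's point that the two subspaces differ in redundancy. The plug-in estimator used here is only one of several ways to estimate mutual information and is sensitive to the number of bins.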
Pages: 646-649
Page count: 4
Related papers
50 in total
  • [31] DISGUISED DISCRIMINATION OF LOCALITY-BASED UNSUPERVISED DIMENSIONALITY REDUCTION
    Yang, Bo
    Chen, Songcan
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2010, 24 (07) : 1011 - 1025
  • [32] Modulation features for speech and music classification
    Mubarak, Omer Mohsin
    Ambikairajah, Eliathamby
    Epps, Julien
    Gunawan, Teddy Surya
    2006 10TH IEEE SINGAPORE INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS, VOLS 1 AND 2, 2006, : 764 - +
  • [33] Dimensionality Reduction of Facial Features to Recognize Emotion State
    Gaikwad, Kiran P.
    Rani, C. M. Sheela
    Mahajan, S. B.
    Sanjeevikumar, P.
    ADVANCES IN SYSTEMS, CONTROL AND AUTOMATION, 2018, 442 : 719 - 725
  • [34] Dimensionality Reduction for Speech Recognition Using Neighborhood Components Analysis
    Singh-Miller, Natasha
    Collins, Michael
    Hazen, Timothy J.
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1397 - 1400
  • [35] Dimensionality reduction of visual features for efficient retrieval and classification
    Boufounos, Petros T.
    Mansour, Hassan
    Rane, Shantanu
    Vetro, Anthony
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2016, 5
  • [36] Improving the hate speech analysis through dimensionality reduction approach
    Rai, Neha
    Meena, Pooja
    Agrawal, Chetan
    2020 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATION SYSTEMS (ICACCS), 2020, : 321 - 325
  • [37] Dimensionality reduction method of analog circuit fault features
    Liao, Jian
    Zhou, Shaolei
    Shi, Xianjun
    Wang, Zhen
    Zhendong Ceshi Yu Zhenduan/Journal of Vibration, Measurement and Diagnosis, 2015, 35 (02): : 302 - 308
  • [38] THE EFFECT OF FREQUENCY COMPRESSION ON SPEECH-DISCRIMINATION
    SALAND, J
    PARTRIDGE, LD
    CLINICAL RESEARCH, 1993, 41 (01): : A53 - A53
  • [39] Speech Modulation Features for Robust Nonnative Speech Accent Detection
    Sam, Sethserey
    Xiao, Xiong
    Besacier, Laurent
    Castelli, Eric
    Li, Haizhou
    Chng, Eng Siong
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2428 - 2431
  • [40] Voice activity detection in noise using modulation spectrum of speech: Investigation of speech frequency and modulation frequency ranges
    Pek, Kimhuoch
    Arai, Takayuki
    Kanedera, Noboru
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2012, 33 (01) : 33 - 44