Dimensionality Reduction for Speech Emotion Features by Multiscale Kernels

被引:0
|
作者
Xu, Xinzhou [1 ,2 ]
Deng, Jun [1 ]
Zheng, Wenming [2 ]
Zhao, Li [2 ]
Schuller, Bjoern [3 ,4 ]
机构
[1] Tech Univ Munich, MMK, MISP Grp, D-80290 Munich, Germany
[2] Southeast Univ, Sch Informat Sci & Engn, Nanjing, Jiangsu, Peoples R China
[3] Imperial Coll London, Dept Comp, London, England
[4] Univ Passau, Chair Complex & Intelligent Syst, Passau, Germany
关键词
speech emotion recognition; dimensionality reduction; multiscale kernels; RECOGNITION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
To achieve efficient and compact low-dimensional features for speech emotion recognition, this paper proposes a novel feature reduction method using multiscale kernels in the framework of graph embedding. With Fisher discriminant embedding graph, multiscale Gaussian kernels are used in constructing optimal linear combination of Gram matrices for multiple kernel learning. To evaluate the proposed method, comprehensive experiments, using different public feature sets from the open-source toolbox openSMILE on various corpora, show that the proposed method achieves better performance compared with conventional linear dimensionality reduction methods and single kernel methods.
引用
收藏
页码:1532 / 1536
页数:5
相关论文
共 50 条
  • [21] Multiscale kernels
    Opfer, Roland
    ADVANCES IN COMPUTATIONAL MATHEMATICS, 2006, 25 (04) : 357 - 380
  • [22] Evaluation of graph embedding approach for dimensionality reduction using different kernels
    Naeemi, Mohammad Amin
    Mohseni, Hadis
    2017 3RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND IMAGE ANALYSIS (IPRIA), 2017, : 69 - 74
  • [23] Multiscale kernels
    Roland Opfer
    Advances in Computational Mathematics, 2006, 25 : 357 - 380
  • [24] Sequential dimensionality reduction for extracting localized features
    Casalino, Gabriella
    Gillis, Nicolas
    PATTERN RECOGNITION, 2017, 63 : 15 - 29
  • [25] NOVEL AFFECTIVE FEATURES FOR MULTISCALE PREDICTION OF EMOTION IN MUSIC
    Kumar, Naveen
    Guha, Tanaya
    Huang, Che-Wei
    Vaz, Colin
    Narayanan, Shrikanth S.
    2016 IEEE 18TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2016,
  • [26] Guided autoencoder for dimensionality reduction of pedestrian features
    Xuan Li
    Tao Zhang
    Xin Zhao
    Zhengming Yi
    Applied Intelligence, 2020, 50 : 4557 - 4567
  • [27] Guided autoencoder for dimensionality reduction of pedestrian features
    Li, Xuan
    Zhang, Tao
    Zhao, Xin
    Yi, Zhengming
    APPLIED INTELLIGENCE, 2020, 50 (12) : 4557 - 4567
  • [28] SPEECH EMOTION RECOGNITION WITH MULTISCALE AREA ATTENTION AND DATA AUGMENTATION
    Xu, Mingke
    Zhang, Fan
    Cui, Xiaodong
    Zhang, Wei
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6319 - 6323
  • [29] Speech emotion recognition based on multimodal and multiscale feature fusion
    Hu, Huangshui
    Wei, Jie
    Sun, Hongyu
    Wang, Chuhang
    Tao, Shuo
    SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
  • [30] Dimensionality reduction for visualization of normal and pathological speech data
    Goddard, J.
    Schlotthauer, G.
    Torres, M. E.
    Rufiner, H. L.
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2009, 4 (03) : 194 - 201