Dimensionality Reduction for Speech Emotion Features by Multiscale Kernels

被引：0

作者：

Xu, Xinzhou ^{[1
,2
]}

Deng, Jun ^{[1
]}

Zheng, Wenming ^{[2
]}

Zhao, Li ^{[2
]}

Schuller, Bjoern ^{[3
,4
]}

机构：

[1] Tech Univ Munich, MMK, MISP Grp, D-80290 Munich, Germany

[2] Southeast Univ, Sch Informat Sci & Engn, Nanjing, Jiangsu, Peoples R China

[3] Imperial Coll London, Dept Comp, London, England

[4] Univ Passau, Chair Complex & Intelligent Syst, Passau, Germany

来源：

16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5 | 2015年

关键词：

speech emotion recognition; dimensionality reduction; multiscale kernels; RECOGNITION;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

To achieve efficient and compact low-dimensional features for speech emotion recognition, this paper proposes a novel feature reduction method using multiscale kernels in the framework of graph embedding. With Fisher discriminant embedding graph, multiscale Gaussian kernels are used in constructing optimal linear combination of Gram matrices for multiple kernel learning. To evaluate the proposed method, comprehensive experiments, using different public feature sets from the open-source toolbox openSMILE on various corpora, show that the proposed method achieves better performance compared with conventional linear dimensionality reduction methods and single kernel methods.

引用

页码：1532 / 1536

页数：5

共 50 条

[21] Multiscale kernels
Opfer, Roland
ADVANCES IN COMPUTATIONAL MATHEMATICS, 2006, 25 (04) : 357 - 380
[22] Evaluation of graph embedding approach for dimensionality reduction using different kernels
Naeemi, Mohammad Amin
Mohseni, Hadis
2017 3RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND IMAGE ANALYSIS (IPRIA), 2017, : 69 - 74
[23] Multiscale kernels
Roland Opfer
Advances in Computational Mathematics, 2006, 25 : 357 - 380
[24] Sequential dimensionality reduction for extracting localized features
Casalino, Gabriella
Gillis, Nicolas
PATTERN RECOGNITION, 2017, 63 : 15 - 29
[25] NOVEL AFFECTIVE FEATURES FOR MULTISCALE PREDICTION OF EMOTION IN MUSIC
Kumar, Naveen
Guha, Tanaya
Huang, Che-Wei
Vaz, Colin
Narayanan, Shrikanth S.
2016 IEEE 18TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2016,
[26] Guided autoencoder for dimensionality reduction of pedestrian features
Xuan Li
Tao Zhang
Xin Zhao
Zhengming Yi
Applied Intelligence, 2020, 50 : 4557 - 4567
[27] Guided autoencoder for dimensionality reduction of pedestrian features
Li, Xuan
Zhang, Tao
Zhao, Xin
Yi, Zhengming
APPLIED INTELLIGENCE, 2020, 50 (12) : 4557 - 4567
[28] SPEECH EMOTION RECOGNITION WITH MULTISCALE AREA ATTENTION AND DATA AUGMENTATION
Xu, Mingke
Zhang, Fan
Cui, Xiaodong
Zhang, Wei
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6319 - 6323
[29] Speech emotion recognition based on multimodal and multiscale feature fusion
Hu, Huangshui
Wei, Jie
Sun, Hongyu
Wang, Chuhang
Tao, Shuo
SIGNAL IMAGE AND VIDEO PROCESSING, 2025, 19 (01)
[30] Dimensionality reduction for visualization of normal and pathological speech data
Goddard, J.
Schlotthauer, G.
Torres, M. E.
Rufiner, H. L.
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2009, 4 (03) : 194 - 201

← 1 2 3 4 5 →