Locally Discriminant Diffusion Projection and Its Application in Speech Emotion Recognition

被引:1
|
作者
Xu, Xinzhou [1 ]
Huang, Chengwei [2 ]
Wu, Chen [1 ]
Zhao, Li [3 ]
机构
[1] Southeast Univ, Minist Educ, Key Lab Underwater Acoust Signal Proc, Nanjing, Jiangsu, Peoples R China
[2] Soochow Univ, Sch Phys Sci & Technol, Suzhou, Peoples R China
[3] Soochow Univ, Minist Educ, Key Lab Child Dev & Learning Sci, Key Lab Underwater Acoust Signal Proc, Suzhou, Peoples R China
关键词
diffusion maps; graph embedding framework; locally discriminant diffusion projection; speech emotion recognition; DIMENSIONALITY REDUCTION; FRAMEWORK; FEATURES;
D O I
10.7305/automatika.2016.07.853
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The existing Diffusion Maps method brings diffusion to data samples by Markov random walk. In this paper, to provide a general solution form of Diffusion Maps, first, we propose the generalized single-graph-diffusion embedding framework on the basis of graph embedding framework. Second, by designing the embedding graph of the framework, an algorithm, namely Locally Discriminant Diffusion Projection (LDDP), is proposed for speech emotion recognition. This algorithm is the projection form of the improved Diffusion Maps, which includes both discriminant information and local information. The linear or kernelized form of LDDP (i.e., LLDDP or KLDDP) is used to achieve the dimensionality reduction of original speech emotion features. We validate the proposed algorithm on two widely used speech emotion databases, EMO-DB and eNTERFACE'05. The experimental results show that the proposed LDDP methods, including LLDDP and KLDDP, outperform some other state-of-the-art dimensionality reduction methods which are based on graph embedding or discriminant analysis.
引用
收藏
页码:37 / 45
页数:9
相关论文
共 50 条
  • [41] A Preliminary Study on Augmenting Speech Emotion Recognition using a Diffusion Model
    Malik, Mohammad Ibrahim
    Latif, Siddique
    Jurdak, Raja
    Schuller, Bjoern W.
    INTERSPEECH 2023, 2023, : 646 - 650
  • [42] Analysis of information in speech and its application in speech recognition
    Kajarekar, SS
    Hermansky, H
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2000, 1902 : 283 - 288
  • [43] Regularized discriminant analysis and its application to face recognition
    Dai, DQ
    Yuen, PC
    PATTERN RECOGNITION, 2003, 36 (03) : 845 - 847
  • [44] Application of Vector Quantization in Emotion Recognition from Human Speech
    Khanna, Preeti
    Kumar, M. Sasi
    INFORMATION INTELLIGENCE, SYSTEMS, TECHNOLOGY AND MANAGEMENT, 2011, 141 : 118 - +
  • [45] Application of prosody modification for Speech Recognition in different Emotion conditions
    Raju, V. V. Vidyadhara
    Gangamohan, P.
    Gangashetty, Suryakanth V.
    Vuppala, Anil Kumar
    PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 951 - 954
  • [46] Application of Improved Spectral Subtraction Algorithm for Speech Emotion Recognition
    Zhang Wanli
    Li Guoxin
    Wang Lirong
    PROCEEDINGS 2015 IEEE FIFTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING BDCLOUD 2015, 2015, : 213 - 216
  • [47] Emotion Recognition and its Application in Software Engineering
    Kolakowska, Agata
    Landowska, Agnieszka
    Szwoch, Mariusz
    Szwoch, Wioleta
    Wrobel, Michal R.
    2013 6TH INTERNATIONAL CONFERENCE ON HUMAN SYSTEM INTERACTIONS (HSI), 2013, : 532 - 539
  • [48] Multi-Source Discriminant Subspace Alignment for Cross-Domain Speech Emotion Recognition
    Li, Shaokai
    Song, Peng
    Zheng, Wenming
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 2448 - 2460
  • [49] Speech Emotion Recognition Based on Linear Discriminant Analysis and Support Vector Machine Decision Tree
    Mao, Jun-Wei
    He, Yong
    Liu, Zhen-Tao
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 5529 - 5533
  • [50] Speech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching
    Zhang, Shiqing
    Zhang, Shiliang
    Huang, Tiejun
    Gao, Wen
    IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (06) : 1576 - 1590