Speech emotion recognition using semi-NMF feature optimization

Cited by: 7
Authors
Bandela, Surekha Reddy [1]
Kumar, T. Kishore [1]
Affiliations
[1] NIT Warangal, Dept Elect & Commun Engn, Hanamkonda, Telangana, India
Keywords
Speech emotion recognition; spectral; Teager energy operator; feature fusion; semi-nonnegative matrix factorization; k-nearest neighborhood; support vector machine; FEATURE-SELECTION; CLASSIFICATION; FREQUENCY;
DOI
10.3906/elk-1903-121
Chinese Library Classification (CLC) number
TP18 [Theory of artificial intelligence];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Speech emotion recognition (SER) is an active research area. Many SER systems combine different speech features to improve performance. Training on such a large feature set, however, increases classifier complexity. Additionally, some of the features may be irrelevant to emotion detection, which reduces the emotion recognition accuracy. To overcome this drawback, feature optimization can be applied to the feature sets to obtain the most relevant emotional features before classification. In this paper, semi-nonnegative matrix factorization (semi-NMF) with singular value decomposition (SVD) initialization is used to optimize the speech features. The speech features considered in this work are mel-frequency cepstral coefficients, linear prediction cepstral coefficients, and Teager energy operator-autocorrelation (TEO-AutoCorr). Emotions are classified using k-nearest neighborhood and support vector machine (SVM) classifiers with a 5-fold cross-validation scheme. Performance is evaluated on the EMO-DB and IEMOCAP datasets in terms of classification accuracy. The results show that optimizing the feature sets with semi-NMF markedly improves the accuracy of the proposed SER system compared to the baseline SER system without optimization.
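The semi-NMF step named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it uses the commonly cited alternating scheme for semi-NMF (a least-squares update for the unconstrained factor and a multiplicative update for the nonnegative factor), and the SVD-based initialization shown here (absolute values of the leading right singular vectors plus a small offset) is an assumption, since the abstract does not specify the exact scheme. The function name `semi_nmf` and its parameters are hypothetical.

```python
import numpy as np


def semi_nmf(X, k, n_iter=200, eps=1e-9):
    """Approximate X (features x samples) as F @ G.T with G >= 0.

    F (features x k) may contain negative values; G (samples x k) is
    kept nonnegative. Illustrative sketch only.
    """
    # Assumed SVD-based initialization: absolute values of the top-k right
    # singular vectors, offset so that no entry starts at exactly zero.
    _, _, Vt = np.linalg.svd(X, full_matrices=False)
    G = np.abs(Vt[:k].T) + 0.1  # shape (samples, k)

    def pos(A):
        # Positive part of A, elementwise.
        return (np.abs(A) + A) / 2.0

    def neg(A):
        # Negative part of A, elementwise (returned as nonnegative values).
        return (np.abs(A) - A) / 2.0

    for _ in range(n_iter):
        # F update: least-squares solution with G held fixed.
        F = X @ G @ np.linalg.pinv(G.T @ G)
        # G update: multiplicative rule, which preserves nonnegativity.
        XtF = X.T @ F
        FtF = F.T @ F
        num = pos(XtF) + G @ neg(FtF)
        den = neg(XtF) + G @ pos(FtF)
        G *= np.sqrt((num + eps) / (den + eps))

    # Final F update so that F is optimal for the returned G.
    F = X @ G @ np.linalg.pinv(G.T @ G)
    return F, G
```

In a pipeline like the one the abstract describes, each row of `G` could serve as the reduced `k`-dimensional representation of one utterance and be passed to a k-NN or SVM classifier; how the paper actually maps the factors to classifier inputs is not stated in the abstract.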
Pages: 3741-3757
Number of pages: 17
Related papers
50 in total
  • [21] Speech Emotion Recognition Using Unsupervised Feature Selection Algorithms
    Bandela, Surekha Reddy
    Kumar, T. Kishore
    RADIOENGINEERING, 2020, 29 (02) : 353 - 364
  • [22] Parallel Implementation of the Nonlinear Semi-NMF Based Alternating Optimization Method for Deep Neural Networks
    Akira Imakura
    Yuto Inoue
    Tetsuya Sakurai
    Yasunori Futamura
    Neural Processing Letters, 2018, 47 : 815 - 827
  • [23] Semi-NMF Regularization-Based Autoencoder Training for Hyperspectral Unmixing
    Goel, Divyam
    Khanna, Saurabh
    2024 NATIONAL CONFERENCE ON COMMUNICATIONS, NCC, 2024,
  • [24] DNSRF: Deep Network-based Semi-NMF Representation Framework
    Wang, Dexian
    Li, Tianrui
    Deng, Ping
    Luo, Zhipeng
    Zhang, Pengfei
    Liu, Keyu
    Huang, Wei
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2024, 15 (05)
  • [25] An Online Semi-NMF Algorithm for Soft-Clustering of Financial Institutions
    Cheng, Yuan
    Mankad, Shawn
    PROCEEDINGS OF THE FIFTH INTERNATIONAL WORKSHOP ON DATA SCIENCE FOR MACRO-MODELING (DSMM 2019), 2019,
  • [26] A Converged Deep Graph Semi-NMF Algorithm for Learning Data Representation
    Huang, Haonan
    Yang, Zuyuan
    Li, Zhenni
    Sun, Weijun
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2022, 41 (02) : 1146 - 1165
  • [27] A Converged Deep Graph Semi-NMF Algorithm for Learning Data Representation
    Haonan Huang
    Zuyuan Yang
    Zhenni Li
    Weijun Sun
    Circuits, Systems, and Signal Processing, 2022, 41 : 1146 - 1165
  • [28] Speech emotion recognition with unsupervised feature learning
    Zheng-wei HUANG
    Wen-tao XUE
    Qi-rong MAO
    Frontiers of Information Technology & Electronic Engineering, 2015, 16 (05) : 358 - 366
  • [29] Evolutionary feature generation in speech emotion recognition
    Schuller, Bjorn
    Reiter, Stephan
    Rigoll, Gerhard
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 5 - +
  • [30] Speech emotion recognition with unsupervised feature learning
    Huang, Zheng-wei
    Xue, Wen-tao
    Mao, Qi-rong
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2015, 16 (05) : 358 - 366