Modified dense convolutional networks based emotion detection from speech using its paralinguistic features

Authors
Ritika Dhiman
Gurkanwal Singh Kang
Varun Gupta
Affiliation
[1] Chandigarh College of Engineering and Technology (Degree Wing), Department of Computer Science and Engineering
Keywords
Emotion detection; Speech emotion recognition; Paralinguistic features; Dense convolutional networks; Residual networks; Convolutional neural networks; Deep neural networks; Machine learning
DOI
Not available
Abstract
Emotion recognition through speech is one of the fundamental approaches for human interaction. Speech modulations stipulate different emotions and context. In this paper, we propose modified dense convolutional networks (modified DenseNet201) for emotion detection from speech using its paralinguistic features such as vocal tract features. The proposed network performs emotion classification from speech using spectrograms of its audio files. The proposed network outperforms other alternative models like residual networks, AlexNet, VGG16, SVM, XGBoost, boosted random forest etc. for emotion classification from speech. Moreover, the proposed network surpasses all other existing methods proposed in the literature and obtains state-of-the-art results in most of the cases. Further, the proposed network has been successfully validated on two different language datasets: ‘EmoDB’ and ‘SAVEE’ which qualifies it as a language-independent emotion detection system from speech.
Pages: 32041-32069
Number of pages: 28
Source
Multimedia Tools and Applications, 2021, 80 (21-23): 32041-32069