Transfer learning based convolution neural net for authentication and classification of emotions from natural and stimulated speech signals

被引:4
|
作者
Kumar, Mukul [1 ]
Katyal, Nipun [1 ]
Ruban, Nersisson [1 ]
Lyakso, Elena [2 ]
Mekala, A. Mary [3 ]
Raj, Alex Noel Joseph [4 ]
Richard, G. Maarc [1 ]
机构
[1] Vellore Inst Technol Vellore, Sch Elect Engn, Vellore, Tamil Nadu, India
[2] St Petersburg State Univ, St Petersburg, Russia
[3] Vellore Inst Technol Vellore, Sch Informat Technol & Engn, Vellore, Tamil Nadu, India
[4] Shantou Univ, Coll Engn, Dept Elect Engn, Key Lab Digital Signal & Image Proc Guangdong Pro, Shantou, Peoples R China
关键词
Deep learning; speech fidelity classification; linear prediction cepstral coefficients (LPCC); mel frequency cepstral coefficients (MFCC); speech emotion recognition; RECOGNITION; FREQUENCY;
D O I
10.3233/JIFS-210711
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Over the years the need for differentiating various emotions from oral communication plays an important role in emotion based studies. There have been different algorithms to classify the kinds of emotion. Although there is no measure of fidelity of the emotion under consideration, which is primarily due to the reason that most of the readily available datasets that are annotated are produced by actors and not generated in real-world scenarios. Therefore, the predicted emotion lacks an important aspect called authenticity, which is whether an emotion is actual or stimulated. In this research work, we have developed a transfer learning and style transfer based hybrid convolutional neural network algorithm to classify the emotion as well as the fidelity of the emotion. The model is trained on features extracted from a dataset that contains stimulated as well as actual utterances. We have compared the developed algorithm with conventional machine learning and deep learning techniques by few metrics like accuracy, Precision, Recall and F1score. The developed model performs much better than the conventional machine learning and deep learning models. The research aims to dive deeper into human emotion and make a model that understands it like humans do with precision, recall, Fl score values of 0.994, 0.996, 0.995 for speech authenticity and 0.992, 0.989, 0.99 for speech emotion classification respectively.
引用
收藏
页码:2013 / 2024
页数:12
相关论文
共 50 条
  • [1] Convolution Neural Network based Transfer Learning for Classification of Flowers
    Wu, Yong
    Qin, Xiao
    Pan, Yonghua
    Yuan, Changan
    2018 IEEE 3RD INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP), 2018, : 562 - 566
  • [2] Emotions Classification from Speech with Deep Learning
    Chowanda, Andry
    Muliono, Yohan
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (04) : 777 - 781
  • [3] A Transfer-Learning-Based Novel Convolution Neural Network for Melanoma Classification
    Qureshi, Mohammad Naved
    Umar, Mohammad Sarosh
    Shahab, Sana
    COMPUTERS, 2022, 11 (05)
  • [4] Classification of Appearance Quality of Red Grape Based on Transfer Learning of Convolution Neural Network
    Zha, Zhihua
    Shi, Dongyuan
    Chen, Xiaohui
    Shi, Hui
    Wu, Jie
    AGRONOMY-BASEL, 2023, 13 (08):
  • [5] FPGA based emotions recognition from speech signals
    Rajasekhar, B.
    Kamaraju, M.
    Sumalatha, V.
    2017 THIRD INTERNATIONAL CONFERENCE ON BIOSIGNALS, IMAGES AND INSTRUMENTATION (ICBSII), 2017,
  • [6] Sentiment Analysis from Speech Signals using Convolution Neural Network
    Chaurasiya, Rahul Kumar
    Priya, Nettem Sri
    Praneeth, Kothapally Gnana
    Kumar, Gujjarlapudi Varun
    Jahnavi, Matsa
    Teja, Tadigadapa Pranay
    PROCEEDINGS OF 2023 THE 7TH INTERNATIONAL CONFERENCE ON GRAPHICS AND SIGNAL PROCESSING, ICGSP, 2023, : 42 - 49
  • [7] Recognizing Emotions From Whispered Speech Based on Acoustic Feature Transfer Learning
    Deng, Jun
    Fruhholz, Sascha
    Zhang, Zixing
    Schuller, Bjoern
    IEEE ACCESS, 2017, 5 : 5235 - 5246
  • [8] Automatic Classification System of Drainage Hole Blockage Based on Convolution Neural Network Transfer Learning
    Lv, Jianbing
    Wu, Weijun
    Kang, Xiaoyu
    Huang, Juan
    Chen, Gongfa
    Teng, Shuai
    Gao, Hejie
    Hoang, Nhat-Duc
    ADVANCES IN CIVIL ENGINEERING, 2022, 2022
  • [9] Neural Network Based Classification of Human Emotions using Electromyogram Signals
    Latha, Charlyn Pushpa G.
    Hema, C. R.
    Paulraj, M. P.
    PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION SYSTEMS (ICACCS), 2013,
  • [10] Convolutional Neural Networks and Transfer Learning Based Classification of Natural Landscape Images
    Krstinic, Damir
    Braovic, Maja
    Bozic-Stulic, Dunja
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2020, 26 (02) : 244 - 267