Time Window Analysis for Automatic Speech Emotion Recognition

被引:0
|
作者
Puterka, Boris [1 ]
Kacur, Juraj [2 ]
机构
[1] Slovak Univ Technol Bratislava, Inst Robot & Cybernet, Ilkovicova 3, Bratislava, Slovakia
[2] Slovak Univ Technol Bratislava, Inst Multimedia ICT, Ilkovicova 3, Bratislava, Slovakia
关键词
Speech Emotion Recognition; Spectrogram; NN; CNN;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper we present time analysis results of speech emotion recognition using convolutional neural network architecture and spectrograms as a speech features. Analyses were performed on model with two convolutional layers followed by pooling layer, and one fully-connected layer followed by dropout and softmax layer on the output. On this model we analyzed time characteristics of speech signal represented by spectrograms. The aim of our work was to find relation between duration of speech signal and the recognition rate of seven basic emotions. It was discovered that speech length is important and naturally the accuracy is growing with the length of analyzed window, however over approximately 1.2 seconds the growth becomes rather mild.
引用
收藏
页码:143 / 146
页数:4
相关论文
共 50 条
  • [41] The Automatic Analysis of Emotion in Political Speech Based on Transcripts
    Cochrane, Christopher
    Rheault, Ludovic
    Godbout, Jean-Francois
    Whyte, Tanya
    Wong, Michael W-C
    Borwein, Sophie
    POLITICAL COMMUNICATION, 2022, 39 (01) : 98 - 121
  • [42] Automatic Emotion Recognition
    不详
    CURRENT SCIENCE, 2018, 115 (01): : 14 - 14
  • [43] Polish Speech and Text Emotion Recognition in a Multimodal Emotion Analysis System
    Skowroński, Kamil
    Galuszka, Adam
    Probierz, Eryka
    Applied Sciences (Switzerland), 2024, 14 (22):
  • [44] An algorithm study for speech emotion recognition based speech feature analysis
    Zhengbiao, Ji
    Feng, Zhou
    Ming, Zhu
    International Journal of Multimedia and Ubiquitous Engineering, 2015, 10 (11): : 33 - 42
  • [45] Automatic Speech Recognition using Correlation Analysis
    Pramanik, Arnab
    Raha, Rajorshee
    PROCEEDINGS OF THE 2012 WORLD CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGIES, 2012, : 670 - 674
  • [46] Spectral Analysis for Automatic Speech Recognition and Enhancement
    Oruh, Jane
    Viriri, Serestina
    MACHINE LEARNING FOR NETWORKING, MLN 2020, 2021, 12629 : 245 - 254
  • [47] AUTOMATIC SPEECH ANALYSIS AND RECOGNITION - HATON,JP
    MARTIN, TB
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1984, 75 (04): : 1310 - 1311
  • [48] Time-frequency distributions for automatic speech recognition
    Potamianos, A
    Maragos, P
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (03): : 196 - 200
  • [49] Analysis of Excitation Source Features of Speech for Emotion Recognition
    Kadiri, Sudarsana Reddy
    Gangamohan, P.
    Gangashetty, Suryakanth V.
    Yegnanarayana, B.
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1324 - 1328
  • [50] An Analysis of Automatic Speech Recognition with Multiple Microphones
    Marino, Davide
    Hain, Thomas
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1288 - 1291