Time Window Analysis for Automatic Speech Emotion Recognition

被引:0
|
作者
Puterka, Boris [1 ]
Kacur, Juraj [2 ]
机构
[1] Slovak Univ Technol Bratislava, Inst Robot & Cybernet, Ilkovicova 3, Bratislava, Slovakia
[2] Slovak Univ Technol Bratislava, Inst Multimedia ICT, Ilkovicova 3, Bratislava, Slovakia
关键词
Speech Emotion Recognition; Spectrogram; NN; CNN;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper we present time analysis results of speech emotion recognition using convolutional neural network architecture and spectrograms as a speech features. Analyses were performed on model with two convolutional layers followed by pooling layer, and one fully-connected layer followed by dropout and softmax layer on the output. On this model we analyzed time characteristics of speech signal represented by spectrograms. The aim of our work was to find relation between duration of speech signal and the recognition rate of seven basic emotions. It was discovered that speech length is important and naturally the accuracy is growing with the length of analyzed window, however over approximately 1.2 seconds the growth becomes rather mild.
引用
收藏
页码:143 / 146
页数:4
相关论文
共 50 条
  • [1] Automatic Speech Emotion Recognition: A Survey
    Chandrasekar, Purnima
    Chapaneri, Santosh
    Jayaswal, Deepak
    2014 INTERNATIONAL CONFERENCE ON CIRCUITS, SYSTEMS, COMMUNICATION AND INFORMATION TECHNOLOGY APPLICATIONS (CSCITA), 2014, : 341 - 346
  • [2] Automatic emotion recognition by the speech signal
    Schuller, B
    Lang, M
    Rigoll, G
    6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL IX, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING II, 2002, : 367 - 372
  • [3] Towards automatic recognition of emotion in speech
    Razak, AA
    Yusof, MHM
    Komiya, R
    PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY, 2003, : 548 - 551
  • [4] The Impact of Face Mask and Emotion on Automatic Speech Recognition (ASR) and Speech Emotion Recognition (SER)
    Oh, Qi Qi
    Seow, Chee Kiat
    Yusuff, Mulliana
    Pranata, Sugiri
    Cao, Qi
    2023 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYTICS, ICCCBDA, 2023, : 523 - 531
  • [5] Automatic Emotion Recognition of Speech Signal in Mandarin
    Zhang, Sheng
    Ching, P. C.
    Kong, Fanrang
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1810 - +
  • [6] LEARNING WITH SYNTHESIZED SPEECH FOR AUTOMATIC EMOTION RECOGNITION
    Schuller, Bjoern
    Burkhardt, Felix
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5150 - 5153
  • [7] UNSUPERVISED LEARNING APPROACH TO FEATURE ANALYSIS FOR AUTOMATIC SPEECH EMOTION RECOGNITION
    Eskimez, Sefik Emre
    Duan, Zhiyao
    Heinzelman, Wendi
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5099 - 5103
  • [8] On the Correlation and Transferability of Features between Automatic Speech Recognition and Speech Emotion Recognition
    Fayek, Haytham M.
    Lech, Margaret
    Cavedon, Lawrence
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3618 - 3622
  • [9] Automatic context window composition for distant speech recognition
    Ravanelli, Mirco
    Omologo, Maurizio
    SPEECH COMMUNICATION, 2018, 101 : 34 - 44
  • [10] Automatic Speech Emotion Recognition: a Systematic Literature Review
    Mustafa H.H.
    Darwish N.R.
    Hefny H.A.
    International Journal of Speech Technology, 2024, 27 (1) : 267 - 285