Time Window Analysis for Automatic Speech Emotion Recognition

Authors
Puterka, Boris [1]
Kacur, Juraj [2]
Affiliations
[1] Slovak Univ Technol Bratislava, Inst Robot & Cybernet, Ilkovicova 3, Bratislava, Slovakia
[2] Slovak Univ Technol Bratislava, Inst Multimedia ICT, Ilkovicova 3, Bratislava, Slovakia
Keywords
Speech Emotion Recognition; Spectrogram; NN; CNN
DOI
Not available
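Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Subject classification codes
0808; 0809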
Abstract
In this paper we present the results of a time analysis of speech emotion recognition using a convolutional neural network architecture with spectrograms as speech features. The analyses were performed on a model with two convolutional layers followed by a pooling layer, and one fully connected layer followed by dropout and a softmax layer at the output. On this model we analyzed the time characteristics of the speech signal represented by spectrograms. The aim of our work was to find the relation between the duration of the speech signal and the recognition rate of seven basic emotions. We found that speech length is important: accuracy naturally grows with the length of the analyzed window, but beyond approximately 1.2 seconds the growth becomes rather mild.
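The time-window idea in the abstract can be illustrated with a minimal sketch that cuts a signal into fixed-duration windows before any spectrogram is computed. This is not the authors' procedure: the 16 kHz sampling rate, the non-overlapping layout, the dropped tail, and the helper names `window_length_samples` and `split_into_windows` are all assumptions made for illustration; only the 1.2 s duration comes from the abstract.

```python
def window_length_samples(duration_s, sample_rate_hz):
    """Number of samples covered by a window of the given duration."""
    return int(round(duration_s * sample_rate_hz))


def split_into_windows(signal, duration_s, sample_rate_hz):
    """Cut a signal into consecutive non-overlapping windows.

    A trailing remainder shorter than one window is dropped (an assumption;
    padding it would be an equally valid choice).
    """
    n = window_length_samples(duration_s, sample_rate_hz)
    if n <= 0:
        raise ValueError("window duration must cover at least one sample")
    return [signal[i:i + n] for i in range(0, len(signal) - n + 1, n)]


# Hypothetical input: 3 s of silence at an assumed 16 kHz sampling rate.
SAMPLE_RATE_HZ = 16000
signal = [0.0] * (3 * SAMPLE_RATE_HZ)

# 1.2 s is the window length beyond which the abstract reports only mild gains.
windows = split_into_windows(signal, 1.2, SAMPLE_RATE_HZ)
```

Repeating the split for several values of `duration_s` and evaluating a classifier on each set of windows is one way to trace accuracy against window length, in the spirit of the analysis the abstract describes.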
Pages: 143-146
Number of pages: 4