Speech Emotion Recognition for Indonesian Language Using Long Short-Term Memory

Authors
Lasiman, Jeremia Jason [1]
Lestari, Dessi Puji [1]
Affiliations
[1] Inst Teknol Bandung, Sch Elect Engn & Informat, Bandung, Indonesia
Keywords
Indonesian language; neural network; emotion recognition; LSTM
DOI
Not available
Chinese Library Classification
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
This paper presents extended research on an emotion recognition system for the Indonesian language. We use the Indonesian Emotional Corpus, which has four emotion classes (anger, contentment, happiness, sadness) and a neutral class. Because all previous research on emotion recognition for Indonesian used Support Vector Machines (SVM), we take SVM as the baseline. We experiment with SVM, Feed-Forward Neural Network (FFNN), and Long Short-Term Memory (LSTM) models of emotion. The results show that LSTM outperforms both SVM and FFNN: using acoustic and lexical features, LSTM obtains an average F1 measure of 65.9%, which is 5% higher than the best SVM in this experiment.
Pages: 40-43
Number of pages: 4