Towards automatic recognition of emotion in speech

被引:0
|
作者
Razak, AA [1 ]
Yusof, MHM [1 ]
Komiya, R [1 ]
机构
[1] Multimedia Univ, Fac Informat Technol, Selangor 63100, Malaysia
关键词
component; speech processing; emotion recognizer; emotion parameter; LP analysis; fuzzy concept;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper discusses an approach towards automatic recognition of emotion in speech using computer. First, a design for the emotion recognizer is proposed. LP analysis algorithm has been used for the speech emotion parameter extraction. A total of 22 speech features have been selected to represent each emotion. A database consisting of emotional Malay and English voice samples has been developed for training and recognition purposes. Fuzzy concept has been applied to recognize emotion of the selected voice sample. The result from computer recognition is compared to the human recognition rate to confirm the reliability of the result and also to explore how well people and computer can recognize emotion in speech. It is found that computer recognition of emotion is possible and the average recognition rate of 66% is satisfactory based on the comparison from the human perception. According to the confusion matrix table for both human and computer recognition, it is shown that the way human interprets emotion is different from computer.
引用
收藏
页码:548 / 551
页数:4
相关论文
共 50 条
  • [1] Automatic Speech Emotion Recognition: A Survey
    Chandrasekar, Purnima
    Chapaneri, Santosh
    Jayaswal, Deepak
    [J]. 2014 INTERNATIONAL CONFERENCE ON CIRCUITS, SYSTEMS, COMMUNICATION AND INFORMATION TECHNOLOGY APPLICATIONS (CSCITA), 2014, : 341 - 346
  • [2] Automatic emotion recognition by the speech signal
    Schuller, B
    Lang, M
    Rigoll, G
    [J]. 6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL IX, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING II, 2002, : 367 - 372
  • [3] Towards Indonesian Speech-Emotion Automatic Recognition (I-SpEAR)
    Wunarso, Novita Belinda
    Soelistio, Yustinus Eko
    [J]. PROCEEDINGS OF 2017 4TH INTERNATIONAL CONFERENCE ON NEW MEDIA STUDIES (CONMEDIA 2017), 2017, : 98 - 101
  • [4] The Impact of Face Mask and Emotion on Automatic Speech Recognition (ASR) and Speech Emotion Recognition (SER)
    Oh, Qi Qi
    Seow, Chee Kiat
    Yusuff, Mulliana
    Pranata, Sugiri
    Cao, Qi
    [J]. 2023 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYTICS, ICCCBDA, 2023, : 523 - 531
  • [5] Automatic Emotion Recognition of Speech Signal in Mandarin
    Zhang, Sheng
    Ching, P. C.
    Kong, Fanrang
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1810 - +
  • [6] LEARNING WITH SYNTHESIZED SPEECH FOR AUTOMATIC EMOTION RECOGNITION
    Schuller, Bjoern
    Burkhardt, Felix
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 5150 - 5153
  • [7] On the Correlation and Transferability of Features between Automatic Speech Recognition and Speech Emotion Recognition
    Fayek, Haytham M.
    Lech, Margaret
    Cavedon, Lawrence
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3618 - 3622
  • [8] Towards inclusive automatic speech recognition
    Feng, Siyuan
    Halpern, Bence Mark
    Kudina, Olya
    Scharenborg, Odette
    [J]. COMPUTER SPEECH AND LANGUAGE, 2024, 84
  • [9] Towards the explainability of Multimodal Speech Emotion Recognition
    Kumar, Puneet
    Kaushik, Vishesh
    Raman, Balasubramanian
    [J]. INTERSPEECH 2021, 2021, : 1748 - 1752
  • [10] Automatic Speech Emotion Recognition: a Systematic Literature Review
    Mustafa H.H.
    Darwish N.R.
    Hefny H.A.
    [J]. International Journal of Speech Technology, 2024, 27 (1) : 267 - 285