Speech Emotion Recognition Applied to Real-World Medical Consultation

Authors
Huang, Ching-Tzu [1 ,2 ]
Huang, Chih-Wei [2 ]
Yang, Hsuan-Chia [1 ,2 ,3 ]
Li, Yu-Chuan [1 ,2 ,4 ]
Affiliations
[1] Taipei Med Univ, Grad Inst Biomed Informat, Coll Med Sci & Technol, Taipei, Taiwan
[2] Taipei Med Univ, Int Ctr Hlth Informat & Technol ICHIT, Taipei, Taiwan
[3] Taipei Med Univ, Grad Inst Data Sci, Coll Management, Taipei, Taiwan
[4] Taipei Med Univ, Wanfang Hosp, Dept Dermatol, Taipei, Taiwan
Keywords
Speech emotion recognition; medical education; doctor-patient communication; YAMNet transfer learning; bidirectional long short-term memory networks; EMPATHY
DOI
10.3233/SHTI231139
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Since 2020, the COVID-19 epidemic has changed healthcare behaviors. Mandatory mask-wearing has altered how doctors and patients perceive their interactions, so building a satisfying relationship can no longer rely on reading facial expressions alone. The voice has become more important as a way to overcome the barrier that masks create. Verbal and non-verbal communication are therefore crucial to doctor-patient interaction during medical consultations and other conversations. Speech emotion recognition has been a popular research area in recent years. Despite the abundant work in the field, non-verbal emotion recognition in medical scenarios remains underexplored. In this study, we investigate YAMNet transfer learning on a Mandarin Chinese speech corpus, the NTHU-NTUA Chinese Interactive Emotion Corpus (NNIME), and use real-world dermatology clinic recordings to test its generalization capability. The validation accuracy on NNIME data was 0.59 for activation and 0.57 for valence. On the doctor-patient dataset, the validation accuracy was 0.24 for activation and 0.58 for valence.
Pages: 1121-1125
Number of pages: 5