Speech Emotion Recognition Applied to Real-World Medical Consultation

被引:0
|
作者
Huang, Ching-Tzu [1 ,2 ]
Huang, Chih-Wei [2 ]
Yang, Hsuan-Chia [1 ,2 ,3 ]
Li, Yu-Chuan [1 ,2 ,4 ]
机构
[1] Taipei Med Univ, Grad Inst Biomed Informat, Coll Med Sci & Technol, Taipei, Taiwan
[2] Taipei Med Univ, Int Ctr Hlth Informat & Technol ICHIT, Taipei, Taiwan
[3] Taipei Med Univ, Grad Inst Data Sci, Coll Management, Taipei, Taiwan
[4] Taipei Med Univ, Wanfang Hosp, Dept Dermatol, Taipei, Taiwan
来源
关键词
Speech emotion recognition; medical education; doctor-patient communication; YAMNet transfer learning; bidirectional long short-term memory networks; EMPATHY;
D O I
10.3233/SHTI231139
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Since 2020, the COVID-19 epidemic has changed our lives in healthcare behaviors. Forced to wear masks influenced doctor-patient interaction perceptions truly, thus, to build a satisfying relationship is not just empathize with facial expressions. The voice becomes more important for the sake of conquering the burden of masks. Hence, verbal and non-verbal communication will be crucial criteria for doctor-patient interaction during medical consultations and other conversations. In these years, speech emotion recognition has been a popular research domain. In spite of abundant work conducted, nonverbal emotion recognition in medical scenarios is still required to reveal. In this study, we investigate YAMNet transfer learning on Chinese Mandarin speech corpus NTHU-NTUA Chinese Interactive Emotion Corpus (NNIME) and use real-world dermatology clinic recording to test the generalization capability. The results showed that the accuracy validated on NNIME data was 0.59 for activation prediction and 0.57 for valence. Furthermore, the validation accuracy on the doctor-patient dataset was 0.24 for activation and 0.58 for valence, respectively.
引用
收藏
页码:1121 / 1125
页数:5
相关论文
共 50 条
  • [31] EQUALIZATION MATCHING OF SPEECH RECORDINGS IN REAL-WORLD ENVIRONMENTS
    Germain, Francois G.
    Mysore, Gautham J.
    Fujioka, Takako
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 609 - 613
  • [32] DISCRIMINATION BETWEEN SINGING AND SPEECH IN REAL-WORLD AUDIO
    Thompson, Brian
    [J]. 2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 407 - 412
  • [33] Speech Emotion Recognition
    Lalitha, S.
    Madhavan, Abhishek
    Bhushan, Bharath
    Saketh, Srinivas
    [J]. 2014 INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRONICS, COMPUTERS AND COMMUNICATIONS (ICAECC), 2014,
  • [34] Segmentation and its real-world applications in speech processing
    Sattar, Farook
    Nilsson, Mikael
    Claesson, Ingvar
    [J]. 2007 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1-3, 2007, : 788 - +
  • [35] A real-world noise removal with wavelet speech feature
    Chiluveru, Samba Raju
    Tripathy, Manoj
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (03) : 683 - 693
  • [36] A Real-World Emotional Speech Corpus for Modern Greek
    Kostoulas, Theodoros
    Ganchev, Todor
    Mporas, Iosif
    Fakotakis, Nikos
    [J]. SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 2676 - 2680
  • [37] A real-world noise removal with wavelet speech feature
    Samba Raju Chiluveru
    Manoj Tripathy
    [J]. International Journal of Speech Technology, 2020, 23 : 683 - 693
  • [38] An Evaluation of Emotion Units and Feature Types for Real-Time Speech Emotion Recognition
    Vogt, Thurid
    Andre, Elisabeth
    [J]. KUNSTLICHE INTELLIGENZ, 2011, 25 (03): : 213 - 223
  • [39] Uncertainties in real-world decisions on medical technologies
    Lu, C. Y.
    [J]. INTERNATIONAL JOURNAL OF CLINICAL PRACTICE, 2014, 68 (08) : 936 - 940
  • [40] Real-World Trends in the Evaluation of Medical Products
    Hennessy, Sean
    Berlin, Jesse A.
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 2023, 192 (01) : 1 - 5