MedicalCare: building and annotating an empathy-rich corpus

被引:0
|
作者
Sun, Yinglun [1 ]
Zavala, Jose [2 ]
Shi, Shuju [1 ]
Finegold, Rachel [3 ]
Girju, Roxana [1 ,2 ]
Moore, Jeffrey [2 ]
机构
[1] Univ Illinois, Dept Linguist, Champaign, IL 61820 USA
[2] Univ Illinois, Beckman Inst, Urbana, IL USA
[3] Univ Illinois Champaign Urbana, Dept Eng, Champaign, IL USA
关键词
Empathy; Corpus; Annotation; HEALTH-CARE; EXPERIENCE; SYMPATHY; EMOTION;
D O I
10.1007/s10579-025-09806-7
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The importance of empathy in clinical settings has been widely accepted in the research community, and there have been numerous attempts at training clinical practitioners in empathic communication. Despite the advances in affective computing and automatic recognition and classification of emotions in discourse, there has been little research on how to characterize and model empathy in clinical settings. A corpus of essays was collected as a preliminary dataset for building an early stage linguistic model and measuring the efficacy of inter-annotator agreement on such a dataset. As annotated corpora have been popular resources for research on affective computing, in this study we build a text corpus named MedicalCare, and annotate it for empathic expressions using an iterative annotation process. We evaluated the annotation quality and the level of inter-annotator agreement over time, and found steady improvement in inter-annotator agreement on sentence labels as well as elaboration of the annotation guidelines. The average inter-rater agreement obtained over 370 essays annotated by four annotators is kappa = 0.65, and kappa = 0.82 between two meta-annotators. We also conducted text analyses of the annotated essays and found that the use of personal pronouns, negative emotion words and words about reassurance are correlated with empathic expressions.
引用
收藏
页数:36
相关论文
共 50 条
  • [41] Building a learner corpus
    Hana, Jirka
    Rosen, Alexandr
    Stindlova, Barbora
    Jaeger, Petr
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 3228 - 3232
  • [42] Building Empathy with Poverty Simulations
    Laskowski, Tara
    EDUCATIONAL LEADERSHIP, 2023, 80 (04) : 12 - 13
  • [43] Building a learner corpus
    Jirka Hana
    Alexandr Rosen
    Barbora Štindlová
    Jan Štěpánek
    Language Resources and Evaluation, 2014, 48 : 741 - 752
  • [44] Building a learner corpus
    Hana, Jirka
    Rosen, Alexandr
    Stindlova, Barbora
    Stepanek, Jan
    LANGUAGE RESOURCES AND EVALUATION, 2014, 48 (04) : 741 - 752
  • [45] Annotating the TCD D-ANS Corpus - A Multimodal Multimedia Monolingual Biometric Corpus of Spoken Social Interaction
    Campbell, Nick
    Hennig, Shannon
    MULTIMODAL ANALYSES ENABLING ARTIFICIAL AGENTS IN HUMAN-MACHINE INTERACTION, 2015, 8757 : 3 - 12
  • [46] Annotating a corpus of human interaction with prosodic profiles - focusing on Mandarin repair/disfluency
    Chen, Helen Kai-yun
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 986 - 990
  • [47] The Causal News Corpus: Annotating Causal Relations in Event Sentences from News
    Tan, Fiona Anting
    Hurriyetoglu, Ali
    Caselli, Tommaso
    Oostdijk, Nelleke
    Nomoto, Tadashi
    Hettiarachchi, Hansi
    Ameer, Iqra
    Uca, Onur
    Liza, Farhana Ferdousi
    Hu, Tiancheng
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 2298 - 2310
  • [48] Poster: Extracting and Annotating Mental Health Forum Corpus: A Comprehensive Validation Pipeline
    Jonnalagadda, Rohith Sundar
    Azmee, Abm Adnan
    Attota, Dinesh
    Khan, Md Abdullah Al Hafiz
    Pei, Yong
    Nandan, Monica
    2024 IEEE/ACM CONFERENCE ON CONNECTED HEALTH: APPLICATIONS, SYSTEMS AND ENGINEERING TECHNOLOGIES, CHASE 2024, 2024, : 208 - 209
  • [49] Correction: PhenoDEF: a corpus for annotating sentences with information of phenotype definitions in biomedical literature
    Samar Binkheder
    Heng-Yi Wu
    Sara K. Quinney
    Shijun Zhang
    Md. Muntasir Zitu
    Chien-Wei Chiang
    Lei Wang
    Josette Jones
    Lang Li
    Journal of Biomedical Semantics, 13
  • [50] Annotating thematic features in English and Spanish: A contrastive corpus-based study
    Arus, Jorge
    Lavid, Julia
    Moraton, Lara
    LINGUISTICS AND THE HUMAN SCIENCES, 2010, 6 (1-3): : 173 - 192