MedicalCare: building and annotating an empathy-rich corpus

被引:0
|
作者
Sun, Yinglun [1 ]
Zavala, Jose [2 ]
Shi, Shuju [1 ]
Finegold, Rachel [3 ]
Girju, Roxana [1 ,2 ]
Moore, Jeffrey [2 ]
机构
[1] Univ Illinois, Dept Linguist, Champaign, IL 61820 USA
[2] Univ Illinois, Beckman Inst, Urbana, IL USA
[3] Univ Illinois Champaign Urbana, Dept Eng, Champaign, IL USA
关键词
Empathy; Corpus; Annotation; HEALTH-CARE; EXPERIENCE; SYMPATHY; EMOTION;
D O I
10.1007/s10579-025-09806-7
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The importance of empathy in clinical settings has been widely accepted in the research community, and there have been numerous attempts at training clinical practitioners in empathic communication. Despite the advances in affective computing and automatic recognition and classification of emotions in discourse, there has been little research on how to characterize and model empathy in clinical settings. A corpus of essays was collected as a preliminary dataset for building an early stage linguistic model and measuring the efficacy of inter-annotator agreement on such a dataset. As annotated corpora have been popular resources for research on affective computing, in this study we build a text corpus named MedicalCare, and annotate it for empathic expressions using an iterative annotation process. We evaluated the annotation quality and the level of inter-annotator agreement over time, and found steady improvement in inter-annotator agreement on sentence labels as well as elaboration of the annotation guidelines. The average inter-rater agreement obtained over 370 essays annotated by four annotators is kappa = 0.65, and kappa = 0.82 between two meta-annotators. We also conducted text analyses of the annotated essays and found that the use of personal pronouns, negative emotion words and words about reassurance are correlated with empathic expressions.
引用
收藏
页数:36
相关论文
共 50 条
  • [21] Annotating Speaker Stance in Discourse: The Brexit Blog Corpus
    Simaki, Vasiliki
    Paradis, Carita
    Skeppstedt, Maria
    Sahlgren, Magnus
    Kucher, Kostiantyn
    Kerren, Andreas
    CORPUS LINGUISTICS AND LINGUISTIC THEORY, 2020, 16 (02) : 215 - 248
  • [22] Annotating opinion-evaluation of blogs: the Blogoscopy corpus
    Daille, Beatrice
    Dubreil, Estelle
    Monceaux, Laura
    Vernier, Matthieu
    LANGUAGE RESOURCES AND EVALUATION, 2011, 45 (04) : 409 - 437
  • [23] Annotating Event Appearance for Japanese Chess Commentary Corpus
    Kameko, Hirotaka
    Mori, Shinsukc
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 4302 - 4308
  • [24] Annotating Indirect Anaphora for Hindi : A Corpus Based Study
    Singh, Pardeep
    Dutta, Kamlesh
    2014 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS, 2014, : 525 - 529
  • [25] A set of parameters for automatically annotating a Sentiment Arabic Corpus
    Imane, Guellil
    Kareem, Darwish
    Faical, Azouaou
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2019, 15 (05) : 594 - 615
  • [26] The Bible as a Parallel Corpus: Annotating the ‘Book of 2000 Tongues’
    Philip Resnik
    Mari Broman Olsen
    Mona Diab
    Computers and the Humanities, 1999, 33 : 129 - 153
  • [27] John of Scythopolis and the Dionysian corpus. Annotating the Areopagite
    Williams, JP
    JOURNAL OF THEOLOGICAL STUDIES, 1999, 50 : 784 - 788
  • [28] On building phonetically and prosodically rich speech corpus for text-to-speech synthesis
    Matousek, Jindrich
    Romportl, Jan
    PROCEEDINGS OF THE SECOND IASTED INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, 2006, : 442 - +
  • [29] Building Empathy in Schools
    Hoerr, Thomas R.
    EDUCATIONAL LEADERSHIP, 2018, 75 (07) : 86 - 87
  • [30] Criteria for Identifying and Annotating Caused Motion Constructions in Corpus Data
    Hwang, Jena D.
    Zaenen, Annie
    Palmer, Martha
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 1297 - 1304