MedicalCare: building and annotating an empathy-rich corpus

被引:0
|
作者
Sun, Yinglun [1 ]
Zavala, Jose [2 ]
Shi, Shuju [1 ]
Finegold, Rachel [3 ]
Girju, Roxana [1 ,2 ]
Moore, Jeffrey [2 ]
机构
[1] Univ Illinois, Dept Linguist, Champaign, IL 61820 USA
[2] Univ Illinois, Beckman Inst, Urbana, IL USA
[3] Univ Illinois Champaign Urbana, Dept Eng, Champaign, IL USA
关键词
Empathy; Corpus; Annotation; HEALTH-CARE; EXPERIENCE; SYMPATHY; EMOTION;
D O I
10.1007/s10579-025-09806-7
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The importance of empathy in clinical settings has been widely accepted in the research community, and there have been numerous attempts at training clinical practitioners in empathic communication. Despite the advances in affective computing and automatic recognition and classification of emotions in discourse, there has been little research on how to characterize and model empathy in clinical settings. A corpus of essays was collected as a preliminary dataset for building an early stage linguistic model and measuring the efficacy of inter-annotator agreement on such a dataset. As annotated corpora have been popular resources for research on affective computing, in this study we build a text corpus named MedicalCare, and annotate it for empathic expressions using an iterative annotation process. We evaluated the annotation quality and the level of inter-annotator agreement over time, and found steady improvement in inter-annotator agreement on sentence labels as well as elaboration of the annotation guidelines. The average inter-rater agreement obtained over 370 essays annotated by four annotators is kappa = 0.65, and kappa = 0.82 between two meta-annotators. We also conducted text analyses of the annotated essays and found that the use of personal pronouns, negative emotion words and words about reassurance are correlated with empathic expressions.
引用
收藏
页数:36
相关论文
共 50 条
  • [31] Ontology Based Approach for Annotating a Corpus of Computer Science Abstracts
    Almugbel, Zainab
    2019 INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCES (ICCIS), 2019, : 81 - 86
  • [32] IARG-AnCora: Annotating AnCora corpus with implicit arguments
    Taule, Mariona
    Antonia Marti, M.
    Penis, Aina
    Rodriguez, Horacio
    Moreno, Lidia
    Moreda, Paloma
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2012, (49): : 181 - 184
  • [33] A Transfer Learning Framework For Annotating Implementation-Specific Corpus
    Ponniah, Anbumunee
    Agarwal, Swati
    Ranka, Sharanya Milind
    Madhusudhan, Shashank
    2022 IEEE 9TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2022, : 503 - 512
  • [34] Review of Practices of Collecting and Annotating Texts in the Learner Corpus REALEC
    Vinogradova, Olga
    Lyashevskaya, Olga
    TEXT, SPEECH, AND DIALOGUE (TSD 2022), 2022, 13502 : 77 - 88
  • [35] Metafier - a Tool for Annotating and Structuring Building Metadata
    Holmegaard, Emil
    Johansen, Aslak
    Kjaergaard, Mikkel Baun
    2017 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTED, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2017,
  • [36] The UIR Uncertainty Corpus for Chinese: Annotating Chinese Microblog Corpus for Uncertainty Identification from Social Media
    Li, Binyang
    Xiang, Jun
    Chen, Le
    Han, Xu
    Yu, Xiaoyan
    Xu, Ruifeng
    Wang, Tengjiao
    Wong, Kam-fai
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 495 - 498
  • [37] Annotating progressive aspect constructions in the spoken section of the British National Corpus
    Caines, Andrew
    Buttery, Paula
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 1699 - 1704
  • [38] PhenoDEF: a corpus for annotating sentences with information of phenotype definitions in biomedical literature
    Binkheder, Samar
    Wu, Heng-Yi
    Quinney, Sara K.
    Zhang, Shijun
    Zitu, Md Muntasir
    Chiang, Chien-Wei
    Wang, Lei
    Jones, Josette
    Li, Lang
    JOURNAL OF BIOMEDICAL SEMANTICS, 2022, 13 (01)
  • [39] Annotating a broad range of anaphoric phenomena, in a variety of genres: the ARRAU Corpus
    Uryupina, Olga
    Artstein, Ron
    Bristot, Antonella
    Cavicchio, Federica
    Delogu, Francesca
    Rodriguez, Kepa J.
    Poesio, Massimo
    NATURAL LANGUAGE ENGINEERING, 2020, 26 (01) : 95 - 128
  • [40] Annotating Modality Expressions and Event Factuality for a Japanese Chess Commentary Corpus
    Matsuyoshi, Suguru
    Kameko, Hirotaka
    Murawaki, Yugo
    Mori, Shinsuke
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 2475 - 2481