Is the patient speaking or the nurse? Automatic speaker type identification in patient-nurse audio recordings
被引:3
|
作者:
Zolnoori, Maryam
论文数: 0引用数: 0
h-index: 0
机构:
Columbia Univ, Sch Nursing, New York, NY USA
VNS Hlth, Ctr Home Care Policy & Res, New York, NY USA
Columbia Univ, Sch Nursing, 390 Ft Washington Ave, New York, NY 10027 USAColumbia Univ, Sch Nursing, New York, NY USA
Zolnoori, Maryam
[1
,2
,5
]
Vergez, Sasha
论文数: 0引用数: 0
h-index: 0
机构:
VNS Hlth, Ctr Home Care Policy & Res, New York, NY USAColumbia Univ, Sch Nursing, New York, NY USA
Vergez, Sasha
[2
]
Sridharan, Sridevi
论文数: 0引用数: 0
h-index: 0
机构:
VNS Hlth, Ctr Home Care Policy & Res, New York, NY USAColumbia Univ, Sch Nursing, New York, NY USA
Sridharan, Sridevi
[2
]
Zolnour, Ali
论文数: 0引用数: 0
h-index: 0
机构:Columbia Univ, Sch Nursing, New York, NY USA
Zolnour, Ali
Bowles, Kathryn
论文数: 0引用数: 0
h-index: 0
机构:
VNS Hlth, Ctr Home Care Policy & Res, New York, NY USA
Univ Tehran, Sch Elect & Comp Engn, Tehran, IranColumbia Univ, Sch Nursing, New York, NY USA
Bowles, Kathryn
[2
,3
]
Kostic, Zoran
论文数: 0引用数: 0
h-index: 0
机构:
Columbia Univ, Dept Elect Engn, New York, NY USAColumbia Univ, Sch Nursing, New York, NY USA
Kostic, Zoran
[4
]
Topaz, Maxim
论文数: 0引用数: 0
h-index: 0
机构:
Columbia Univ, Sch Nursing, New York, NY USA
VNS Hlth, Ctr Home Care Policy & Res, New York, NY USAColumbia Univ, Sch Nursing, New York, NY USA
Topaz, Maxim
[1
,2
]
机构:
[1] Columbia Univ, Sch Nursing, New York, NY USA
[2] VNS Hlth, Ctr Home Care Policy & Res, New York, NY USA
natural language processing;
patient-nurse verbal communication;
home healthcare;
machine learning;
audio-recording procedure;
CARE;
TALKING;
D O I:
10.1093/jamia/ocad139
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
Objectives Patient-clinician communication provides valuable explicit and implicit information that may indicate adverse medical conditions and outcomes. However, practical and analytical approaches for audio-recording and analyzing this data stream remain underexplored. This study aimed to 1) analyze patients' and nurses' speech in audio-recorded verbal communication, and 2) develop machine learning (ML) classifiers to effectively differentiate between patient and nurse language. Materials and Methods Pilot studies were conducted at VNS Health, the largest not-for-profit home healthcare agency in the United States, to optimize audio-recording patient-nurse interactions. We recorded and transcribed 46 interactions, resulting in 3494 "utterances" that were annotated to identify the speaker. We employed natural language processing techniques to generate linguistic features and built various ML classifiers to distinguish between patient and nurse language at both individual and encounter levels. Results A support vector machine classifier trained on selected linguistic features from term frequency-inverse document frequency, Linguistic Inquiry and Word Count, Word2Vec, and Medical Concepts in the Unified Medical Language System achieved the highest performance with an AUC-ROC = 99.01 & PLUSMN; 1.97 and an F1-score = 96.82 & PLUSMN; 4.1. The analysis revealed patients' tendency to use informal language and keywords related to "religion," "home," and "money," while nurses utilized more complex sentences focusing on health-related matters and medical issues and were more likely to ask questions. Conclusion The methods and analytical approach we developed to differentiate patient and nurse language is an important precursor for downstream tasks that aim to analyze patient speech to identify patients at risk of disease and negative health outcomes.