It's Not Only What You Say, But Also How You Say It: Machine Learning Approach to Estimate Trust from Conversation

Cited by: 7
Authors
Li, Mengyao [1, 4]
Erickson, Isabel M. [1]
Cross, Ernest V. [2]
Lee, John D. [3]
Affiliations
[1] Univ Wisconsin Madison, Dept Ind & Syst Engn, Madison, WI USA
[2] TRACLabs, Webster, TX USA
[3] Univ Wisconsin Madison, Dept Ind & Syst Engn, Madison, WI USA
[4] Univ Wisconsin Madison, Dept Ind & Syst Engn, 1513 Univ Ave, Madison, WI 53706 USA
Keywords
Trusting automation; trust measurement; machine learning; model visualization and explainability; human-AI-robot teaming; AUTOMATION; VOICE; DEPENDENCE; FEATURES; SUPPORT
DOI
10.1177/00187208231166624
Chinese Library Classification (CLC) number
B84 [Psychology]; C [Social Sciences, General]; Q98 [Anthropology]
Discipline classification code
03; 0303; 030303; 04; 0402
Abstract
Objective: The objective of this study was to estimate trust from conversations using both lexical and acoustic data.
Background: As NASA moves to long-duration space exploration operations, the increasing need for cooperation between humans and virtual agents requires real-time trust estimation by virtual agents. Measuring trust through conversation is a novel and unintrusive approach.
Method: A 2 (reliability) × 2 (cycles) × 3 (events) within-subject study with habitat system maintenance was designed to elicit various levels of trust in a conversational agent. Participants had trust-related conversations with the conversational agent at the end of each decision-making task. To estimate trust, subjective trust ratings were predicted using machine learning models trained on three types of conversational features (i.e., lexical, acoustic, and combined). After training, model explanation was performed using variable importance and partial dependence plots.
Results: Results showed that a random forest algorithm, trained using the combined lexical and acoustic features, predicted trust in the conversational agent most accurately (adjusted R² = 0.71). The most important predictors were a combination of lexical and acoustic cues: average sentiment considering valence shifters, the mean of formants, and Mel-frequency cepstral coefficients (MFCC). These conversational features were identified as partial mediators predicting people's trust.
Conclusion: Precise trust estimation from conversation requires lexical cues and acoustic cues.
Application: These results showed the possibility of using conversational data to measure trust, and potentially other dynamic mental states, unobtrusively and dynamically.
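Note on the reported metric: the abstract summarizes model accuracy with an adjusted R². A minimal sketch of the standard textbook definition follows. The function names and the sample values are hypothetical illustrations. The record does not give the study's sample size or number of predictors, so this does not reproduce the reported 0.71.

```python
def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def adjusted_r_squared(r2, n_samples, n_predictors):
    """Adjusted R^2 = 1 - (1 - R^2) * (n - 1) / (n - p - 1).

    Penalizes R^2 for the number of predictors p relative to
    the number of observations n.
    """
    dof = n_samples - n_predictors - 1
    if dof <= 0:
        raise ValueError("need n_samples > n_predictors + 1")
    return 1.0 - (1.0 - r2) * (n_samples - 1) / dof


# Hypothetical values, chosen only to illustrate the arithmetic.
example = adjusted_r_squared(0.75, n_samples=11, n_predictors=2)
```

Because the penalty grows with the number of predictors, a model using the combined lexical and acoustic feature set is not automatically favored over a single-feature-type model on this metric.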
Pages: 1724-1741
Page count: 18