Investigating Linguistic and Semantic Features for Turn-Taking Prediction in Open-Domain Human-Computer Conversation

被引:7
|
作者
Razavi, S. Zahra [1 ]
Kane, Benjamin [1 ]
Schubert, Lenhart K. [1 ]
机构
[1] Univ Rochester, Rochester, NY 14627 USA
来源
关键词
turn-taking; human-computer conversation;
D O I
10.21437/Interspeech.2019-3152
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
In this paper we address the problem of turn-taking prediction in open-ended communication between humans and dialogue agents. In a non-task-oriented interaction with dialogue agents, user inputs are apt to be grammatically and lexically diverse, and at times quite lengthy, with many pauses; all of this makes it harder for the system to decide when to jump in. As a result recent turn-taking predictors designed for specific tasks or for human-human interactions will scarcely be applicable. In this paper we focus primarily on the predictive potential of linguistic features, including lexical, syntactic and semantic features, as well as timing features, whereas past work has typically placed more emphasis on prosodic features, sometimes supplemented with non-verbal behaviors such as gaze and head movements. The basis for our study is a corpus of 15 "friendly" dialogues between humans and a (Wizard-of-Oz enabled) virtual dialogue agent, annotated for pause times and types. The model of turn-taking obtained by supervised learning predicts turn-taking points with increasing accuracy using only prosodic features, only timing and speech rate features, only lexical and syntactic features, and achieves state-of-the art performance with a mixture-of-experts model combining these features along with a semantic criterion.
引用
收藏
页码:4140 / 4144
页数:5
相关论文
共 2 条
  • [1] Investigating Speech Features for Continuous Turn-Taking Prediction Using LSTMs
    Roddy, Matthew
    Skantze, Gabriel
    Harte, Naomi
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 586 - 590
  • [2] THERE WAS A LONG PAUSE - INFLUENCING TURN-TAKING BEHAVIOR IN HUMAN-HUMAN AND HUMAN-COMPUTER SPOKEN DIALOGUES
    JOHNSTONE, A
    BERRY, U
    NGUYEN, T
    ASPER, A
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 1995, 42 (04) : 383 - 411