ANALYSIS AND MODELING OF NEXT SPEAKING START TIMING BASED ON GAZE BEHAVIOR IN MULTI-PARTY MEETINGS

Cited by: 0

Authors:

Ishii, Ryo [1]

Otsuka, Kazuhiro [1]

Kumano, Shiro [1]

Yamato, Junji [1]

Affiliations:

[1] NTT Corp, NTT Commun Sci Labs, Tokyo, Tokyo, Japan

Source:

2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014

Keywords:

Speaking timing; gaze transition pattern; prediction model; multi-party meetings; mutual gaze; turn-taking

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

To realize a conversational interface in which an agent system communicates smoothly with multiple people, it is essential to understand how the timing of the start of speaking is determined. In this research, we demonstrate a relationship between gaze transition patterns and the start timing of the next utterance relative to the end of the preceding utterance in multi-party meetings. We then construct a prediction model for the start timing using gaze transition patterns near the end of an utterance. An analysis of data collected from natural multi-party meetings reveals a strong relationship between the gaze transition patterns of the speaker, next speaker, and listener and the start timing of the next speaker. On the basis of these results, we used the gaze transition patterns of the speaker, next speaker, and listener, together with mutual gaze, as variables and devised several prediction models. The model using all features performed best and predicted the start timing well.


Pages: 5
