ANALYSIS AND MODELING OF NEXT SPEAKING START TIMING BASED ON GAZE BEHAVIOR IN MULTI-PARTY MEETINGS

被引:0
|
作者
Ishii, Ryo [1 ]
Otsuka, Kazuhiro [1 ]
Kumano, Shiro [1 ]
Yamato, Junji [1 ]
机构
[1] NTT Corp, NTT Commun Sci Labs, Tokyo, Tokyo, Japan
关键词
Speaking timing; gaze transition pattern; prediction model; multi-party meetings; mutual gaze; TURN-TAKING;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
To realize a conversational interface where an agent system can smoothly communicate with multiple persons, it is imperative to know how the start timing of speaking is decided. In this research, we demonstrate a relationship between gaze transition patterns and the start timing of next speaking against the end of the last speaking in multi-party meetings. Then, we construct a prediction model for the start timing using gaze transition patterns near the end of an utterance. An analysis of data collected from natural multi-party meetings reveals a strong relationship between gaze transition patterns of the speaker, next speaker, and listener and the start timing of the next speaker. On the basis of the results, we used gaze transition patterns of the speaker, next speaker, and listener and mutual gaze as variables, and devised several prediction models. A model using all features performed the best and was able to predict the start timing well.
引用
收藏
页数:5
相关论文
共 29 条
  • [1] Predicting Next Speaker and Timing from Gaze Transition Patterns in Multi-Party Meetings
    Ishii, Ryo
    Otsuka, Kazuhiro
    Kumano, Shiro
    Matsuda, Masafumi
    Yamato, Junji
    ICMI'13: PROCEEDINGS OF THE 2013 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2013, : 79 - 86
  • [2] Multimodal Fusion using Respiration and Gaze for Predicting Next Speaker in Multi-Party Meetings
    Ishii, Ryo
    Kumano, Shiro
    Otsuka, Kazuhiro
    ICMI'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2015, : 99 - 106
  • [3] Prediction of Next-Utterance Timing using Head Movement in Multi-Party Meetings
    Ishii, Ryo
    Kumano, Shiro
    Otsuka, Kazuhiro
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON HUMAN AGENT INTERACTION (HAI'17), 2017, : 181 - 187
  • [4] PREDICTING NEXT SPEAKER BASED ON HEAD MOVEMENT IN MULTI-PARTY MEETINGS
    Ishii, Ryo
    Kumano, Shiro
    Otsuka, Kazuhiro
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 2319 - 2323
  • [5] Predicting Who Will Be the Next Speaker and When in Multi-party Meetings
    Ishii, Ryo
    Otsuka, Kazuhiro
    Kumano, Shiro
    Yamato, Junji
    NTT Technical Review, 2015, 13 (07):
  • [6] Gaze modeling in multi-party dialogues and extraversion expression through gaze aversion control
    Shintani, Taiken
    Ishi, Carlos Toshinori
    Ishiguro, Hiroshi
    ADVANCED ROBOTICS, 2024, 38 (19-20) : 1470 - 1485
  • [7] Analysis of Role-Based Gaze Behaviors and Gaze Aversions, and Implementation of Robot's Gaze Control for Multi-party Dialogue
    Shintani, Taiken
    Ishi, Carlos T.
    Ishiguro, Hiroshi
    PROCEEDINGS OF THE 9TH INTERNATIONAL USER MODELING, ADAPTATION AND PERSONALIZATION HUMAN-AGENT INTERACTION, HAI 2021, 2021, : 332 - 336
  • [8] Analyzing Mouth-Opening Transition Pattern for Predicting Next Speaker in Multi-party Meetings
    Ishii, Ryo
    Kumano, Shiro
    Otsuka, Kazuhiro
    ICMI'16: PROCEEDINGS OF THE 18TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2016, : 209 - 216
  • [9] Modeling and Analysis of Multi-party Fair Exchange Protocols
    Wang Xueming
    Xiang, Li
    2007 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-15, 2007, : 2246 - 2250
  • [10] Linear Discourse Segmentation of Multi-Party Meetings Based on Local and Global Information
    Bokaei, Mohammad Hadi
    Sameti, Hossein
    Liu, Yang
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (11) : 1879 - 1891