Prediction of Various Backchannel Utterances Based on Multimodal Information

Cited by: 0
Authors
Onishi, Toshiki [1]
Azuma, Naoki [1]
Kinoshita, Shunichi [1]
Ishii, Ryo [2]
Fukayama, Atsushi [2]
Nakamura, Takao [2]
Miyata, Akihiro [1]
Affiliations
[1] Nihon Univ, Tokyo, Japan
[2] NTT Corp, Yokohama, Kanagawa, Japan
Source
PROCEEDINGS OF THE 23RD ACM INTERNATIONAL CONFERENCE ON INTELLIGENT VIRTUAL AGENTS, IVA 2023 | 2023
Keywords
multimodal interaction; communication; backchannel; TURN-TAKING; JAPANESE; FEATURES; ENGLISH;
DOI
10.1145/3570945.3607298
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The listener's backchannels are an important part of dialogues. With appropriate backchannels, people are able to smoothly promote dialogues. Thus, backchannels are considered to be important in dialogues between not only humans but also humans and agents. Progress has been made in studying dialogue agents that perform natural affable dialogue. However, we have not clarified whether the listener's various backchannel types are predictable using the speaker's multimodal information. In this paper, we attempt to predict a listener's various backchannel types on the basis of the speaker's multimodal information in dialogues. First, we construct a dialogue corpus that consists of multimodal information of a speaker's utterances and a listener's backchannels. Second, we construct machine learning models to predict a listener's various backchannel types on the basis of a speaker's multimodal information. Our results suggest that our model was able to predict a listener's various backchannel types on the basis of a speaker's multimodal information.
Pages: 4
Related papers
50 in total
  • [1] A rule-based backchannel prediction model using pitch and pause information
    Truong, Khiet P.
    Poppe, Ronald
    Heylen, Dirk
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 3058 - 3061
  • [2] Multimodal and Multitask Approach to Listener's Backchannel Prediction: Can Prediction of Turn-changing and Turn-management Willingness Improve Backchannel Modeling?
    Ishii, Ryo
    Ren, Xutong
    Muszynski, Michal
    Morency, Louis-Philippe
    PROCEEDINGS OF THE 21ST ACM INTERNATIONAL CONFERENCE ON INTELLIGENT VIRTUAL AGENTS (IVA), 2021, : 131 - 138
  • [3] Backchannel prediction, based on who, when and what
    Park, Yo-Han
    Liermann, Wencke
    Choi, Yong-Seok
    Kim, Seung Hi
    Bang, Jeong-Uk
    Yung, Seung
    Lee, Kong Jo
    INTERSPEECH 2024, 2024, : 3570 - 3574
  • [4] Fast Multimodal Trajectory Prediction for Vehicles Based on Multimodal Information Fusion
    Ge, Likun
    Wang, Shuting
    Wang, Guangqi
    ACTUATORS, 2025, 14 (03)
  • [5] A Study of Prediction of Listener's Comprehension Based on Multimodal Information
    Kinoshita, Shunichi
    Onishi, Toshiki
    Azuma, Naoki
    Ishii, Ryo
    Fukayama, Atsushi
    Nakamura, Takao
    Miyata, Akihiro
    PROCEEDINGS OF THE 23RD ACM INTERNATIONAL CONFERENCE ON INTELLIGENT VIRTUAL AGENTS, IVA 2023, 2023,
  • [6] Synthesizing multimodal utterances for conversational agents
    Kopp, S
    Wachsmuth, P
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2004, 15 (01) : 39 - 52
  • [7] Microbiome-based disease prediction with multimodal variational information bottlenecks
    Grazioli, Filippo
    Siarheyeu, Raman
    Alqassem, Israa
    Henschel, Andreas
    Pileggi, Giampaolo
    Meiser, Andrea
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (04)
  • [8] Multimodal vehicle trajectory prediction method based on visual perception information
    Zhang, Yong
    Liu, Weidong
    Zhang, Zhong
    Hou, Zhenhua
    Wu, Xiaojian
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2025,
  • [9] Predicting Backchannel Signaling in Child-Caregiver Multimodal Conversations
    Liu, Jing
    Nikolaus, Mitja
    Bodur, Kubra
    Fourtassi, Abdellah
    COMPANION PUBLICATION OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2022, 2022, : 196 - 200
  • [10] Effect of backchannel utterances on facilitating idea-generation in Japanese think-aloud tasks
    Sannomiya, M
    Kawaguchi, A
    Yamakawa, I
    Morita, Y
    PSYCHOLOGICAL REPORTS, 2003, 93 (01) : 41 - 46