Prediction of Various Backchannel Utterances Based on Multimodal Information

被引:0
|
作者
Onishi, Toshiki [1 ]
Azuma, Naoki [1 ]
Kinoshita, Shunichi [1 ]
Ishii, Ryo [2 ]
Fukayama, Atsushi [2 ]
Nakamura, Takao [2 ]
Miyata, Akihiro [1 ]
机构
[1] Nihon Univ, Tokyo, Japan
[2] NTT Corp, Yokohama, Kanagawa, Japan
来源
PROCEEDINGS OF THE 23RD ACM INTERNATIONAL CONFERENCE ON INTELLIGENT VIRTUAL AGENTS, IVA 2023 | 2023年
关键词
multimodal interaction; communication; backchannel; TURN-TAKING; JAPANESE; FEATURES; ENGLISH;
D O I
10.1145/3570945.3607298
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The listener's backchannels are an important part of dialogues. With appropriate backchannels, people are able to smoothly promote dialogues. Thus, backchannels are considered to be important in dialogues between not only humans but also humans and agents. Progress has been made in studying dialogue agents that perform natural affable dialogue. However, we have not clarified whether the listener's various backchannel types are predictable using the speaker's multimodal information. In this paper, we attempt to predict a listener's various backchannel types on the basis of the speaker's multimodal information in dialogues. First, we construct a dialogue corpus that consists of multimodal information of a speaker's utterances and a listener's backchannels. Second, we construct machine learning models to predict a listener's various backchannel types on the basis of a speaker's multimodal information. Our results suggest that our model was able to predict a listener's various backchannel types on the basis of a speaker's multimodal information.
引用
收藏
页数:4
相关论文
共 50 条
  • [41] Emergency Recognition System based on Multimodal Information
    Kim, Y. -U.
    Kang, S. -K.
    So, I. -M.
    Han, D. -K.
    Lee, S. -S.
    Lee, Y. -J.
    Jung, S. -T.
    2008 30TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-8, 2008, : 4342 - 4345
  • [42] Multimodal Data Fusion Based on Mutual Information
    Bramon, Roger
    Boada, Imma
    Bardera, Anton
    Rodriguez, Joaquim
    Feixas, Miquel
    Puig, Josep
    Sbert, Mateu
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2012, 18 (09) : 1574 - 1587
  • [43] TERRITORY OF INFORMATION IN ENGLISH AND JAPANESE AND PSYCHOLOGICAL UTTERANCES
    KAMIO, A
    JOURNAL OF PRAGMATICS, 1995, 24 (03) : 235 - 264
  • [44] PREDICTION OF VOCAL-TRACT SHAPES IN UTTERANCES
    LADEFOGED, P
    LINDAU, M
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 64 : S41 - S41
  • [45] Information Structure in Romanian Utterances with Contrast Relations
    Jitca, Doina
    2015 INTERNATIONAL CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2015,
  • [46] An information dynamic: technologies for the reproduction of written utterances
    Warner, J
    ASLIB PROCEEDINGS, 2005, 57 (05): : 412 - 423
  • [47] Modelling of multi-path transmission system of various priority multimodal information
    Ryndin, Artem
    Pakulova, Ekaterina
    Basov, Oleg
    Veselov, Gennady
    2020 IEEE 14TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT2020), 2020,
  • [48] A computational method for drug sensitivity prediction of cancer cell lines based on various molecular information
    Moughari, Fatemeh Ahmadi
    Eslahchi, Changiz
    PLOS ONE, 2021, 16 (04):
  • [49] Graph-based Group Modelling for Backchannel Detection
    Sharma, Garima
    Stefanov, Kalin
    Dhall, Abhinav
    Cai, Jianfei
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 7190 - 7194
  • [50] Solar Radio Burst Prediction Based on a Multimodal Model
    Wang, Y. H.
    Feng, S. W.
    Du, Q. F.
    Zhong, Y. Q.
    Wang, J.
    Chen, J. Y.
    Yang, X.
    Zhou, Y.
    SOLAR PHYSICS, 2024, 299 (04)